Source author record

Xin Wei

Xin Wei appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher. Unclaimed source record.

Computer Vision, Artificial Intelligence, cond-mat.mes-hall, Computation and Language, cond-mat.mtrl-sci, cs.CY, eess.SP, Information Retrieval, Machine Learning, Multiagent Systems, Multimedia, physics.atom-ph, physics.optics, quant-ph, Robotics, Tissues and Organs

Catalog footprint

What is connected

13 works

16 topics

4 close collaborators

preprint, 2026, arXiv

A 10 Megahertz Spatial Light Modulator

Rapid and programmable shaping of light fields is central to modern microscopy, display technologies, optical communications and sensing, quantum engineering, and quantum information processing. Current wavefront shaping technologies face a fundamental dichotomy: spatial light modulators (SLMs) offer high pixel count but suffer from low refresh rates, while acousto-optic deflectors (AODs) provide moderate speed with restricted optical beam geometries. Though recent advances in photonic integrated circuits achieve fast switching, there is currently no tool that provides MHz-rate, continuous-motion, and arbitrarily reconfigurable control over a set of diffraction-limited spots. Here we introduce a new class of spatial light modulator that provides both 2D pixel geometry and high speed. The device operates by encoding spatial information in frequency bins via a broadband optical phase modulator, and decoding them via a first-of-its-kind, high-resolution 2D spectrometer. The spectrometer, based on an architecture that we call the Re-Imaging Phased Array (RIPA), achieves its sensitivity through long path lengths, enabled by intra-spectrometer re-imaging lens-guides. We demonstrate site-resolved optical pulsing with a 44(1) ns rise time, corresponding to frame rates exceeding 10 million frames per second, as well as arbitrary, reconfigurable 2D addressing and multi-site operations, including asynchronous, independent beam motion, splitting, and recombination. Leveraging these tools opens new horizons in rapid optical manipulation of matter across science, from fast, scalable control that approaches the inertial and radiation limits of atoms in quantum processors, to dynamically programmable, microsecond-resolved illumination in microscopy and neuro-biological imaging.

preprint, 2026, arXiv

Integrating Movable Antennas and Intelligent Reflecting Surfaces (MA-IRS): Fundamentals, Practical Solutions, and ISAC

Movable antennas (MAs) and intelligent reflecting surfaces (IRSs) enable active antenna repositioning and passive phase-shift tuning for channel reconfiguration, respectively. Integrating MAs and IRSs boosts spatial degrees of freedom, significantly enhancing wireless network capacity, coverage, and reliability. In this article, we first present the fundamentals of MA-IRS integration: clarifying the key design issues, revealing the performance gains, and identifying the conditions under which MA-IRS synergy persists. Then, we examine practical challenges and propose pragmatic design solutions, including optimization schemes, hardware architectures, deployment strategies, and robust designs for hardware impairments and mobility management. In addition, we highlight how MA-IRS architectures uniquely support advanced integrated sensing and communication, enhancing sensing performance and dual-functional flexibility. Overall, MA-IRS integration emerges as a compelling approach toward next-generation reconfigurable wireless systems.

preprint, 2022, arXiv

Decreased serum vitamin D level as a prognostic marker in patients with COVID-19

Background: The coronavirus disease 2019 (COVID-19) pandemic, which is caused by severe acute respiratory syndrome coronavirus 2, still causes localized outbreaks and has resulted in a high rate of infection and severe disease in older patients with comorbidities. The vitamin D status of the population has been found to be an important factor that could influence the outcome of COVID-19. However, whether vitamin D can lessen the symptoms or severity of COVID-19 remains controversial. Methods: A total of 719 patients with confirmed COVID-19 were enrolled retrospectively in this study from April 13 to June 6, 2022 at Shanghai Fourth People's Hospital. The circulating levels of 25(OH)D3, inflammatory factors, and clinical parameters were assayed. Time to viral RNA clearance (TVRC), classification and prognosis of COVID-19 were used to evaluate the severity of COVID-19 infection. Results: The median age was 76 years (interquartile range, IQR, 64.5-84.6), 44.1% of patients were male, and the TVRC was 11 days (IQR, 7-16) in this population. The median level of 25(OH)D3 was 27.15 (IQR, 19.31-38.89) nmol/L. Patients with lower serum 25(OH)D3 had prolonged time to viral clearance, a more pronounced inflammatory response, more severe respiratory symptoms and higher risks of impaired hepatic and renal function. Multiple regression analyses revealed that serum 25(OH)D3 level was independently and negatively associated with TVRC. ROC curve analysis showed that the serum vitamin D level could significantly predict the severity classification and prognosis of COVID-19. Conclusions: Serum 25(OH)D3 level is independently associated with the severity of COVID-19 in elderly patients, and it could be used as a predictor of the severity of COVID-19. In addition, supplementation with vitamin D might provide beneficial effects in older patients with COVID-19.

preprint, 2022, arXiv

GREN: Graph-Regularized Embedding Network for Weakly-Supervised Disease Localization in X-ray Images

Locating diseases in chest X-ray images with few careful annotations saves considerable human effort. Recent works have approached this task with innovative weakly-supervised algorithms such as multi-instance learning (MIL) and class activation maps (CAM); however, these methods often yield inaccurate or incomplete regions. One of the reasons is the neglect of the pathological implications hidden in the relationship across anatomical regions within each image and the relationship across images. In this paper, we argue that the cross-region and cross-image relationship, as contextual and compensating information, is vital to obtain more consistent and integral regions. To model the relationship, we propose the Graph Regularized Embedding Network (GREN), which leverages the intra-image and inter-image information to locate diseases on chest X-ray images. GREN uses a pre-trained U-Net to segment the lung lobes, and then models the intra-image relationship between the lung lobes using an intra-image graph to compare different regions. Meanwhile, the relationship between in-batch images is modeled by an inter-image graph to compare multiple images. This process mimics the training and decision-making process of a radiologist: comparing multiple regions and images for diagnosis. In order for the deep embedding layers of the neural network to retain structural information (important in the localization task), we use hash coding and Hamming distance to compute the graphs, which are used as regularizers to facilitate training. By this means, our approach achieves the state-of-the-art result on the NIH chest X-ray dataset for weakly-supervised disease localization. Our code is accessible online (https://github.com/qibaolian/GREN).
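The abstract mentions computing graphs from hash codes and Hamming distances. As a rough illustration only, the sketch below binarizes embeddings by sign and builds a pairwise similarity matrix from normalized Hamming distance. The sign-based hashing, the similarity definition, and the names `hash_code`, `hamming_distance` and `hamming_graph` are assumptions made for this example; the paper's actual hashing scheme and regularizer may differ.

```python
from typing import List


def hash_code(embedding: List[float]) -> List[int]:
    """Binarize an embedding by sign (assumed scheme): 1 if positive, else 0."""
    return [1 if value > 0 else 0 for value in embedding]


def hamming_distance(a: List[int], b: List[int]) -> int:
    """Count the positions at which two equal-length binary codes differ."""
    if len(a) != len(b):
        raise ValueError("codes must have the same length")
    return sum(x != y for x, y in zip(a, b))


def hamming_graph(embeddings: List[List[float]]) -> List[List[float]]:
    """Return a dense adjacency matrix with weight 1 - (Hamming distance / bits)."""
    codes = [hash_code(e) for e in embeddings]
    n_bits = len(codes[0])
    return [
        [1.0 - hamming_distance(ci, cj) / n_bits for cj in codes]
        for ci in codes
    ]


# Three hypothetical region embeddings (made-up values for illustration).
regions = [
    [0.9, -0.2, 0.4, -0.7],
    [0.8, -0.1, -0.3, -0.5],
    [-0.6, 0.3, -0.2, 0.9],
]
graph = hamming_graph(regions)
```

Here the first two regions differ in one bit out of four, so their edge weight is 0.75, while the first and third differ in every bit and get weight 0.0.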

preprint, 2022, arXiv

Learning High-quality Proposals for Acne Detection

Acne detection is crucial for interpretative diagnosis and precise treatment of skin disease. The arbitrary boundary and small size of acne lesions lead to a significant number of poor-quality proposals in two-stage detection. In this paper, we propose a novel head structure for the Region Proposal Network to improve the proposals' quality in two ways. First, a Spatial Aware Double Head (SADH) structure is proposed to disentangle the representation learning for classification and localization from two different spatial perspectives. The proposed SADH ensures a steeper classification confidence gradient and suppresses the proposals having low intersection-over-union (IoU) with the matched ground truth. Then, we propose a Normalized Wasserstein Distance prediction branch to improve the correlation between the proposals' classification scores and IoUs. In addition, to facilitate further research on acne detection, we construct a new dataset named AcneSCU, with high-resolution images, precise annotations, and fine-grained lesion categories. Extensive experiments are conducted on both AcneSCU and the public dataset ACNE04, and the results demonstrate that the proposed method improves the proposals' quality, consistently outperforming state-of-the-art approaches. Code and the collected dataset are available at https://github.com/pingguokiller/acnedetection.
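To make the two box-similarity measures named in the abstract concrete, the sketch below computes IoU and a Normalized Wasserstein Distance (NWD) for axis-aligned boxes. It uses the commonly cited formulation in which each box (cx, cy, w, h) is modeled as a 2D Gaussian and the distance is mapped through an exponential with a constant C. The value `c=8.0`, the box values and the function names are assumptions for illustration; this is not the paper's prediction branch.

```python
import math
from typing import Tuple

Box = Tuple[float, float, float, float]  # (cx, cy, w, h)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes in centre format."""
    ax1, ay1, ax2, ay2 = a[0] - a[2] / 2, a[1] - a[3] / 2, a[0] + a[2] / 2, a[1] + a[3] / 2
    bx1, by1, bx2, by2 = b[0] - b[2] / 2, b[1] - b[3] / 2, b[0] + b[2] / 2, b[1] + b[3] / 2
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0


def nwd(a: Box, b: Box, c: float = 8.0) -> float:
    """NWD = exp(-W2 / c), with W2 the 2-Wasserstein distance between box Gaussians.

    The constant c is dataset-dependent; 8.0 is an arbitrary illustrative value.
    """
    w2 = math.sqrt(
        (a[0] - b[0]) ** 2
        + (a[1] - b[1]) ** 2
        + ((a[2] - b[2]) / 2) ** 2
        + ((a[3] - b[3]) / 2) ** 2
    )
    return math.exp(-w2 / c)


# A small hypothetical lesion box and two proposals shifted to the right.
gt = (10.0, 10.0, 4.0, 4.0)
near = (12.0, 10.0, 4.0, 4.0)
far = (16.0, 10.0, 4.0, 4.0)
```

For small boxes, IoU drops to zero as soon as the boxes stop overlapping (`iou(gt, far)` is 0.0), whereas NWD still decreases smoothly with distance, which is one reason it is often preferred for small objects.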

preprint, 2022, arXiv

Modeling Temporal-Modal Entity Graph for Procedural Multimodal Machine Comprehension

Procedural Multimodal Documents (PMDs) organize textual instructions and corresponding images step by step. Comprehending PMDs and inducing their representations for downstream reasoning tasks is designated as Procedural MultiModal Machine Comprehension (M3C). In this study, we approach Procedural M3C at a fine-grained level (compared with existing explorations at a document or sentence level), namely the entity level. We model entities in both their temporal and cross-modal relations and propose a novel Temporal-Modal Entity Graph (TMEG). Specifically, a graph structure is formulated to capture textual and visual entities and trace their temporal-modal evolution. In addition, a graph aggregation module is introduced to conduct graph encoding and reasoning. Comprehensive experiments across three Procedural M3C tasks are conducted on a traditional dataset RecipeQA and our new dataset CraftQA, which can better evaluate the generalization of TMEG.

preprint, 2022, arXiv

Nature of novel moiré exciton states in WSe$_2$/WS$_2$ heterobilayers

Moiré patterns of transition metal dichalcogenide (TMD) heterobilayers have proven to be an ideal platform to host unusual correlated electronic phases, emerging magnetism, and correlated exciton physics. While the existence of novel moiré excitonic states is established through optical measurements, the microscopic nature of these states is still poorly understood, often relying on empirically fit models. Here, combining large-scale first-principles GW-BSE calculations and micro-reflection spectroscopy, we identify the nature of the exciton resonances in WSe$_2$/WS$_2$ moiré superlattices, discovering a surprisingly rich set of moiré excitons that cannot be captured even qualitatively by prevailing continuum models. Our calculations reveal moiré excitons with distinct characters, including modulated Wannier excitons and previously unidentified intralayer charge-transfer excitons. Signatures of these distinct excitonic characters are confirmed experimentally via the unique carrier-density and magnetic-field dependences of different moiré exciton resonances. Our study highlights the highly non-trivial exciton states that can emerge in TMD moiré superlattices, and suggests novel ways of tuning many-body physics in moiré systems by engineering excited states with specific spatial characters.

preprint, 2022, arXiv

Spiral Contrastive Learning: An Efficient 3D Representation Learning Method for Unannotated CT Lesions

Computed tomography (CT) samples with pathological annotations are difficult to obtain. As a result, computer-aided diagnosis (CAD) algorithms are trained on small datasets (e.g., LIDC-IDRI with 1,018 samples), limiting their accuracy and reliability. In the past five years, several works have targeted unsupervised representations of CT lesions via two-dimensional (2D) and three-dimensional (3D) self-supervised learning (SSL) algorithms. The 2D algorithms have difficulty capturing 3D information, and existing 3D algorithms are computationally heavy. Lightweight 3D SSL remains a frontier to explore. In this paper, we propose spiral contrastive learning (SCL), which yields 3D representations in a computationally efficient manner. SCL first transforms 3D lesions to the 2D plane using an information-preserving spiral transformation, and then learns transformation-invariant features using 2D contrastive learning. For the augmentation, we consider natural image augmentations and medical image augmentations. We evaluate SCL by training a classification head upon the embedding layer. Experimental results show that SCL achieves state-of-the-art accuracy on LIDC-IDRI (89.72%), LNDb (82.09%) and TianChi (90.16%) for unsupervised representation learning. With 10% annotated data for fine-tuning, the performance of SCL is comparable to that of supervised learning algorithms (85.75% vs. 85.03% on LIDC-IDRI, 78.20% vs. 73.44% on LNDb and 87.85% vs. 83.34% on TianChi, respectively). Meanwhile, SCL reduces the computational effort by 66.98% compared to other 3D SSL algorithms, demonstrating the efficiency of the proposed method in unsupervised pre-training.

preprint, 2021, arXiv

A Synthetic Prediction Market for Estimating Confidence in Published Work

Explainably estimating confidence in published scholarly work offers an opportunity for faster and more robust scientific progress. We develop a synthetic prediction market to assess the credibility of published claims in the social and behavioral sciences literature. We demonstrate our system and detail our findings using a collection of known replication projects. We suggest that this work lays the foundation for a research agenda that creatively uses AI for peer review.

preprint, 2021, arXiv

Ground-SLAM: Ground Constrained LiDAR SLAM for Structured Multi-Floor Environments

This paper proposes a 3D LiDAR SLAM algorithm named Ground-SLAM, which exploits the ground in structured multi-floor environments to compress the pose drift mainly caused by LiDAR measurement bias. Ground-SLAM is developed based on the well-known pose graph optimization framework. In the front-end, motion estimation is conducted using LiDAR Odometry (LO) with a novel sensor-centric sliding map, which is maintained by filtering out expired features based on a model of error propagation. At each key-frame, the sliding map is recorded as a local map. The nearby ground is extracted and modelled as an infinite planar landmark in the form of Closest Point (CP) parameterization. Then, ground planes observed at different key-frames are associated, and the ground constraints are fused into the pose graph optimization framework to compress the pose drift of LO. Finally, loop-closure detection is carried out, and the residual error is jointly minimized, which can lead to a globally consistent map. Experimental results demonstrate the superior accuracy of the proposed approach.
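The Closest Point (CP) parameterization mentioned above represents a plane by the single 3D point on the plane nearest to the origin, that is, the unit normal scaled by the plane's distance from the origin. The sketch below shows the conversion in both directions and a signed point-to-plane distance. It is a minimal illustration under the plane convention n·x = d with d > 0; the function names and the example ground plane are hypothetical and not taken from the Ground-SLAM implementation.

```python
import math
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]


def to_closest_point(normal: Sequence[float], distance: float) -> Vec3:
    """Convert a plane n.x = d (d > 0) to its CP vector d * n / |n|."""
    norm = math.sqrt(sum(c * c for c in normal))
    if norm == 0.0:
        raise ValueError("normal must be non-zero")
    return tuple(distance * c / norm for c in normal)


def from_closest_point(cp: Sequence[float]) -> Tuple[Vec3, float]:
    """Recover (unit normal, distance) from a CP vector.

    The representation is singular for planes through the origin (d = 0).
    """
    d = math.sqrt(sum(c * c for c in cp))
    if d == 0.0:
        raise ValueError("CP is undefined for a plane through the origin")
    return tuple(c / d for c in cp), d


def signed_distance(cp: Sequence[float], point: Sequence[float]) -> float:
    """Signed distance from a point to the plane encoded by the CP vector."""
    normal, d = from_closest_point(cp)
    return sum(n * p for n, p in zip(normal, point)) - d


# Hypothetical ground plane 1.5 m below a sensor at the origin (z axis up).
ground_cp = to_closest_point((0.0, 0.0, -1.0), 1.5)
```

A point lying on the floor, such as `(2.0, 3.0, -1.5)`, has zero signed distance to `ground_cp`. The three-parameter form is minimal, which is convenient in optimization, at the cost of the singularity at d = 0.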

preprint, 2020, arXiv

Object-based Illumination Estimation with Rendering-aware Neural Networks

We present a scheme for fast environment light estimation from the RGBD appearance of individual objects and their local image areas. Conventional inverse rendering is too computationally demanding for real-time applications, and the performance of purely learning-based techniques may be limited by the meager input data available from individual objects. To address these issues, we propose an approach that takes advantage of physical principles from inverse rendering to constrain the solution, while also utilizing neural networks to expedite the more computationally expensive portions of its processing, to increase robustness to noisy input data, and to improve temporal and spatial stability. This results in a rendering-aware system that estimates the local illumination distribution at an object with high accuracy and in real time. With the estimated lighting, virtual objects can be rendered in AR scenarios with shading that is consistent with the real scene, leading to improved realism.

preprint, 2019, arXiv

Optical detection of Mott and generalized Wigner crystal states in WSe2/WS2 moiré superlattices

Moiré superlattices are emerging as a new route for engineering strongly correlated electronic states in two-dimensional van der Waals heterostructures, as recently demonstrated in the correlated insulating and superconducting states in magic-angle twisted bilayer graphene and ABC trilayer graphene/boron nitride moiré superlattices. Transition metal dichalcogenide (TMDC) moiré heterostructures provide another exciting model system to explore correlated quantum phenomena, with the addition of strong light-matter interactions and large spin-orbit coupling. Here we report the optical detection of strongly correlated phases in semiconducting WSe2/WS2 moiré superlattices. Our sensitive optical detection technique reveals a Mott insulator state at one hole per superlattice site (ν = 1), and surprising insulating phases at fractional filling factors ν = 1/3 and 2/3, which we assign to generalized Wigner crystallization on an underlying lattice. Furthermore, the unique spin-valley optical selection rules of TMDC heterostructures allow us to optically create and investigate low-energy spin excited states in the Mott insulator. We reveal an especially slow spin relaxation lifetime of many microseconds in the Mott insulating state, orders of magnitude longer than that of charge excitations. Our studies highlight novel correlated physics that can emerge in moiré superlattices beyond graphene.

preprint, 2018, arXiv

Minimum Margin Loss for Deep Face Recognition

Face recognition has achieved great progress owing to the fast development of deep neural networks in the past few years. As an important part of deep neural networks, a number of loss functions have been proposed that significantly improve the state-of-the-art methods. In this paper, we propose a new loss function called Minimum Margin Loss (MML), which aims at enlarging the margin of overly close class centre pairs so as to enhance the discriminative ability of the deep features. MML supervises the training process together with the Softmax Loss and the Centre Loss, and also compensates for the shortcomings of Softmax + Centre Loss. The experimental results on the MegaFace, LFW and YTF datasets show that the proposed method achieves state-of-the-art performance, which demonstrates the effectiveness of the proposed MML.
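The idea of penalizing class centre pairs that sit closer than a minimum margin can be sketched as a hinge term over all centre pairs. The version below is only an illustration of that idea: the use of Euclidean distance, the unweighted sum, the margin value and the name `minimum_margin_penalty` are assumptions, and the exact form used in the paper (and how it is combined with Softmax and Centre Loss) may differ.

```python
import math
from itertools import combinations
from typing import Sequence


def minimum_margin_penalty(centres: Sequence[Sequence[float]], margin: float) -> float:
    """Sum of max(0, margin - distance) over all unordered pairs of class centres.

    Pairs already separated by at least `margin` contribute nothing, so only
    overly close centres are pushed apart during training.
    """
    total = 0.0
    for a, b in combinations(centres, 2):
        total += max(0.0, margin - math.dist(a, b))
    return total


# Three hypothetical 2D class centres; the first and third are too close.
centres = [(0.0, 0.0), (3.0, 4.0), (0.0, 1.0)]
penalty = minimum_margin_penalty(centres, margin=2.0)
```

With a margin of 2.0, only the pair at distance 1.0 violates the margin, so the penalty is 1.0; the pairs at distances 5.0 and roughly 4.24 contribute zero.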
