Source author record

Sheng Zhang

Sheng Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

61works

38topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

CodeClinic: Evaluating Automation of Coding Skills for Clinical Reasoning Agents

Clinical reasoning agents based on large language models (LLMs) aim to automate tasks such as intensive care unit (ICU) monitoring and patient state tracking from electronic health records (EHRs). Existing systems typically rely on manually curated clinical tools or skills for concepts such as sepsis detection and organ failure assessment. However, maintaining these tool libraries requires substantial expert effort, while zero-shot querying or code generation often produces inefficient and unreliable reasoning chains, especially under institution-specific clinical policies. We introduce CodeClinic, a benchmark built on MIMIC-IV for evaluating whether LLM agents can synthesize and compose reusable clinical skills instead of relying on fixed toolboxes. The benchmark contains two complementary tasks: longitudinal ICU surveillance and compositional information seeking. The longitudinal setting simulates monitoring patient trajectories with structured decisions every four hours across 25 findings and eight clinical families, while the compositional setting spans 63k instances across 259 tasks in nine domains and is stratified by compositional dependency depth to evaluate increasingly complex multi-step reasoning. We further propose an offline autoformalization pipeline that converts natural-language clinical guidelines into reusable and verified Python skill libraries through iterative LLM refinement. Compared with zero-shot code generation, the resulting libraries improve consistency while reducing per-query token usage by up to 40%.

preprint2026arXiv

CSCBench: A PVC Diagnostic Benchmark for Commodity Supply Chain Reasoning

Large Language Models (LLMs) have achieved remarkable success in general benchmarks, yet their competence in commodity supply chains (CSCs) -- a domain governed by institutional rule systems and feasibility constraints -- remains under-explored. CSC decisions are shaped jointly by process stages (e.g., planning, procurement, delivery), variety-specific rules (e.g., contract specifications and delivery grades), and reasoning depth (from retrieval to multi-step analysis and decision selection). We introduce CSCBench, a 2.3K+ single-choice benchmark for CSC reasoning, instantiated through our PVC 3D Evaluation Framework (Process, Variety, and Cognition). The Process axis aligns tasks with SCOR+Enable; the Variety axis operationalizes commodity-specific rule systems under coupled material-information-financial constraints, grounded in authoritative exchange guidebooks/rulebooks and industry reports; and the Cognition axis follows Bloom's revised taxonomy. Evaluating representative LLMs under a direct prompting setting, we observe strong performance on the Process and Cognition axes but substantial degradation on the Variety axis, especially on Freight Agreements. CSCBench provides a diagnostic yardstick for measuring and improving LLM capabilities in this high-stakes domain.

preprint2026arXiv

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Test-time scaling (TTS) has become an effective approach for improving large language model performance by allocating additional computation during inference. However, existing TTS strategies are largely hand-crafted: researchers manually design reasoning patterns and tune heuristics by intuition, leaving much of the computation-allocation space unexplored. We propose an environment-driven framework, AutoTTS, that changes what researchers design: from individual TTS heuristics to environments where TTS strategies can be discovered automatically. The key to AutoTTS lies in environment construction: the discovery environment must make the control space tractable and provide cheap, frequent feedback for TTS search. As a concrete instantiation, we formulate width--depth TTS as controller synthesis over pre-collected reasoning trajectories and probe signals, where controllers decide when to branch, continue, probe, prune, or stop and can be evaluated cheaply without repeated LLM calls. We further introduce beta parameterization to make the search tractable and fine-grained execution trace feedback to improve discovery efficiency by helping the agent diagnose why a TTS program fails. Experiments on mathematical reasoning benchmarks show that the discovered strategies improve the overall accuracy--cost tradeoff over strong manually designed baselines. The discovered strategies generalize to held-out benchmarks and model scales, while the entire discovery costs only $39.9 and 160 minutes. Our data, and code will be open-source at https://github.com/zhengkid/AutoTTS.

preprint2026arXiv

Machine learning nonequilibrium phase transitions in charge-density wave insulators

Nonequilibrium electronic forces play a central role in voltage-driven phase transitions but are notoriously expensive to evaluate in dynamical simulations. Here we develop a machine learning framework for adiabatic lattice dynamics coupled to nonequilibrium electrons, and demonstrate it for a gating induced insulator to metal transition out of a charge density wave state in the Holstein model. Although exact electronic forces can be obtained from nonequilibrium Green's function (NEGF) calculations, their high computational cost renders long time dynamical simulations prohibitively expensive. By exploiting the locality of the electronic response, we train a neural network to directly predict instantaneous local electronic forces from the lattice configuration, thereby bypassing repeated NEGF calculations during time evolution. When combined with Brownian dynamics, the resulting machine learning force field quantitatively reproduces domain wall motion and nonequilibrium phase transition dynamics obtained from full NEGF simulations, while achieving orders of magnitude gains in computational efficiency. Our results establish direct force learning as an efficient and accurate approach for simulating nonequilibrium lattice dynamics in driven quantum materials.

preprint2026arXiv

Majorana Zero Modes and Topological Nature in Bi2Ta3S6-family Superconductors

In this work, we report that Bi2Ta3S6-family superconductors exhibit nontrivial band topology. They possess a natural quantum-well structure consisting of alternating stacks of TaS2 and honeycomb Bi layers, which contribute superconducting and topological properties, respectively. Symmetry-based indicators $(\mathbb{Z}_4;\mathbb{Z}_{2}\mathbb{Z}_{2}\mathbb{Z}_{2})=(2;000)$ reveal that the topological nature arises entirely from the Bi layers, which belong to a quantum spin Hall phase characterized by a $p_x-p_y$ model on a honeycomb lattice. The topological zigzag (ZZ) and armchair (AC) edge states are obtained. Using VASP2KP, the in-plane $g$ factors of these topological edge states are computed from the ab initio calculations: $g_{x/y}^{\mathrm{ZZ}}=2.07/1.60$ and $g_{x/y}^{\mathrm{AC}}=0.50/0.06$. The strong anisotropy of the edge-state $g$ factors allows us to explore Majorana zero modes in the Bi monolayer on a superconductor, which can be obtained by exfoliation or molecular beam epitaxy. The relaxed structures of the Bi2Ta3Se6, Bi2Nb3S6 and Bi2Nb3Se6 are obtained. Their superconducting transition temperature $T_c$ are estimated based on the electron-phonon coupling and the McMillan formula. Furthermore, using the experimental superconducting gap $Δ$ and the computed $g$ factors, we obtain the phase diagram, which shows that the in-plane field $B_y>2.62\mathrm{ T}$ can generate corner Majorana zero modes in the Bi monolayer of the superconductor Bi2Ta3S6. A similar paradigm also applies to the Bi2Ta3S6 bulk with the emergence of Majorana hinge states. These natural quantum-well superconductors therefore offer ideal platforms for exploring topological superconductivity and Majorana zero modes.

preprint2026arXiv

RAGR: Review-Augmented Generative Recommendation

Sequential recommendation (SR) is traditionally formulated as next-item prediction over a chronological sequence of interacted items. Although recent generative recommendation (GR) methods introduce new machinery, such as semantic IDs, autoregressive decoding, and unified token spaces, they largely inherit the same item-only modeling assumption. We argue that this design constitutes a structural bottleneck, because user decision-making is not purely behavioral: while item interactions reveal what users choose, review feedback often explain why they choose it by exposing latent evaluative factors. Motivated by this observation, we propose Review-Augmented Generative Recommendation (RAGR), a novel GR framework that incorporates review feedback directly into the generative user sequence rather than treating reviews as auxiliary side information. Specifically, RAGR introduces a Review-Augmented User Sequence Modeling mechanism that interleaves item semantic IDs and review semantic IDs in chronological order to construct a mixed behavioral-semantic sequence, enabling review signals to participate directly in autoregressive next-token generation. To preserve the recommendation objective, we further introduce an Item-Centric Task Generation Alignment strategy based on direct preference optimization (DPO), which encourages the model to favor item tokens over review tokens at prediction positions. Experiments on three real-world datasets show that RAGR yields consistent and significant gains over strong GR backbones across all metrics. Our code and data are available at \url{https://github.com/Zhang-Yingyi/TKDE_RAGR}.

preprint2026arXiv

Test-time generative augmentation for medical image segmentation

Medical image segmentation is critical for clinical diagnosis, treatment planning, and monitoring, yet segmentation models often struggle with uncertainties stemming from occlusions, ambiguous boundaries, and variations in imaging devices. Traditional test-time augmentation (TTA) techniques typically rely on predefined geometric and photometric transformations, limiting their adaptability and effectiveness in complex medical scenarios. In this study, we introduced Test-Time Generative Augmentation (TTGA), a novel augmentation strategy specifically tailored for medical image segmentation at inference time. Different from conventional augmentation strategies that suffer from excessive randomness or limited flexibility, TTGA leverages a domain-fine-tuned generative model to produce contextually relevant and diverse augmentations tailored to the characteristics of each test image. Built upon diffusion model inversion, a masked null-text inversion method is proposed to enable region-specific augmentations during sampling. Furthermore, a dual denoising pathway is designed to balance precise identity preservation with controlled variability. We demonstrate the efficacy of our TTGA through extensive experiments across three distinct segmentation tasks spanning nine datasets. Our results consistently demonstrate that TTGA not only improves segmentation accuracy (with DSC gains ranging from 0.1% to 2.3% over the baseline) but also offers pixel-wise error estimation (with DSC gains ranging from 1.1% to 29.0% over the baseline). The source code and demonstration are available at: https://github.com/maxiao0234/TTGA.

preprint2026arXiv

Video Models Can Reason with Verifiable Rewards

Video diffusion models have made rapid progress in perceptual realism and temporal coherence, but they remain primarily optimized for plausible generation rather than verifiable reasoning. This limitation is especially pronounced in tasks where generated videos must satisfy explicit spatial, temporal, or logical constraints. Inspired by the role of reinforcement learning with verifiable rewards (RLVR) in reasoning-oriented language models, we introduce VideoRLVR, a practical recipe for optimizing video diffusion models with rule-based feedback. VideoRLVR formulates video reasoning as the generation of verifiable visual trajectories and consists of an SDE-GRPO optimization backbone, dense decomposed rewards, and an Early-Step Focus strategy for efficient training. The Early-Step Focus strategy restricts policy optimization to the early denoising phase, reducing training latency by about 40% while preserving performance. We evaluate VideoRLVR on Maze, FlowFree, and Sokoban, three procedurally generated domains with objective success criteria. Across these tasks, VideoRLVR consistently improves over supervised fine-tuning baselines, with dense decomposed rewards proving especially important in low-success-rate settings. Our RL-optimized model also outperforms the evaluated proprietary and open-source video generation models on these verifiable reasoning benchmarks and out-of-domain benchmarks. These results suggest that verifiable RL can move video models beyond perceptual imitation toward more reliable rule-consistent visual reasoning.

preprint2025arXiv

Renormalization Group Guided Tensor Network Structure Search

Tensor network structure search (TN-SS) aims to automatically discover optimal network topologies and rank configurations for efficient tensor decomposition in high-dimensional data representation. Despite recent advances, existing TN-SS methods face significant limitations in computational tractability, structure adaptivity, and optimization robustness across diverse tensor characteristics. They struggle with three key challenges: single-scale optimization missing multi-scale structures, discrete search spaces hindering smooth structure evolution, and separated structure-parameter optimization causing computational inefficiency. We propose RGTN (Renormalization Group guided Tensor Network search), a physics-inspired framework transforming TN-SS via multi-scale renormalization group flows. Unlike fixed-scale discrete search methods, RGTN uses dynamic scale-transformation for continuous structure evolution across resolutions. Its core innovation includes learnable edge gates for optimization-stage topology modification and intelligent proposals based on physical quantities like node tension measuring local stress and edge information flow quantifying connectivity importance. Starting from low-complexity coarse scales and refining to finer ones, RGTN finds compact structures while escaping local minima via scale-induced perturbations. Extensive experiments on light field data, high-order synthetic tensors, and video completion tasks show RGTN achieves state-of-the-art compression ratios and runs 4-600$\times$ faster than existing methods, validating the effectiveness of our physics-inspired approach.

preprint2024arXiv

Aircraft Landing Time Prediction with Deep Learning on Trajectory Images

Aircraft landing time (ALT) prediction is crucial for air traffic management, especially for arrival aircraft sequencing on the runway. In this study, a trajectory image-based deep learning method is proposed to predict ALTs for the aircraft entering the research airspace that covers the Terminal Maneuvering Area (TMA). Specifically, the trajectories of all airborne arrival aircraft within the temporal capture window are used to generate an image with the target aircraft trajectory labeled as red and all background aircraft trajectory labeled as blue. The trajectory images contain various information, including the aircraft position, speed, heading, relative distances, and arrival traffic flows. It enables us to use state-of-the-art deep convolution neural networks for ALT modeling. We also use real-time runway usage obtained from the trajectory data and the external information such as aircraft types and weather conditions as additional inputs. Moreover, a convolution neural network (CNN) based module is designed for automatic holding-related featurizing, which takes the trajectory images, the leading aircraft holding status, and their time and speed gap at the research airspace boundary as its inputs. Its output is further fed into the final end-to-end ALT prediction. The proposed ALT prediction approach is applied to Singapore Changi Airport (ICAO Code: WSSS) using one-month Automatic Dependent Surveillance-Broadcast (ADS-B) data from November 1 to November 30, 2022. Experimental results show that by integrating the holding featurization, we can reduce the mean absolute error (MAE) from 82.23 seconds to 43.96 seconds, and achieve an average accuracy of 96.1\%, with 79.4\% of the predictions errors being less than 60 seconds.

preprint2022arXiv

2-d signature of images and texture classification

We introduce a proper notion of 2-dimensional signature for images. This object is inspired by the so-called rough paths theory, and it captures many essential features of a 2-dimensional object such as an image. It thus serves as a low-dimensional feature for pattern classification. Here we implement a simple procedure for texture classification. In this context, we show that a low dimensional set of features based on signatures produces an excellent accuracy.

preprint2022arXiv

A Dual Accelerated Method for Online Stochastic Distributed Averaging: From Consensus to Decentralized Policy Evaluation

Motivated by decentralized sensing and policy evaluation problems, we consider a particular type of distributed stochastic optimization problem over a network, called the online stochastic distributed averaging problem. We design a dual-based method for this distributed consensus problem with Polyak--Ruppert averaging and analyze its behavior. We show that the proposed algorithm attains an accelerated deterministic error depending optimally on the condition number of the network, and also that it has an order-optimal stochastic error. This improves on the guarantees of state-of-the-art distributed stochastic optimization algorithms when specialized to this setting, and yields -- among other things -- corollaries for decentralized policy evaluation. Our proofs rely on explicitly studying the evolution of several relevant linear systems, and may be of independent interest. Numerical experiments are provided, which validate our theoretical results and demonstrate that our approach outperforms existing methods in finite-sample scenarios on several natural network topologies.

preprint2022arXiv

A Multi-User Effective Computation Offloading Mechanism for MEC System: Batched Multi-Armed Bandits Approach

With the development of 5G technology, mobile edge computing (MEC) is becoming a useful architecture, which is envisioned as a cloud computing extension version. Users within MEC system could deal with data processing at edge terminals, which can reduce time for communication or data transmission. Multi-armed bandits (MAB) algorithms are powerful tools helping users offloading tasks to their best servers in MEC. However, as the number of users and tasks growing, the frequency of selecting servers and the cost of making decision is growing rapidly under traditional MAB algorithms. Inspired by this, in this paper, we propose a Batch-based Multi-user Server Elimination (BMSE) algorithm to solve such problem, which includes two sub-algorithms. We firstly propose a sub-algorithm in user level (BMSE-UL) to reduce the time cost. In BMSE-UL, users can simplify its own available server groups and offload tasks collectively. Then another sub-algorithm in system level (BMSE-SL) is proposed to reduce the frequency of making decision. In BMSE-SL, the system can cut down all the suboptimal task offloading actions and make the choosing option unique. Furthermore, we establish the optimality of the proposed algorithms by proving the sub-linearity convergence of their regrets and demonstrate the effectiveness of BMSE by extensive experiments.

preprint2022arXiv

A Review on Graph Neural Network Methods in Financial Applications

With multiple components and relations, financial data are often presented as graph data, since it could represent both the individual features and the complicated relations. Due to the complexity and volatility of the financial market, the graph constructed on the financial data is often heterogeneous or time-varying, which imposes challenges on modeling technology. Among the graph modeling technologies, graph neural network (GNN) models are able to handle the complex graph structure and achieve great performance and thus could be used to solve financial tasks. In this work, we provide a comprehensive review of GNN models in recent financial context. We first categorize the commonly-used financial graphs and summarize the feature processing step for each node. Then we summarize the GNN methodology for each graph type, application in each area, and propose some potential research areas.

preprint2022arXiv

Can depth-adaptive BERT perform better on binary classification tasks

In light of the success of transferring language models into NLP tasks, we ask whether the full BERT model is always the best and does it exist a simple but effective method to find the winning ticket in state-of-the-art deep neural networks without complex calculations. We construct a series of BERT-based models with different size and compare their predictions on 8 binary classification tasks. The results show there truly exist smaller sub-networks performing better than the full model. Then we present a further study and propose a simple method to shrink BERT appropriately before fine-tuning. Some extended experiments indicate that our method could save time and storage overhead extraordinarily with little even no accuracy loss.

preprint2022arXiv

Client Selection and Bandwidth Allocation for Federated Learning: An Online Optimization Perspective

Federated learning (FL) can train a global model from clients' local data set, which can make full use of the computing resources of clients and performs more extensive and efficient machine learning on clients with protecting user information requirements. Many existing works have focused on optimizing FL accuracy within the resource constrained in each individual round, however there are few works comprehensively consider the optimization for latency, accuracy and energy consumption over all rounds in wireless federated learning. Inspired by this, in this paper, we investigate FL in wireless network where client selection and bandwidth allocation are two crucial factors which significantly affect the latency, accuracy and energy consumption of clients. We formulate the optimization problem as a mixed-integer problem, which is to minimize the cost of time and accuracy within the long-term energy constrained over all rounds. To address this optimization, we propose the Perround Energy Drift Plus Cost (PEDPC) algorithm in an online perspective, and the performance of the PEDPC algorithm is verified in simulation results in terms of latency, accuracy and energy consumption in IID and NON-IID dat distributions.

preprint2022arXiv

Descriptors for Machine Learning Model of Generalized Force Field in Condensed Matter Systems

We outline the general framework of machine learning (ML) methods for multi-scale dynamical modeling of condensed matter systems, and in particular of strongly correlated electron models. Complex spatial temporal behaviors in these systems often arise from the interplay between quasi-particles and the emergent dynamical classical degrees of freedom, such as local lattice distortions, spins, and order-parameters. Central to the proposed framework is the ML energy model that, by successfully emulating the time-consuming electronic structure calculation, can accurately predict a local energy based on the classical field in the intermediate neighborhood. In order to properly include the symmetry of the electron Hamiltonian, a crucial component of the ML energy model is the descriptor that transforms the neighborhood configuration into invariant feature variables, which are input to the learning model. A general theory of the descriptor for the classical fields is formulated, and two types of models are distinguished depending on the presence or absence of an internal symmetry for the classical field. Several specific approaches to the descriptor of the classical fields are presented. Our focus is on the group-theoretical method that offers a systematic and rigorous approach to compute invariants based on the bispectrum coefficients. We propose an efficient implementation of the bispectrum method based on the concept of reference irreducible representations. Finally, the implementations of the various descriptors are demonstrated on well-known electronic lattice models.

preprint2022arXiv

Experimental demonstration of memory-enhanced scaling for entanglement connection of quantum repeater segments

The quantum repeater protocol is a promising approach to implement long-distance quantum communication and large-scale quantum networks. A key idea of the quantum repeater protocol is to use long-lived quantum memories to achieve efficient entanglement connection between different repeater segments with a polynomial scaling. Here we report an experiment which realizes efficient connection of two quantum repeater segments via on-demand entanglement swapping by the use of two atomic quantum memories with storage time of tens of milliseconds. With the memory enhancement, scaling-changing acceleration is demonstrated in the rate for a successful entanglement connection. The experimental realization of entanglement connection of two quantum repeater segments with an efficient memory-enhanced scaling demonstrates a key advantage of the quantum repeater protocol, which makes a cornerstone towards future large-scale quantum networks.

preprint2022arXiv

FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing

We present a benchmark suite of four datasets for evaluating the fairness of pre-trained language models and the techniques used to fine-tune them for downstream tasks. Our benchmarks cover four jurisdictions (European Council, USA, Switzerland, and China), five languages (English, German, French, Italian and Chinese) and fairness across five attributes (gender, age, region, language, and legal area). In our experiments, we evaluate pre-trained language models using several group-robust fine-tuning techniques and show that performance group disparities are vibrant in many cases, while none of these techniques guarantee fairness, nor consistently mitigate group disparities. Furthermore, we provide a quantitative and qualitative analysis of our results, highlighting open challenges in the development of robustness methods in legal NLP.

preprint2022arXiv

Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning

Motivated by applications in reinforcement learning (RL), we study a nonlinear stochastic approximation (SA) algorithm under Markovian noise, and establish its finite-sample convergence bounds under various stepsizes. Specifically, we show that when using constant stepsize (i.e., $α_k\equiv α$), the algorithm achieves exponential fast convergence to a neighborhood (with radius $O(α\log(1/α))$) around the desired limit point. When using diminishing stepsizes with appropriate decay rate, the algorithm converges with rate $O(\log(k)/k)$. Our proof is based on Lyapunov drift arguments, and to handle the Markovian noise, we exploit the fast mixing of the underlying Markov chain. To demonstrate the generality of our theoretical results on Markovian SA, we use it to derive the finite-sample bounds of the popular $Q$-learning with linear function approximation algorithm, under a condition on the behavior policy. Importantly, we do not need to make the assumption that the samples are i.i.d., and do not require an artificial projection step in the algorithm to maintain the boundedness of the iterates. Numerical simulations corroborate our theoretical results.

preprint2022arXiv

Knowledge-Rich Self-Supervision for Biomedical Entity Linking

Entity linking faces significant challenges such as prolific variations and prevalent ambiguities, especially in high-value domains with myriad entities. Standard classification approaches suffer from the annotation bottleneck and cannot effectively handle unseen entities. Zero-shot entity linking has emerged as a promising direction for generalizing to new entities, but it still requires example gold entity mentions during training and canonical descriptions for all entities, both of which are rarely available outside of Wikipedia. In this paper, we explore Knowledge-RIch Self-Supervision ($\tt KRISS$) for biomedical entity linking, by leveraging readily available domain knowledge. In training, it generates self-supervised mention examples on unlabeled text using a domain ontology and trains a contextual encoder using contrastive learning. For inference, it samples self-supervised mentions as prototypes for each entity and conducts linking by mapping the test mention to the most similar prototype. Our approach can easily incorporate entity descriptions and gold mention labels if available. We conducted extensive experiments on seven standard datasets spanning biomedical literature and clinical notes. Without using any labeled information, our method produces $\tt KRISSBERT$, a universal entity linker for four million UMLS entities that attains new state of the art, outperforming prior self-supervised methods by as much as 20 absolute points in accuracy.

preprint2022arXiv

Locally Aggregated Feature Attribution on Natural Language Model Understanding

With the growing popularity of deep-learning models, model understanding becomes more important. Much effort has been devoted to demystify deep neural networks for better interpretability. Some feature attribution methods have shown promising results in computer vision, especially the gradient-based methods where effectively smoothing the gradients with reference data is key to a robust and faithful result. However, direct application of these gradient-based methods to NLP tasks is not trivial due to the fact that the input consists of discrete tokens and the "reference" tokens are not explicitly defined. In this work, we propose Locally Aggregated Feature Attribution (LAFA), a novel gradient-based feature attribution method for NLP models. Instead of relying on obscure reference tokens, it smooths gradients by aggregating similar reference texts derived from language model embeddings. For evaluation purpose, we also design experiments on different NLP tasks including Entity Recognition and Sentiment Analysis on public datasets as well as key feature detection on a constructed Amazon catalogue dataset. The superior performance of the proposed method is demonstrated through experiments.

preprint2022arXiv

Machine learning predictions for local electronic properties of disordered correlated electron systems

We present a scalable machine learning (ML) model to predict local electronic properties such as on-site electron number and double occupation for disordered correlated electron systems. Our approach is based on the locality principle, or the nearsightedness nature, of many-electron systems, which means local electronic properties depend mainly on the immediate environment. A ML model is developed to encode this complex dependence of local quantities on the neighborhood. We demonstrate our approach using the square-lattice Anderson-Hubbard model, which is a paradigmatic system for studying the interplay between Mott transition and Anderson localization. We develop a lattice descriptor based on group-theoretical method to represent the on-site random potentials within a finite region. The resultant feature variables are used as input to a multi-layer fully connected neural network, which is trained from datasets of variational Monte Carlo (VMC) simulations on small systems. We show that the ML predictions agree reasonably well with the VMC data. Our work underscores the promising potential of ML methods for multi-scale modeling of correlated electron systems.

preprint2022arXiv

Quantum-Memory-Enhanced Preparation of Nonlocal Graph States

Graph states are an important class of multipartite entangled states. Previous experimental generation of graph states and in particular the Greenberger-Horne-Zeilinger (GHZ) states in linear optics quantum information schemes is subjected to an exponential decay in efficiency versus the system size, which limits its large-scale applications in quantum networks. Here we demonstrate an efficient scheme to prepare graph states with only a polynomial overhead using long-lived atomic quantum memories. We generate atom-photon entangled states in two atomic ensembles asynchronously, retrieve the stored atomic excitations only when both sides succeed, and further project them into a four-photon GHZ state. We measure the fidelity of this GHZ state and further demonstrate its applications in the violation of Bell-type inequalities and in quantum cryptography. Our work demonstrates the prospect of efficient generation of multipartite entangled states in large-scale distributed systems with applications in quantum information processing and metrology.

preprint2022arXiv

REKnow: Enhanced Knowledge for Joint Entity and Relation Extraction

Relation extraction is an important but challenging task that aims to extract all hidden relational facts from the text. With the development of deep language models, relation extraction methods have achieved good performance on various benchmarks. However, we observe two shortcomings of previous methods: first, there is no unified framework that works well under various relation extraction settings; second, effectively utilizing external knowledge as background information is absent. In this work, we propose a knowledge-enhanced generative model to mitigate these two issues. Our generative model is a unified framework to sequentially generate relational triplets under various relation extraction settings and explicitly utilizes relevant knowledge from Knowledge Graph (KG) to resolve ambiguities. Our model achieves superior performance on multiple benchmarks and settings, including WebNLG, NYT10, and TACRED.

preprint2022arXiv

Shilling Black-box Recommender Systems by Learning to Generate Fake User Profiles

Due to the pivotal role of Recommender Systems (RS) in guiding customers towards the purchase, there is a natural motivation for unscrupulous parties to spoof RS for profits. In this paper, we study Shilling Attack where an adversarial party injects a number of fake user profiles for improper purposes. Conventional Shilling Attack approaches lack attack transferability (i.e., attacks are not effective on some victim RS models) and/or attack invisibility (i.e., injected profiles can be easily detected). To overcome these issues, we present Leg-UP, a novel attack model based on the Generative Adversarial Network. Leg-UP learns user behavior patterns from real users in the sampled ``templates'' and constructs fake user profiles. To simulate real users, the generator in Leg-UP directly outputs discrete ratings. To enhance attack transferability, the parameters of the generator are optimized by maximizing the attack performance on a surrogate RS model. To improve attack invisibility, Leg-UP adopts a discriminator to guide the generator to generate undetectable fake user profiles. Experiments on benchmarks have shown that Leg-UP exceeds state-of-the-art Shilling Attack methods on a wide range of victim RS models. The source code of our work is available at: https://github.com/XMUDM/ShillingAttack.

preprint2022arXiv

Zero-Shot Dependency Parsing with Worst-Case Aware Automated Curriculum Learning

Large multilingual pretrained language models such as mBERT and XLM-RoBERTa have been found to be surprisingly effective for cross-lingual transfer of syntactic parsing models (Wu and Dredze 2019), but only between related languages. However, source and training languages are rarely related, when parsing truly low-resource languages. To close this gap, we adopt a method from multi-task learning, which relies on automated curriculum learning, to dynamically optimize for parsing performance on outlier languages. We show that this approach is significantly better than uniform and size-proportional sampling in the zero-shot setting.

preprint2021arXiv

A submetric characterization of Rolewicz's property ($β$)

The main result is a submetric characterization of the class of Banach spaces admitting an equivalent norm with Rolewicz's property ($β$). As applications we prove that up to renorming, property ($β$) is stable under coarse Lipschitz embeddings and coarse quotients.

preprint2021arXiv

L-SNet: from Region Localization to Scale Invariant Medical Image Segmentation

Coarse-to-fine models and cascade segmentation architectures are widely adopted to solve the problem of large scale variations in medical image segmentation. However, those methods have two primary limitations: the first-stage segmentation becomes a performance bottleneck; the lack of overall differentiability makes the training process of two stages asynchronous and inconsistent. In this paper, we propose a differentiable two-stage network architecture to tackle these problems. In the first stage, a localization network (L-Net) locates Regions of Interest (RoIs) in a detection fashion; in the second stage, a segmentation network (S-Net) performs fine segmentation on the recalibrated RoIs; a RoI recalibration module between L-Net and S-Net eliminating the inconsistencies. Experimental results on the public dataset show that our method outperforms state-of-the-art coarse-to-fine models with negligible computation overheads.

preprint2021arXiv

Pattern Transfer Learning for Reinforcement Learning in Order Dispatching

Order dispatch is one of the central problems to ride-sharing platforms. Recently, value-based reinforcement learning algorithms have shown promising performance on this problem. However, in real-world applications, the non-stationarity of the demand-supply system poses challenges to re-utilizing data generated in different time periods to learn the value function. In this work, motivated by the fact that the relative relationship between the values of some states is largely stable across various environments, we propose a pattern transfer learning framework for value-based reinforcement learning in the order dispatch problem. Our method efficiently captures the value patterns by incorporating a concordance penalty. The superior performance of the proposed method is supported by experiments.

preprint2020arXiv

A Sterile Neutrino Search at compact materials irradiation facility

The compact material irradiation facility (CMIF) is a current project in China that will provide a compact deuteron-beryllium neutron source. The target of this facility will be an intense and compact Isotope Decay-At-Rest (IsoDAR) neutrino source. In this paper, we propose to test the sterile neutrino hypothesis using CMIF as the neutrino source. At CMIF platform, the electron antineutrino production rate can be up to $2.0\times 10^{19}$ per day. When paired with an 80 t liquid scintillator detector to study short baseline electron antineutrino disappearance, the inverse beta decay (IBD) event rate is large enough to investigate the parameter ranges of interest for neutrino anomalies. Our sensitivity analysis shows that a short baseline experiment at this platform will provide a very competitive sterile neutrino search, especially in the high-$Δm^2$ region ($Δm^2 >10\,\text{eV}^2$).

preprint2020arXiv

Error Model of Radio Fingerprint and PDR Fusion Indoor Localization

Multi-source fusion positioning is one of the technical frameworks for obtaining sufficient indoor positioning accuracy. In order to evaluate the effect of multi-source fusion positioning, it is necessary to establish a fusion error model. In this paper, we first use the least squares method to fuse the radio fingerprint and the PDR positioning, and then apply the variance propagation laws to calculate the error distribution of indoor multi-source localization methods. Based on the fusion error model, we developed an indoor positioning simulation system. The system can give a better positioning source layout scheme under a given condition, and can evaluate the signal strength distribution and the error distribution.

preprint2020arXiv

Universal Decompositional Semantic Parsing

We introduce a transductive model for parsing into Universal Decompositional Semantics (UDS) representations, which jointly learns to map natural language utterances into UDS graph structures and annotate the graph with decompositional semantic attribute scores. We also introduce a strong pipeline model for parsing into the UDS graph structure, and show that our transductive parser performs comparably while additionally performing attribute prediction. By analyzing the attribute prediction errors, we find the model captures natural relationships between attribute groups.

preprint2019arXiv

High dimensional entanglement between a photon and a multiplexed atomic quantum memory

Multiplexed quantum memories and high-dimensional entanglement can improve the performance of quantum repeaters by promoting the entanglement generation rate and the quantum communication channel capacity. Here, we experimentally generate a high-dimensional entangled state between a photon and a collective spin wave excitation stored in the multiplexed atomic quantum memory. We verify the entanglement dimension by the quantum witness and the entanglement of formation. Then we use the high-dimensional entangled state to test the violation of the Bell-type inequality. Our work provides an effective method to generate multidimensional entanglement between the flying photonic pulses and the atomic quantum interface.

preprint2019arXiv

Imaging the stochastic microstructure and dynamic development of correlations in perpendicular artificial spin ice

We use spatially resolved magneto-optical Kerr microscopy to track the complete microstates of arrays of perpendicular anisotropy nanomagnets during magnetization hysteresis cycles. These measurements allow us to disentangle the intertwined effects of nearest neighbor interaction, disorder, and stochasticity on magnetization switching. We find that the nearest neighbor correlations depend on both interaction strength and disorder. We also find that although the global characteristics of the hysteretic switching are repeatable, the exact microstate sampled is stochastic with the behavior of individual islands varying between nonminally identical runs.

preprint2019arXiv

Quantum Communication between Multiplexed Atomic Quantum Memories

The use of multiplexed atomic quantum memories (MAQM) can significantly enhance the efficiency to establish entanglement in a quantum network. In the previous experiments, individual elements of a quantum network, such as the generation, storage and transmission of quantum entanglement have been demonstrated separately. Here we report an experiment to show the compatibility of these basic operations. Specifically, we generate photon-atom entanglement in a $6\times 5$ MAQM, convert the spin wave to time-bin photonic excitation after a controllable storage time, and then store and retrieve the photon in a second MAQM for another controllable storage time. The preservation of quantum information in this process is verified by measuring the state fidelity. We also show that our scheme supports quantum systems with higher dimension than a qubit.

preprint2016arXiv

Discrete Weierstrass Fourier Transform and Experiments

We established a new method called Discrete Weierstrass Fourier Transform, a faster and more generalized Discrete Fourier Transform, to approximate discrete data. The theory of this method as well as some experiments are analyzed in this paper. In some examples, this method has a faster convergent speed than Discrete Fourier Transform.

preprint2016arXiv

Five-Collinear Sierpinski Gasket is Not Self-Similar

A geometrical conclusion: Sierpinski gasket, two Sierpinski gaskets in a line, three Sierpinski gaskets in a line, and four Sierpinski gaskets in a line are self-similar, but five Sierpinski gaskets in a line is not, which is proved in this paper.

preprint2015arXiv

Direct Visualization of Memory Effects in Artificial Spin Ice

We experimentally demonstrate that arrays of interacting nanoscale ferromagnetic islands, known as artificial spin ice, develop reproducible microstates upon cycling an applied magnetic field. The onset of this memory effect is determined by the strength of the applied field relative to the array coercivity. Specifically, when the applied field strength is almost exactly equal to the array coercivity, several training cycles are required before the array achieves a nearly completely repeatable microstate, whereas when the applied field strength is stronger or weaker than the array coercivity, a repeatable microstate is achieved after the first minor loop. We show through experiment and simulation that this memory exhibited by artificial spin ice is due to a ratchet effect on interacting, magnetically-charged defects in the island moment configuration and to the complexity of the network of strings of reversed moments that forms during magnetization reversal.

preprint2015arXiv

Improved Direct Counterfactual Quantum Communication

Recently, a novel direct counterfactual quantum communication protocol was proposed using chained quantum Zeno effect. We found that this protocol is far from being widely used in practical channels, due to the side effect of 'chained', which leads to a dramatic increase of the equivalent optical distance between Alice and Bob. Therefore, not only the transmission time of a single bit increases in multiple times, but also the protocol is more sensitive to the noise. Here, we proposed an improved protocol, in which quantum interference is employed to destroy the nested structure induced by 'chained' effect. Moreover, we proved that a better counterfactuality is easier to be achieved, and showed that our protocol outperforms the former in the presence of noises.

preprint2015arXiv

Quantum coin flipping secure against channel noises

So far, most of existed single-shot quantum coin flipping(QCF) protocols failed in a noisy quantum channel. Here, we present a nested-structured framework that makes it possible to achieve partially noise-tolerant QCF, due to that there is a trade-off between the security and the justice correctness. It is showed that noise-tolerant single-shot QCF protocols can be produced by filling the presented framework up with existed or even future protocols. We also proved a lower bound of 0.25, with which a cheating Alice or Bob could bias the outcome.

preprint2015arXiv

Randomized Structural Sparsity based Support Identification with Applications to Locating Activated or Discriminative Brain Areas: A Multi-center Reproducibility Study

In this paper, we focus on how to locate the relevant or discriminative brain regions related with external stimulus or certain mental decease, which is also called support identification, based on the neuroimaging data. The main difficulty lies in the extremely high dimensional voxel space and relatively few training samples, easily resulting in an unstable brain region discovery (or called feature selection in context of pattern recognition). When the training samples are from different centers and have betweencenter variations, it will be even harder to obtain a reliable and consistent result. Corresponding, we revisit our recently proposed algorithm based on stability selection and structural sparsity. It is applied to the multi-center MRI data analysis for the first time. A consistent and stable result is achieved across different centers despite the between-center data variation while many other state-of-the-art methods such as two sample t-test fail. Moreover, we have empirically showed that the performance of this algorithm is robust and insensitive to several of its key parameters. In addition, the support identification results on both functional MRI and structural MRI are interpretable and can be the potential biomarkers.

preprint2015arXiv

Randomized Structural Sparsity via Constrained Block Subsampling for Improved Sensitivity of Discriminative Voxel Identification

In this paper, we consider voxel selection for functional Magnetic Resonance Imaging (fMRI) brain data with the aim of finding a more complete set of probably correlated discriminative voxels, thus improving interpretation of the discovered potential biomarkers. The main difficulty in doing this is an extremely high dimensional voxel space and few training samples, resulting in unreliable feature selection. In order to deal with the difficulty, stability selection has received a great deal of attention lately, especially due to its finite sample control of false discoveries and transparent principle for choosing a proper amount of regularization. However, it fails to make explicit use of the correlation property or structural information of these discriminative features and leads to large false negative rates. In other words, many relevant but probably correlated discriminative voxels are missed. Thus, we propose a new variant on stability selection "randomized structural sparsity", which incorporates the idea of structural sparsity. Numerical experiments demonstrate that our method can be superior in controlling for false negatives while also keeping the control of false positives inherited from stability selection.

preprint2014arXiv

A discontinuous Galerkin method for the Naghdi shell model

We propose a mixed discontinuous Galerkin method for the bending problem of Naghdi shell, and present an analysis for its accuracy. The error estimate shows that when components of the curvature tensor and Christoffel symbols are piecewise linear functions, the finite element method has the optimal order of accuracy, which is uniform with respect to the shell thickness. Generally, the error estimate shows how the accuracy is affected by the shell geometry and thickness. It suggests that to achieve optimal rate of convergence, the triangulation should be properly refined in regions where the shell geometry changes dramatically. These are the results for a balanced method in which the primary displacement components and rotation components are approximated by discontinuous piecewise quadratic polynomials, while components of the scaled membrane stress tensor and shear stress vector are approximated by continuous piecewise linear functions. On elements that have edges on the free boundary of the shell, finite element space for displacement components needs to be enriched slightly, for stability purpose. Results on higher order finite elements are also included.

preprint2014arXiv

A linear finite element procedure for the Naghdi shell model

We prove the accuracy of a mixed finite element method for bending dominated shells in which a major part of the membrane/shear strain is reduced, to free up membrane/shear locking. When no part of the membrane/shear strain is reduced, the method becomes a consistent discontinuous Galerkin method that is proven accurate for membrane/shear dominated shells and intermediate shells. The two methods can be coded in a single program by using a parameter. We propose a procedure of numerically detecting the asymptotic behavior of a shell, choosing the parameter value in the method, and producing accurate approximation for a given shell problem. The method uses piecewise linear functions to approximate all the variables. The analysis is carried out for shells whose middle surfaces have the most general geometries, which shows that the method has the optimal order of accuracy for general shells and the accuracy is robust with respect to the shell thickness. In the particular case that the geometrical coefficients of the shell middle surface are piecewise constants the accuracy is uniform with respect to the shell thickness.

preprint2014arXiv

Analysis of a discontinuous Galerkin method for Koiter shell

We present an analysis for a mixed finite element method for the bending problem of Koiter shell. We derive an error estimate showing that when the geometrical coefficients of the shell mid-surface satisfy certain conditions the finite element method has the optimal order of accuracy, which is uniform with respect to the shell thickness. Generally, the error estimate shows how the accuracy is affected by the shell geometry and thickness. It suggests that to achieve optimal rate of convergence, the triangulation should be properly refined in regions where the shell geometry changes dramatically. The analysis is carried out for a balanced method in which the normal component of displacement is approximated by discontinuous piecewise cubic polynomials, while the tangential components are approximated by discontinuous piecewise quadratic polynomials, with some enrichment on elements that have edges on the free boundary. Components of the membrane stress are approximated by continuous piecewise linear functions.

preprint2014arXiv

Coarse Quotient Mappings between Metric Spaces

We give a definition of coarse quotient mapping and show that several results for uniform quotient mapping also hold in the coarse setting. In particular, we prove that any Banach space that is a coarse quotient of $L_p\equiv L_p[0,1]$, $1<p<\infty$, is isomorphic to a linear quotient of $L_p$. It is also proved that $\ell_q$ is not a coarse quotient of $\ell_p$ for $1<p<q<\infty$ using Rolewicz's property ($β$).

preprint2014arXiv

Discrete Korn's inequality for shells

We prove Korn's inequalities for Naghdi and Koiter shell models defined on spaces of discontinuous piecewise functions. They are useful in study of discontinuous finite element methods for shells.

preprint2013arXiv

Compact embedding in the space of piecewise H1 functions

We prove a compact embedding theorem in a class of spaces of piecewise H1 functions subordinated to a class of shape regular, but not necessarily quasi-uniform triangulations of a polygonal domain. This result generalizes the Rellich--Kondrachov theorem. It is used to prove generalizations to piecewise functions of nonstandard Poincaré--Friedrichs inequalities. It can be used to prove Korn inequalities for piecewise functions associated with elastic shells.

preprint2013arXiv

Oscillation in microRNA Feedback Loop

The dynamic behaviors of microRNA and mRNA under external stress are studied with biological experiments and mathematics models. In this study, we developed a mathematic model to describe the biological phenomenon and for the first time reported that, as responses to external stress, the expression levels of microRNA and mRNA sustained oscillation. And the period of the oscillation is much shorter than several reported transcriptional regulation negative feedback loop.

preprint2013arXiv

Warburg Effect due to Exposure to Different Types of Radiation

Cancer cells maintain a high level of aerobic glycolysis (the Warburg effect), which is associated with their rapid proliferation. Many studies have reported that the suppression of glycolysis and activation of oxidative phosphorylation can repress the growth of cancer cells through regulation of key regulators. Whether Warburg effect of cancer cells could be switched by some other environmental stimulus? Herein, we report an interesting phenomenon in which cells alternated between glycolysis and mitochondrial respiration depending on the type of radiation they were exposed to. We observed enhanced glycolysis and mitochondrial respiration in HeLa cells exposed to 2-Gy X-ray and 2-Gy carbon ion radiation, respectively. This discovery may provide novel insights for tumor therapy.

preprint2012arXiv

Magnetization states and switching in narrow-gapped ferromagnetic nanorings

We study permalloy nanorings that are lithographically fabricated with narrow gaps that break the rotational symmetry of the ring while retaining the vortex ground state, using both micromagnetic simulations and magnetic force microscopy (MFM). The vortex chirality in these structures can be readily set with an in-plane magnetic field and easily probed by MFM due to the field associated with the gap, suggesting such rings for possible applications in storage technologies. We find that the gapped ring edge characteristics (i.e., edge profile and gap shape) are critical in determining the magnetization switching field, thus elucidating an essential parameter in the controls of devices that might incorporate such structures.

preprint2012arXiv

Perpendicular Magnetization and Generic Realization of the Ising Model in Artificial Spin Ice

We have studied frustrated kagome arrays and unfrustrated honeycomb arrays of magnetostatically-interacting single-domain ferromagnetic islands with magnetization normal to the plane. The measured pairwise spin correlations of both lattices can be reproduced by models based solely on nearest-neighbor correlations. The kagome array has qualitatively different magnetostatics but identical lattice topology to previously-studied 'artificial spin ice' systems composed of in-plane moments. The two systems show striking similarities in the development of moment pair correlations, demonstrating a universality in artificial spin ice behavior independent of specific realization in a particular material system.

preprint2011arXiv

Ignoring your neighbors: Moment correlations dominated by indirect or distant interactions in an ordered nanomagnet array

We have studied the moment correlations within triangular lattice arrays of single-domain co-aligned nanoscale ferromagnetic islands. Independent variation of lattice spacing along and perpendicular to the island axis tunes the magnetostatic interactions between islands through a broad range of relative strengths. For certain lattice parameters, the sign of the correlations between near-neighbor island moments is opposite to that favored by the pair-wise interaction. This finding, supported by analysis of the total correlation in terms of direct and convoluted indirect contributions across multiple pairwise interactions, indicates that indirect interactions and/or those mediated by further neighbors can be tuned to be dominant, with implications for the wide range of systems composed of interacting nanomagnets.

preprint2011arXiv

Magneto-optical Kerr Effect Studies of Square Artificial Spin Ice

We report a magneto-optical Kerr effect study of the collective magnetic response of artificial square spin ice, a lithographically-defined array of single-domain ferromagnetic islands. We find that the anisotropic inter-island interactions lead to a non-monotonic angular dependence of the array coercive field. Comparisons with micromagnetic simulations indicate that the two perpendicular sublattices exhibit distinct responses to island edge roughness, which clearly influence the magnetization reversal process. Furthermore, such comparisons demonstrate that disorder associated with roughness in the island edges plays a hitherto unrecognized but essential role in the collective behavior of these systems.

preprint2011arXiv

Security proof of Counterfactual Quantum Cryptography against General Intercept-resend Attacks and Its Vulnerability

Counterfactual quantum cryptography (CQC), recently proposed by Noh, is featured with no transmission of signal particles. This exhibits evident security advantage, such as its immunity to the well known PNS attack. In this paper, the theoretical security of CQC protocol against the general intercept-resend attacks is proved by bounding the information of an eavesdropper Eve more tightly than in Yin's proposal[Phys. Rev. A 82, 042335 (2010)]. It is also showed that practical CQC implementations may be vulnerable when equipped with imperfect apparatuses, by proving that a negative key rate can be achieved when Eve launches a time-shift attack based on imperfect detector efficiency.

preprint2011arXiv

Semiquantum key distribution using entangled states

Recently, Boyer et al. presented a novel semiquantum key distribution protocol [M. Boyer, D. Kenigsberg, and T. Mor, Phys. Rev. Lett. 99, 140501 (2007)], by using four quantum states, each of which is randomly prepared by Z basis or X basis. Here we present a semiquantum key distribution protocol by using entangled states in which quantum Alice shares a secret key with classical Bob. We also show the protocol is secure against eavesdropping.

preprint2011arXiv

Semiquantum secret sharing using two-particle entangled state

Recently, Boyer et al. presented a novel semiquantum key distribution protocol [M. Boyer, D. Kenigsberg, and T. Mor, Phys. Rev. Lett. 99, 140501 (2007)], in which quantum Alice shares a secret key with classical Bob. Li et al. proposed two semiquantum secret sharing protocols [Q. Li, W. H. Chan, and D. Y. Long, Phys. Rev. A 82, 022303 (2010)] by using maximally entangled Greenberger-Horne-Zeilinger states. In this paper, we present a semiquantum secret sharing protocol by using two-particle entangled states in which quantum Alice shares a secret key with two classical parties, Bob and Charlie. Classical Bob and Charlie are restricted to performing measurement in the computational basis, preparing a particle in the computational basis, or reflecting the particles. None of them can acquire the secret unless they collaborate. We also show the protocol is secure against eavesdropping.

preprint2010arXiv

Generalized Tu Formula and Hamilton Structures of Fractional Soliton Equation Hierarchy

With the modified Riemann-Liouville fractional derivative, a fractional Tu formula is presented to investigate generalized Hamilton structure of fractional soliton equations. The obtained results can be reduced to the classical Hamilton hierachy of ordinary calculus.

preprint2009arXiv

Mesoscopic Speckle

We have measured the local and global statistics of singularity velocity, v, and have related these through the spatial correlation function of v. The distribution of v is a mixture of a mesoscopic distribution of global change in the speckle pattern and the distribution for v for Gaussian random fields. When v is normalized by the standard deviation of the fractional intensity change, probability distributions and correlation function of v approach those for random Gaussian fields. These results are directly analogous to the statistics of transmitted intensity normalized by the total transmission and provide a unified framework for understanding statistics of speckle evolution and intensity.

preprint2009arXiv

Universal mesoscopic statistics and the localization of light

We follow the evolution with sample thickness, of intensity statistics for localized light transmitted through layered media in a crossover from one to three dimensions occasioned by transverse disorder. The probability distribution of intensity changes from one dimensional to a mixture of a mesoscopic function of a single parameter, the "statistical conductance," and a distribution of intensity for Gaussian waves. This suggests that the change to a universal statistics beyond 1D is associated with the topological change in the spatial field distribution.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Source provenance

Where this author record came from

arxivconfidence 95%

external id: arxiv:2406.17608:author:6:sheng-zhang

Imported May 21, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2512.24663:author:3:sheng-zhang

Imported May 21, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.08083:author:5:sheng-zhang

Imported May 20, 2026Synced May 20, 2026

arxivconfidence 95%

external id: arxiv:2605.09675:author:5:sheng-zhang

Imported May 20, 2026Synced May 20, 2026

arxivconfidence 95%

external id: arxiv:2605.17267:author:6:sheng-zhang

Imported May 20, 2026Synced May 20, 2026

arxivconfidence 95%

external id: arxiv:2605.15458:author:2:sheng-zhang

Imported May 20, 2026Synced May 20, 2026

6 works

Peter Schiffer

Researcher

Peter Schiffer contributes to research discovery and scholarly infrastructure.

Open to collaborate

5 works

Vincent H. Crespi

Researcher

Vincent H. Crespi contributes to research discovery and scholarly infrastructure.

Open to collaborate

4 works

Chang Li

Researcher

Chang Li contributes to research discovery and scholarly infrastructure.

Open to collaborate

4 works

Gia-Wei Chern

Researcher

Gia-Wei Chern contributes to research discovery and scholarly infrastructure.

Open to collaborate

Sheng Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

61 published item(s)

CodeClinic: Evaluating Automation of Coding Skills for Clinical Reasoning Agents

CSCBench: A PVC Diagnostic Benchmark for Commodity Supply Chain Reasoning

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Machine learning nonequilibrium phase transitions in charge-density wave insulators

Majorana Zero Modes and Topological Nature in Bi2Ta3S6-family Superconductors

RAGR: Review-Augmented Generative Recommendation

Test-time generative augmentation for medical image segmentation

Video Models Can Reason with Verifiable Rewards

Renormalization Group Guided Tensor Network Structure Search

Aircraft Landing Time Prediction with Deep Learning on Trajectory Images

2-d signature of images and texture classification

A Dual Accelerated Method for Online Stochastic Distributed Averaging: From Consensus to Decentralized Policy Evaluation

A Multi-User Effective Computation Offloading Mechanism for MEC System: Batched Multi-Armed Bandits Approach

A Review on Graph Neural Network Methods in Financial Applications

Can depth-adaptive BERT perform better on binary classification tasks

Client Selection and Bandwidth Allocation for Federated Learning: An Online Optimization Perspective

Descriptors for Machine Learning Model of Generalized Force Field in Condensed Matter Systems

Experimental demonstration of memory-enhanced scaling for entanglement connection of quantum repeater segments

FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing

Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning

Knowledge-Rich Self-Supervision for Biomedical Entity Linking

Locally Aggregated Feature Attribution on Natural Language Model Understanding

Machine learning predictions for local electronic properties of disordered correlated electron systems

Quantum-Memory-Enhanced Preparation of Nonlocal Graph States

REKnow: Enhanced Knowledge for Joint Entity and Relation Extraction

Shilling Black-box Recommender Systems by Learning to Generate Fake User Profiles

Zero-Shot Dependency Parsing with Worst-Case Aware Automated Curriculum Learning

A submetric characterization of Rolewicz's property ($β$)

L-SNet: from Region Localization to Scale Invariant Medical Image Segmentation

Pattern Transfer Learning for Reinforcement Learning in Order Dispatching

A Sterile Neutrino Search at compact materials irradiation facility

Error Model of Radio Fingerprint and PDR Fusion Indoor Localization

Universal Decompositional Semantic Parsing

High dimensional entanglement between a photon and a multiplexed atomic quantum memory

Imaging the stochastic microstructure and dynamic development of correlations in perpendicular artificial spin ice

Quantum Communication between Multiplexed Atomic Quantum Memories

Discrete Weierstrass Fourier Transform and Experiments

Five-Collinear Sierpinski Gasket is Not Self-Similar

Direct Visualization of Memory Effects in Artificial Spin Ice

Improved Direct Counterfactual Quantum Communication

Quantum coin flipping secure against channel noises

Randomized Structural Sparsity based Support Identification with Applications to Locating Activated or Discriminative Brain Areas: A Multi-center Reproducibility Study

Randomized Structural Sparsity via Constrained Block Subsampling for Improved Sensitivity of Discriminative Voxel Identification

A discontinuous Galerkin method for the Naghdi shell model

A linear finite element procedure for the Naghdi shell model

Analysis of a discontinuous Galerkin method for Koiter shell

Coarse Quotient Mappings between Metric Spaces

Discrete Korn's inequality for shells

Compact embedding in the space of piecewise H1 functions

Oscillation in microRNA Feedback Loop

Warburg Effect due to Exposure to Different Types of Radiation

Magnetization states and switching in narrow-gapped ferromagnetic nanorings

Perpendicular Magnetization and Generic Realization of the Ising Model in Artificial Spin Ice

Ignoring your neighbors: Moment correlations dominated by indirect or distant interactions in an ordered nanomagnet array

Magneto-optical Kerr Effect Studies of Square Artificial Spin Ice

Security proof of Counterfactual Quantum Cryptography against General Intercept-resend Attacks and Its Vulnerability

Semiquantum key distribution using entangled states

Semiquantum secret sharing using two-particle entangled state

Generalized Tu Formula and Hamilton Structures of Fractional Soliton Equation Hierarchy

Mesoscopic Speckle

Universal mesoscopic statistics and the localization of light