Source author record

Yilun Xu

Yilun Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Machine Learning cond-mat.mtrl-sci quant-ph eess.IV

Catalog footprint

What is connected

10works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Non-commutativity as a Universal Characterization for Enhanced Quantum Metrology

A central challenge in quantum metrology is to effectively harness quantum resources to surpass classical precision bounds. Although recent studies suggest that the indefinite causal order may enable sensitivities to attain the super-Heisenberg scaling, the physical origins of such enhancements remain elusive. Here, we introduce the nilpotency index $\mathcal{K}$, which quantifies the depth of non-commutativity between operators during the encoding process, can act as a fundamental parameter governing quantum-enhanced sensing. We show that a finite $\mathcal{K}$ yields an enhanced scaling of root-mean-square error as $N^{-(1+\mathcal{K})}$. Meanwhile, the requirement for indefinite causal order arises only when the nested commutators become constant. Remarkably, in the limit $\mathcal{K} \to \infty$, exponential precision scaling $N^{-1}e^{-N}$ is achievable. We propose experimentally feasible protocols implementing these mechanisms, providing a systematic pathway towards practical quantum-enhanced metrology.

preprint2026arXiv

QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding

Quantum computing calibration depends on interpreting experimental data, and calibration plots provide the most universal human-readable representation for this task, yet no systematic evaluation exists of how well vision-language models (VLMs) interpret them. We introduce QCalEval, the first VLM benchmark for quantum calibration plots: 243 samples across 87 scenario types from 22 experiment families, spanning superconducting qubits and neutral atoms, evaluated on six question types in both zero-shot and in-context learning settings. The best general-purpose zero-shot model reaches a mean score of 72.3, and many open-weight models degrade under multi-image in-context learning, whereas frontier closed models improve substantially. A supervised fine-tuning ablation at the 9-billion-parameter scale shows that SFT improves zero-shot performance but cannot close the multimodal in-context learning gap. As a reference case study, we release NVIDIA Ising Calibration 1, an open-weight model based on Qwen3.5-35B-A3B that reaches 74.7 zero-shot average score.

preprint2022arXiv

A multi-scale approach to microstructure-sensitive thermal fatigue in solder joints

This paper presents a multi-scale modelling approach to investigate the underpinning mechanisms of microstructure-sensitive damage of single crystal Sn-3Ag-0.5Cu (wt%, SAC305) solder joints of a Ball Grid Array (BGA) board assembly subject to thermal cycling. The multi-scale scheme couples board-scale modelling at the continuum macro-scale and individual solder modelling at the crystal micro-scale. Systematic studies of tin crystal orientation and its role in fatigue damage have been compared to experimental observations. Crystallographic orientation is examined with respect to damage development, providing evidence-based optimal solder microstructural design for in-service thermomechanical fatigue.

preprint2022arXiv

Controlling Directions Orthogonal to a Classifier

We propose to identify directions invariant to a given classifier so that these directions can be controlled in tasks such as style transfer. While orthogonal decomposition is directly identifiable when the given classifier is linear, we formally define a notion of orthogonality in the non-linear case. We also provide a surprisingly simple method for constructing the orthogonal classifier (a classifier utilizing directions other than those of the given classifier). Empirically, we present three use cases where controlling orthogonal variation is important: style transfer, domain adaptation, and fairness. The orthogonal classifier enables desired style transfer when domains vary in multiple aspects, improves domain adaptation with label shifts and mitigates the unfairness as a predictor. The code is available at http://github.com/Newbeeer/orthogonal_classifier

preprint2022arXiv

Multi-scale plasticity homogenization of Sn-3Ag-0.5Cu: from β-Sn micropillars to polycrystals with intermetallics

The mechanical properties of $β$-Sn single crystals have been systematically investigated using a combined methodology of micropillar tests and rate-dependent crystal plasticity modelling. The slip strength and rate sensitivity of several key slip systems within $β$-Sn single crystals have been determined. Consistency between the numerically predicted and experimentally observed slip traces has been shown for pillars oriented to activate single and double slip. Subsequently, the temperature-dependent, intermetallic-size-governing behaviour of a polycrystal $β$-Sn-rich alloy SAC305 (96.5Sn-3Ag-0.5Cu wt%) is predicted through a multi-scale homogenization approach, and the predicted temperature- and rate-sensitivity reproduce independent experimental results. The integrated experimental and numerical approaches provide mechanistic understanding and fundamental material properties of microstructure-sensitive behaviour of electronic solders subject to thermomechanical loading, including thermal fatigue.

preprint2022arXiv

On the Origin of Plasticity-Induced Microstructure Change under Sliding Contacts

Discrete dislocation plasticity (DDP) calculations are carried out to investigate the response of a single crystal contacted by a rigid sinusoidal asperity under sliding loading conditions to look for causes of microstructure change in the dislocation structure. The mechanistic driver is identified as the development of lattice rotations and stored energy in the subsurface, which can be quantitatively correlated to recent tribological experimental observations. Maps of surface slip initiation and substrate permanent deformation obtained from DDP calculations for varying contact size and normal load suggest ways of optimally tailoring the interface and microstructural material properties for various frictional loads.

preprint2021arXiv

Anytime Sampling for Autoregressive Models via Ordered Autoencoding

Autoregressive models are widely used for tasks such as image and audio generation. The sampling process of these models, however, does not allow interruptions and cannot adapt to real-time computational resources. This challenge impedes the deployment of powerful autoregressive models, which involve a slow sampling process that is sequential in nature and typically scales linearly with respect to the data dimension. To address this difficulty, we propose a new family of autoregressive models that enables anytime sampling. Inspired by Principal Component Analysis, we learn a structured representation space where dimensions are ordered based on their importance with respect to reconstruction. Using an autoregressive model in this latent space, we trade off sample quality for computational efficiency by truncating the generation process before decoding into the original data space. Experimentally, we demonstrate in several image and audio generation tasks that sample quality degrades gracefully as we reduce the computational budget for sampling. The approach suffers almost no loss in sample quality (measured by FID) using only 60\% to 80\% of all latent dimensions for image data. Code is available at https://github.com/Newbeeer/Anytime-Auto-Regressive-Model .

preprint2021arXiv

Deep Joint Demosaicing and High Dynamic Range Imaging within a Single Shot

Spatially varying exposure (SVE) is a promising choice for high-dynamic-range (HDR) imaging (HDRI). The SVE-based HDRI, which is called single-shot HDRI, is an efficient solution to avoid ghosting artifacts. However, it is very challenging to restore a full-resolution HDR image from a real-world image with SVE because: a) only one-third of pixels with varying exposures are captured by camera in a Bayer pattern, b) some of the captured pixels are over- and under-exposed. For the former challenge, a spatially varying convolution (SVC) is designed to process the Bayer images carried with varying exposures. For the latter one, an exposure-guidance method is proposed against the interference from over- and under-exposed pixels. Finally, a joint demosaicing and HDRI deep learning framework is formalized to include the two novel components and to realize an end-to-end single-shot HDRI. Experiments indicate that the proposed end-to-end framework avoids the problem of cumulative errors and surpasses the related state-of-the-art methods.

preprint2020arXiv

A Theory of Usable Information Under Computational Constraints

We propose a new framework for reasoning about information in complex systems. Our foundation is based on a variational extension of Shannon's information theory that takes into account the modeling power and computational constraints of the observer. The resulting \emph{predictive $\mathcal{V}$-information} encompasses mutual information and other notions of informativeness such as the coefficient of determination. Unlike Shannon's mutual information and in violation of the data processing inequality, $\mathcal{V}$-information can be created through computation. This is consistent with deep neural networks extracting hierarchies of progressively more informative features in representation learning. Additionally, we show that by incorporating computational constraints, $\mathcal{V}$-information can be reliably estimated from data even in high dimensions with PAC-style guarantees. Empirically, we demonstrate predictive $\mathcal{V}$-information is more effective than mutual information for structure learning and fair representation learning.

preprint2020arXiv

TCGM: An Information-Theoretic Framework for Semi-Supervised Multi-Modality Learning

Fusing data from multiple modalities provides more information to train machine learning systems. However, it is prohibitively expensive and time-consuming to label each modality with a large amount of data, which leads to a crucial problem of semi-supervised multi-modal learning. Existing methods suffer from either ineffective fusion across modalities or lack of theoretical guarantees under proper assumptions. In this paper, we propose a novel information-theoretic approach, namely \textbf{T}otal \textbf{C}orrelation \textbf{G}ain \textbf{M}aximization (TCGM), for semi-supervised multi-modal learning, which is endowed with promising properties: (i) it can utilize effectively the information across different modalities of unlabeled data points to facilitate training classifiers of each modality (ii) it has theoretical guarantee to identify Bayesian classifiers, i.e., the ground truth posteriors of all modalities. Specifically, by maximizing TC-induced loss (namely TC gain) over classifiers of all modalities, these classifiers can cooperatively discover the equivalent class of ground-truth classifiers; and identify the unique ones by leveraging limited percentage of labeled data. We apply our method to various tasks and achieve state-of-the-art results, including news classification, emotion recognition and disease prediction.

Yilun Xu

What is connected

Connect this record

See the researcher in context

Building this map preview

10 published item(s)

Non-commutativity as a Universal Characterization for Enhanced Quantum Metrology

QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding

A multi-scale approach to microstructure-sensitive thermal fatigue in solder joints

Controlling Directions Orthogonal to a Classifier

Multi-scale plasticity homogenization of Sn-3Ag-0.5Cu: from β-Sn micropillars to polycrystals with intermetallics

On the Origin of Plasticity-Induced Microstructure Change under Sliding Contacts

Anytime Sampling for Autoregressive Models via Ordered Autoencoding

Deep Joint Demosaicing and High Dynamic Range Imaging within a Single Shot

A Theory of Usable Information Under Computational Constraints

TCGM: An Information-Theoretic Framework for Semi-Supervised Multi-Modality Learning