Researcher profile

Xingyu Gao

Xingyu Gao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

AGC: Adaptive Geodesic Correction for Adversarial Robustness on Vision-Language Models

Vision-language models like CLIP have demonstrated remarkable zero-shot transfer capabilities. However, their susceptibility to imperceptible adversarial perturbations remains a critical security concern. While test-time defenses offer a pragmatic solution for deployed models, existing approaches typically rely on gradient-based optimization during inference, incurring significant computational overhead. In this paper, we revisit the role of data augmentation in CLIP robustness and observe that augmentations are not equally effective: specific augmentations consistently provide robust geometric cues that align with correct class semantics in the hyperspherical feature space. Based on this, we propose Adaptive Geodesic Correction (AGC), a training-free defense mechanism that requires no parameter updates. AGC identifies a reliable augmentation as a geometric anchor and corrects the input feature towards it, utilizing an adaptive step size to balance robustness against clean accuracy preservation. AGC achieves superior performance across eight fine-grained datasets and three CLIP backbones, improving average robust accuracy by 44.4\% over state-of-the-art baseline while delivering a 10$\times$ reduction in inference latency. Our findings reveal a fundamental geometric property of CLIP features, offering a highly efficient and effective paradigm for robust multimodal deployment.

preprint2026arXiv

ContextFlow: Hierarchical Task-State Alignment for Long-Horizon Embodied Agents

Long-horizon embodied agents increasingly delegate navigation, search, approach, and manipulation to specialist executors. As these executors become stronger, the main bottleneck shifts from local skill execution to maintaining a coherent task frontier across planning, monitoring, memory, and execution. We study task-state misalignment, a task-level consistency failure in which the planner's active stage, runtime evidence, remembered context, and delegated executor no longer justify the same next-step decision. This failure can lead to unsupported handoffs, stage lock, executor-context mismatch, and unnecessary replanning. We propose ContextFlow, an inspectable alignment framework that represents stages as explicit contracts, converts runtime observations into evidence packets, and applies scoped updates including continue, refine, transfer, promote, and repair. ContextFlow keeps specialist executors responsible for local closed-loop control while making task-frontier alignment explicit and auditable. Experiments and demonstration traces on long-horizon embodied tasks illustrate how evidence-grounded scoped updates diagnose and mitigate recurring task-state failures.

preprint2026arXiv

IGA-LWP: An Iterative Gradient-based Adversarial Attack for Link Weight Prediction

Link weight prediction extends classical link prediction by estimating the strength of interactions rather than merely their existence, and it underpins a wide range of applications such as traffic engineering, social recommendation, and scientific collaboration analysis. However, the robustness of link weight prediction against adversarial perturbations remains largely unexplored.In this paper, we formalize the link weight prediction attack problem as an optimization task that aims to maximize the prediction error on a set of target links by adversarially manipulating the weight values of a limited number of links. Based on this formulation, we propose an iterative gradient-based attack framework for link weight prediction, termed IGA-LWP. By employing a self-attention-enhanced graph autoencoder as a surrogate predictor, IGA-LWP leverages backpropagated gradients to iteratively identify and perturb a small subset of links. Extensive experiments on four real-world weighted networks demonstrate that IGA-LWP significantly degrades prediction accuracy on target links compared with baseline methods. Moreover, the adversarial networks generated by IGA-LWP exhibit strong transferability across several representative link weight prediction models. These findings expose a fundamental vulnerability in weighted network inference and highlight the need for developing robust link weight prediction methods.

preprint2026arXiv

SDFlow: Similarity-Driven Flow Matching for Time Series Generation

Vector quantization (VQ) with autoregressive (AR) token modeling is a widely adopted and highly competitive paradigm for time-series generation. However, such models are fundamentally limited by exposure bias: during inference, errors can accumulate across sequential predictions, leading to pronounced quality degradation in long-horizon generation. To address this, we propose SDFlow ($\textbf{S}$imilarity-$\textbf{D}$riven $\textbf{Flow}$ Matching), a non-autoregressive framework that operates entirely in the frozen VQ latent space and enables parallel sequence generation via flow matching. We tackle three key challenges in making this transition: (1) eliminating exposure bias by replacing step-wise token prediction with a global transport map; (2) mitigating the high-dimensionality of VQ token spaces via a low-rank manifold decomposition with a learned anchor prior over the latent manifold; and (3) incorporating discrete supervision into continuous transport dynamics by introducing a categorical posterior over codebook indices within a variational flow-matching formulation. Extensive experiments show that SDFlow achieves state-of-the-art performance, improving Discriminative Score and substantially reducing Context-FID, particularly for challenging long-sequence generation. Moreover, SDFlow provides significant inference speedups over autoregressive baselines, offering both high fidelity and computational efficiency. Code is available at https://anonymous.4open.science/r/SDFlow-D6F3/

preprint2022arXiv

Extensibility of Hohenberg-Kohn Theorem to general quantum systems

Hohenberg-Kohn (HK) theorem is a cornerstone of modern electronic structure calculations. For interacting electrons, given that the internal part of the Hamiltonian ($\hat H_{int}$), containing the kinetic energy and Couloumb interaction of electrons, has a fixed form, the theorem states that when the electrons are subject to an external electrostatic field, the ground-state density can inversely determine the field, and thus the full Hamiltonian completely. For a general quantum system, a HK-type Hamiltonian in the form of $\hat H_{hk}\{g_i\}=\hat H_{int}+\sum_i g_i \hat O_i$ can always be defined, by grouping those terms with fixed or preknown coefficients into $\hat H_{int}$, and factorizing the remaining as superposition of a set of Hermitian operators $\{\hat O_i\}$. We ask whether the HK theorem can be extended, so that the ground-state expectation values of $\{\hat O_i\}$ as the generalized density can in principle be used as the fundamental variables determining all the properties of the system. We show that the question can be addressed by introducing the concept of generalized density correlation matrix (GDCM) defined with respect to the $\{\hat O_i\}$ operators. The invertibility of the GDCM represents a mathematically rigorous and practically useful criterion for the extension of HK theorem to be valid. We apply this criterion to several representative systems, including the quantum Ising dimer, the frustration-free systems, N-level quantum systems with fixed inter-level transition amplitude and tunable level energies, and a fermionic Hubbard chain with inhomogeneous on-site interactions. We suggest that for a finite-size system, finding an invertible GDCM under one single $\{g_i\}$ configuration is typically sufficient to establish the generic extensibility of the HK theorem in the entire parameter space.

preprint2022arXiv

GTac: A Biomimetic Tactile Sensor with Skin-like Heterogeneous Force Feedback for Robots

The tactile sensing capabilities of human hands are essential in performing daily activities. Simultaneously perceiving normal and shear forces via the mechanoreceptors integrated into the hands enables humans to achieve daily tasks like grasping delicate objects. In this paper, we design and fabricate a novel biomimetic tactile sensor with skin-like heterogeneity that perceives normal and shear contact forces simultaneously. It mimics the multilayers of mechanoreceptors by combining an extrinsic layer (piezoresistive sensors) and an intrinsic layer (a Hall sensor) so that it can perform estimation of contact force directions, locations, and joint-level torque. By integrating our sensors, a robotic gripper can obtain contact force feedback at fingertips; accordingly, robots can perform challenging tasks, such as tweezers usage, and egg grasping. This insightful sensor design can be customized and applied in different areas of robots and provide them with heterogeneous force sensing, potentially supporting robotics in acquiring skin-like tactile feedback.

preprint2022arXiv

Melting curve of magnesium up to 460 GPa from ab initio molecular dynamics simulations

Based on ab initio molecular dynamics simulations, we determined the melting curve of magnesium (Mg) up to ~460 GPa using the solid-liquid coexistence method. Between ~30 and 100 GPa, our melting curve is noticeably lower than those from static experiments, but is in good agreement with recent shock experiments. Up to ~450 GPa, our melting curve is generally consistent with the melting points from first-principles calculations using the small-cell coexistence method. We found that, at high pressures of a few hundred GPa, due to the strong softening of interatomic interactions in the liquid phase, solid-liquid coexistence simulations of Mg show some characteristics distinctively different from other metal systems, such as aluminum. For example, at a given volume, the pressure and temperature range for maintaining a stable solid-liquid coexistence state can be very small. The strong softening in the liquid phase also causes the unusual behavior of reentrant melting to occur at very high pressures. The onset of reentrant melting is predicted at ~305 GPa, close to that at ~300 GPa from the small-cell coexistence method. We show that the calculated melting points, considering reentrant melting, can be excellently fitted to a low-order Kechin equation, thereby making it possible for us to obtain a first-principles melting curve of Mg at pressures above 50 GPa for the first time. Similar characteristics in solid-liquid coexistence simulations, as well as reentrant melting, are also expected for other systems with strong softening in the liquid phase at high pressures.

preprint2022arXiv

Nuclear spin polarization and control in a van der Waals material

Van der Waals layered materials are a focus of materials research as they support strong quantum effects and can easily form heterostructures. Electron spins in van der Waals materials played crucial roles in many recent breakthroughs, including topological insulators, two-dimensional (2D) magnets, and spin liquids. However, nuclear spins in van der Waals materials remain an unexplored quantum resource. Here we report the first demonstration of optical polarization and coherent control of nuclear spins in a van der Waals material at room temperature. We use negatively-charged boron vacancy ($V_B^-$) spin defects in hexagonal boron nitride to polarize nearby nitrogen nuclear spins. Remarkably, we observe the Rabi frequency of nuclear spins at the excited-state level anti-crossing of $V_B^-$ defects to be 350 times larger than that of an isolated nucleus, and demonstrate fast coherent control of nuclear spins. We also detect strong electron-mediated nuclear-nuclear spin coupling that is 5 orders of magnitude larger than the direct nuclear spin dipolar coupling, enabling multi-qubit operations. Nitrogen nuclear spins in a triangle lattice will be suitable for large-scale quantum simulation. Our work opens a new frontier with nuclear spins in van der Waals materials for quantum information science and technology.

preprint2022arXiv

Parameterization of Cross-Token Relations with Relative Positional Encoding for Vision MLP

Vision multi-layer perceptrons (MLPs) have shown promising performance in computer vision tasks, and become the main competitor of CNNs and vision Transformers. They use token-mixing layers to capture cross-token interactions, as opposed to the multi-head self-attention mechanism used by Transformers. However, the heavily parameterized token-mixing layers naturally lack mechanisms to capture local information and multi-granular non-local relations, thus their discriminative power is restrained. To tackle this issue, we propose a new positional spacial gating unit (PoSGU). It exploits the attention formulations used in the classical relative positional encoding (RPE), to efficiently encode the cross-token relations for token mixing. It can successfully reduce the current quadratic parameter complexity $O(N^2)$ of vision MLPs to $O(N)$ and $O(1)$. We experiment with two RPE mechanisms, and further propose a group-wise extension to improve their expressive power with the accomplishment of multi-granular contexts. These then serve as the key building blocks of a new type of vision MLP, referred to as PosMLP. We evaluate the effectiveness of the proposed approach by conducting thorough experiments, demonstrating an improved or comparable performance with reduced parameter complexity. For instance, for a model trained on ImageNet1K, we achieve a performance improvement from 72.14\% to 74.02\% and a learnable parameter reduction from $19.4M$ to $18.2M$. Code could be found at https://github.com/Zhicaiwww/PosMLP.

preprint2022arXiv

Stability of the discrete time-crystalline order in spin-optomechanical and open cavity QED systems

Discrete time crystals (DTC) have been demonstrated experimentally in several different quantum systems in the past few years. Spin couplings and cavity losses have been shown to play crucial roles for realizing DTC order in open many-body systems out of equilibrium. Recently, it has been proposed that eternal and transient DTC can be present with an open Floquet setup in the thermodynamic limit and in the deep quantum regime with few qubits, respectively. In this work, we consider the effects of spin damping and spin dephasing on the DTC order in spin-optomechanical and open cavity systems in which the spins can be all-to-all coupled. In the thermodynamic limit, it is shown that the existence of dephasing can destroy the coherence of the system and finally lead the system to its trivial steady state. Without dephasing, eternal DTC is displayed in the weak damping regime, which may be destroyed by increasing the all-to-all spin coupling or the spin damping. By contrast, the all-to-all coupling is constructive to the DTC in the moderate damping regime. We also focus on a model which can be experimentally realized by a suspended hexagonal boron nitride (hBN) membrane with a few spin color centers under microwave drive and Floquet magnetic field. Signatures of transient DTC behavior are demonstrated in both weak and moderate dissipation regimes without spin dephasing. Relevant experimental parameters are also discussed for realizing transient DTC order in such an hBN optomechanical system.

preprint2021arXiv

Excited-state spin-resonance spectroscopy of V$_\text{B}^-$ defect centers in hexagonal boron nitride

The recently discovered spin-active boron vacancy (V$_\text{B}^-$) defect center in hexagonal boron nitride (hBN) has high contrast optically-detected magnetic resonance (ODMR) at room-temperature, with a spin-triplet ground-state that shows promise as a quantum sensor. Here we report temperature-dependent ODMR spectroscopy to probe spin within the orbital excited-state. Our experiments determine the excited-state spin Hamiltonian, including a room-temperature zero-field splitting of 2.1 GHz and a g-factor similar to that of the ground-state. We confirm that the resonance is associated with spin rotation in the excited-state using pulsed ODMR measurements, and we observe Zeeman-mediated level anti-crossings in both the orbital ground- and excited-state. Our observation of a single set of excited-state spin-triplet resonance from 10 to 300 K is consistent with an orbital-singlet, which has consequences for understanding the symmetry of this defect. Additionally, the excited-state ODMR has strong temperature dependence of both contrast and transverse anisotropy splitting, enabling promising avenues for quantum sensing.

preprint2021arXiv

Non-reciprocal energy transfer through the Casimir effect

A fundamental prediction of quantum mechanics is that there are random fluctuations everywhere in a vacuum because of the zero-point energy. Remarkably, quantum electromagnetic fluctuations can induce a measurable force between neutral objects, known as the Casimir effect, which has attracted broad interests. The Casimir effect can dominate the interaction between microstructures at small separations and has been utilized to realize nonlinear oscillation, quantum trapping, phonon transfer, and dissipation dilution. However, a non-reciprocal device based on quantum vacuum fluctuations remains an unexplored frontier. Here we report quantum vacuum mediated non-reciprocal energy transfer between two micromechanical oscillators. We modulate the Casimir interaction parametrically to realize strong coupling between two oscillators with different resonant frequencies. We engineer the system's spectrum to have an exceptional point in the parameter space and observe the asymmetric topological structure near it. By dynamically changing the parameters near the exceptional point and utilizing the non-adiabaticity of the process, we achieve non-reciprocal energy transfer with high contrast. Our work represents an important development in utilizing quantum vacuum fluctuations to regulate energy transfer at the nanoscale and build functional Casimir devices.

preprint2020arXiv

A structural modeling approach to solid solutions based on the similar atomic environment

Solid solution is an important way to enhance the structural and functional performances of materials. In this work, we develop a structural modeling approach to solid solutions based on the similar atomic environment (SAE). We propose the similarity function associated with any type of atom cluster to describe quantitatively the configurational deviation from the desired solid solution structure that is fully disordered or contains short-range order (SRO). In this manner, the structural modeling for solid solution is transferred to a minimization problem in the configuration space. Moreover, we pay efforts to enhance the practicality and functionality of this approach. The approach and implementation are demonstrated by the cross-validations with the special quasi-random structure (SQS) method. We apply the SAE method to the typical quinary CoCrFeMnNi high-entropy alloy, continuous binary Ta-W alloy and ternary CoCrNi medium-entropy alloy with SRO as prototypes. In combination with ab initio calculations, we investigate the structural properties and compare the calculation results with experiments.

preprint2020arXiv

High-speed quantum transducer with a single-photon emitter in a 2D resonator

Quantum transducers can transfer quantum information between different systems. Microwave-optical photon conversion is important for future quantum networks to interconnect remote superconducting quantum computers with optical fibers. Here we propose a high-speed quantum transducer based on a single-photon emitter in an atomically thin membrane resonator that can couple single microwave photons to single optical photons. The 2D resonator is a freestanding van der Waals heterostructure (may consist of hexagonal boron nitride, graphene, or other 2D materials) that hosts a quantum emitter. The mechanical vibration (phonon) of the 2D resonator interacts with optical photons by shifting the optical transition frequency of the single-photon emitter with strain or the Stark effect. The mechanical vibration couples to microwave photons by shifting the resonant frequency of a LC circuit that includes the membrane. Thanks to the small mass of the 2D resonator, both the single-photon optomechanical coupling strength and the electromechanical coupling strength can reach strong coupling regimes. This provides a way for high-speed quantum state transfer between a microwave photon, a phonon, and an optical photon.

preprint2020arXiv

Parsing-based View-aware Embedding Network for Vehicle Re-Identification

Vehicle Re-Identification is to find images of the same vehicle from various views in the cross-camera scenario. The main challenges of this task are the large intra-instance distance caused by different views and the subtle inter-instance discrepancy caused by similar vehicles. In this paper, we propose a parsing-based view-aware embedding network (PVEN) to achieve the view-aware feature alignment and enhancement for vehicle ReID. First, we introduce a parsing network to parse a vehicle into four different views, and then align the features by mask average pooling. Such alignment provides a fine-grained representation of the vehicle. Second, in order to enhance the view-aware features, we design a common-visible attention to focus on the common visible views, which not only shortens the distance among intra-instances, but also enlarges the discrepancy of inter-instances. The PVEN helps capture the stable discriminative information of vehicle under different views. The experiments conducted on three datasets show that our model outperforms state-of-the-art methods by a large margin.

preprint2019arXiv

Ultrasensitive torque detection with an optically levitated nanorotor

Torque sensors such as the torsion balance enabled the first determination of the gravitational constant by Cavendish and the discovery of Coulomb's law. Torque sensors are also widely used in studying small-scale magnetism, the Casimir effect, and other applications. Great effort has been made to improve the torque detection sensitivity by nanofabrication and cryogenic cooling. The most sensitive nanofabricated torque sensor has achieved a remarkable sensitivity of $10^{-24} \rm{Nm}/\sqrt{\rm{Hz}}$ at millikelvin temperatures in a dilution refrigerator. Here we dramatically improve the torque detection sensitivity by developing an ultrasensitive torque sensor with an optically levitated nanorotor in vacuum. We measure a torque as small as $(1.2 \pm 0.5) \times 10^{-27} \rm{Nm}$ in 100 seconds at room temperature. Our system does not require complex nanofabrication or cryogenic cooling. Moreover, we drive a nanoparticle to rotate at a record high speed beyond 5 GHz (300 billion rpm). Our calculations show that this system will be able to detect the long-sought vacuum friction near a surface under realistic conditions. The optically levitated nanorotor will also have applications in studying nanoscale magnetism and quantum geometric phase.