Researcher profile

Yan Xia

Yan Xia contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
22works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

22 published item(s)

preprint2026arXiv

ScribbleDose: Scribble-Guided Dose Prediction in Radiotherapy

Anatomical structure masks are widely adopted in radiotherapy dose prediction, as they provide explicit geometric constraints that facilitate structure-dose coupling. However, conventional manual delineation of these masks requires precise annotation of structure boundaries relevant to radiotherapy, which is time-consuming and labor-intensive. To address these limitations, we propose a scribble-guided dose prediction framework that relies solely on anatomical structures annotated with sparse scribbles. Specifically, we design a Scribble Completion Module (SCM) to generate dense anatomical masks by propagating sparse scribble labels to semantically similar voxels. During the propagation process, a supervoxel-based regularization is introduced to preserve geometric boundary consistency to ensure anatomical plausibility. Furthermore, we propose a Structure-Guided Dose Generation Module (SGDGM) to strengthen the correspondence between sparse structural cues and dose distribution. Herein, the completed dense masks derived from scribbles serve as structural guidance to condition the dose prediction network. This scribble-mask-dose consistency encourages high-dose concentration within target volumes while effectively sparing surrounding organs-at-risk. Extensive experiments on the open-source GDP-HMM dataset demonstrate that the proposed method maintains superior dose prediction performance while substantially reducing annotation cost, providing a practical paradigm for dose prediction under sparse structural annotation. The code and reannotated scribbles are made publicly available at https://github.com/iCherishxixixi/ScribbleDose.

preprint2024arXiv

Sudden change of the photon output field marks phase transitions in the quantum Rabi model

The experimental observation of quantum phase transitions predicted by the quantum Rabi model in quantum critical systems is usually challenging due to the lack of signature experimental observables associated with them. Here, we describe a method to identify the dynamical critical phenomenon in the quantum Rabi model consisting of a three-level atom and a cavity at the quantum phase transition. Such a critical phenomenon manifests itself as a sudden change of steady-state output photons in the system driven by two classical fields, when both the atom and the cavity are initially unexcited. The process occurs as the high-frequency pump field is converted into the low-frequency Stokes field and multiple cavity photons in the normal phase, while this conversion cannot occur in the superradiant phase. The sudden change of steady-state output photons is an experimentally accessible measure to probe quantum phase transitions, as it does not require preparing the equilibrium state.

preprint2024arXiv

Three-state coherent control using narrowband and passband sequences

In this work, we propose a comprehensive design for narrowband and passband composite pulse sequences by involving the dynamics of all states in the three-state system. The design is quite universal as all pulse parameters can be freely employed to modify the coefficients of error terms. Two modulation techniques, the strength and phase modulations, are used to achieve arbitrary population transfer with a desired excitation profile, while the system keeps minimal leakage to the third state. Furthermore, the current sequences are capable of tolerating inaccurate waveforms, detunings errors, and work well when rotating wave approximation is not strictly justified. Therefore, this work provides versatile adaptability for shaping various excitation profiles in both narrowband and passband sequences.

preprint2023arXiv

Unsupervised ensemble-based phenotyping helps enhance the discoverability of genes related to heart morphology

Recent genome-wide association studies (GWAS) have been successful in identifying associations between genetic variants and simple cardiac parameters derived from cardiac magnetic resonance (CMR) images. However, the emergence of big databases including genetic data linked to CMR, facilitates investigation of more nuanced patterns of shape variability. Here, we propose a new framework for gene discovery entitled Unsupervised Phenotype Ensembles (UPE). UPE builds a redundant yet highly expressive representation by pooling a set of phenotypes learned in an unsupervised manner, using deep learning models trained with different hyperparameters. These phenotypes are then analyzed via (GWAS), retaining only highly confident and stable associations across the ensemble. We apply our approach to the UK Biobank database to extract left-ventricular (LV) geometric features from image-derived three-dimensional meshes. We demonstrate that our approach greatly improves the discoverability of genes influencing LV shape, identifying 11 loci with study-wide significance and 8 with suggestive significance. We argue that our approach would enable more extensive discovery of gene associations with image-derived phenotypes for other organs or image modalities.

preprint2022arXiv

An Approach to Mispronunciation Detection and Diagnosis with Acoustic, Phonetic and Linguistic (APL) Embeddings

Many mispronunciation detection and diagnosis (MD&D) research approaches try to exploit both the acoustic and linguistic features as input. Yet the improvement of the performance is limited, partially due to the shortage of large amount annotated training data at the phoneme level. Phonetic embeddings, extracted from ASR models trained with huge amount of word level annotations, can serve as a good representation of the content of input speech, in a noise-robust and speaker-independent manner. These embeddings, when used as implicit phonetic supplementary information, can alleviate the data shortage of explicit phoneme annotations. We propose to utilize Acoustic, Phonetic and Linguistic (APL) embedding features jointly for building a more powerful MD&D system. Experimental results obtained on the L2-ARCTIC database show the proposed approach outperforms the baseline by 9.93%, 10.13% and 6.17% on the detection accuracy, diagnosis error rate and the F-measure, respectively.

preprint2022arXiv

Composite pulses for high fidelity population transfer in three-level systems

In this work, we propose a composite pulses scheme by modulating phases to achieve high fidelity population transfer in three-level systems. To circumvent the obstacle that not enough variables are exploited to eliminate the systematic errors in the transition probability, we put forward a cost function to find the optimal value. The cost function is independently constructed either in ensuring an accurate population of the target state, or in suppressing the population of the leakage state, or both of them. The results demonstrate that population transfer is implemented with high fidelity even when existing the deviations in the coupling coefficients. Furthermore, our composite pulses scheme can be extensible to arbitrarily long pulse sequences. As an example, we employ the composite pulses sequence for achieving the three-atom singlet state in an atom-cavity system with ultrahigh fidelity. The final singlet state shows robustness against deviations and is not seriously affected by waveform distortions. Also, the singlet state maintains a high fidelity under the decoherence environment.

preprint2022arXiv

Detecting a single atom in a cavity using the $χ^{(2)}$ nonlinear medium

We propose a protocol for detecting a single atom in a cavity with the help of the $χ^{(2)}$ nonlinear medium. When the $χ^{(2)}$ nonlinear medium is driven by an external laser field, the cavity mode will be squeezed, and thus one can obtain an exponentially enhanced light-matter coupling. Such a strong coupling between the atom and the cavity field can significantly change the output photon flux, the quantum fluctuations, the quantum statistical property, and the photon number distributions of the cavity field. This provides practical strategies to determine the presence or absence of an atom in a cavity. The proposed protocol exhibits some advantages, such as controllable squeezing strength and exponential increase of atom-cavity coupling strength, which make the experimental phenomenon more obvious. We hope that this protocol can supplement the existing intracavity single-atom detection protocols and provide a promise for quantum sensing in different quantum systems.

preprint2022arXiv

Noise assisted quantum coherence protection in hierarchical environment

In this paper, we investigate coherence protection of a quantum system coupled to a hierarchical environment by utilizing noise. As an example, we solve the Jaynes-Cummings (J-C) model in presence of both a classical and a quantized noise. The master equation is derived beyond the Markov approximation, where the influence of memory effects from both noises is taken into account. More importantly, we find that the performance of the coherence protection sensitively depends on the non-Markovian properties of both noises. By analyzing the mathematical mechanism of the coherence protection, we show the decoherence caused by a non-Markovian noise with longer memory time can be suppressed by another Markovian noise with shorter memory time. Last but not least, as an outlook, we try to analyze the connection between the atom-cavity entanglement and the atomic coherence, then discuss the possible clue to search for the required noise. The results presented in this paper show the possibility of protecting coherence by utilizing noise and may open a new path to design noise-assisted coherence protection schemes.

preprint2022arXiv

Nonadiabatic geometric quantum computation with cat qubits via invariant-based reverse engineering

We propose a protocol to realize nonadiabatic geometric quantum computation of small-amplitude Schrödinger cat qubits via invariant-based reverse engineering. We consider a system with a two-photon driven Kerr nonlinearity, which provides a pair of dressed even and odd coherent states, i.e., Schrödinger cat states for fault-tolerant quantum computations. An additional coherent field is applied to linearly drive a cavity mode, to induce oscillations between dressed cat states. By designing this linear drive with invariant-based reverse engineering, nonadiabatic geometric quantum computation with cat qubits can be implemented. The performance of the protocol is estimated by taking into account the influence of systematic errors, additive white Gaussian noise, and decoherence including photon loss and dephasing. Numerical results demonstrate that our protocol is robust against these negative factors. Therefore, this protocol may provide a feasible method for nonadiabatic geometric quantum computation in bosonic systems.

preprint2022arXiv

Robust population inversion in three-level systems by composite pulses

In this work, we exploit the idea of composite pulses to achieve robust population inversion in a three-level quantum system. The scheme is based on the modulation of the coupling strength, while the other physical parameters remain unchanged. The composite pulses sequence is designed by vanishing high-order error terms, and can compensate the systematic errors to any desired order. In particular, this scheme keeps a good performance under the disturbance of waveform deformations. This trait ensures that population inversion can be nearly obtained even when the pulse sequence has a short jump delay. As an example, we employ the designed composite pulse sequence to prepare the W state in a robust manner in the superconducting circuits. The numerical results show that the fidelity can still maintain a high level in a decoherence environment.

preprint2022arXiv

Tripartite high-dimensional magnon-photon entanglement in PT -symmetry broken phases of a non-Hermitian hybrid system

Hybrid systems that combine spin ensembles and superconducting circuits provide a promising platform for implementing quantum information processing. We propose a non-Hermitian magnoncircuit-QED hybrid model consisting of two cavities and an yttrium iron garnet (YIG) sphere placed in one of the cavities. Abundant exceptional points (EPs), parity-time (PT )-symmetry phases and PT -symmetry broken phases are investigated in the parameter space. Tripartite highdimensional entangled states can be generated steadily among modes of the magnon and photons in PT -symmetry broken phases, corresponding to which the stable quantum coherence exists. Results show that the tripartite high-dimensional entangled state is robust against the dissipation of hybrid system, independent of a certain initial state, and insensitive to the fluctuation of magnonphoton coupling. Further, we propose to simulate the hybrid model with an equivalent LCR circuit. This work may provide prospects for realizing multipartite high-dimensional entangled states in the magnon-circuit-QED hybrid system.

preprint2022arXiv

Video-Guided Curriculum Learning for Spoken Video Grounding

In this paper, we introduce a new task, spoken video grounding (SVG), which aims to localize the desired video fragments from spoken language descriptions. Compared with using text, employing audio requires the model to directly exploit the useful phonemes and syllables related to the video from raw speech. Moreover, we randomly add environmental noises to this speech audio, further increasing the difficulty of this task and better simulating real applications. To rectify the discriminative phonemes and extract video-related information from noisy audio, we develop a novel video-guided curriculum learning (VGCL) during the audio pre-training process, which can make use of the vital visual perceptions to help understand the spoken language and suppress the external noise. Considering during inference the model can not obtain ground truth video segments, we design a curriculum strategy that gradually shifts the input video from the ground truth to the entire video content during pre-training. Finally, the model can learn how to extract critical visual information from the entire video clip to help understand the spoken language. In addition, we collect the first large-scale spoken video grounding dataset based on ActivityNet, which is named as ActivityNet Speech dataset. Extensive experiments demonstrate our proposed video-guided curriculum learning can facilitate the pre-training process to obtain a mutual audio encoder, significantly promoting the performance of spoken video grounding tasks. Moreover, we prove that in the case of noisy sound, our model outperforms the method that grounding video with ASR transcripts, further demonstrating the effectiveness of our curriculum strategy.

preprint2021arXiv

Demonstration of dynamical control of three-level open systems with a superconducting qutrit

We propose a method for the dynamical control in three-level open systems and realize it in the experiment with a superconducting qutrit. Our work demonstrates that in the Markovian environment for a relatively long time (3 us), the systemic populations or coherence can still strictly follow the preset evolution paths. This is the first experiment for precisely controlling the Markovian dynamics of three-level open systems, providing a solid foundation for the future realization of dynamical control in multiple open systems. An instant application of the techniques demonstrated in this experiment is to stabilize the energy of quantum batteries.

preprint2021arXiv

Resilient quantum gates on periodically driven Rydberg atoms

Fault-tolerant implementation of quantum gates is one of preconditions for realizing quantum computation. The platform of Rydberg atoms is one of the most promising candidates for achieving quantum computation. We propose to implement a controlled-$Z$ gate on Rydberg atoms where an amplitude-modulated field is employed to induce Rydberg antiblockade. Gate robustness against the fluctuations in the Rydberg-Rydberg interaction can be largely enhanced by adjusting amplitude-modulated field. Furthermore, we introduce a Landau-Zener-Stückelberg transition on the target atom so as to improve the gate resilience to the deviation in the gate time and the drift in the pulse amplitude. With feasible experimental parameters, one can achieve the gate with low fidelity errors caused by atomic decay, interatomic dipole-dipole force, and Doppler effects. Finally, we generalize the gate scheme into multiqubit cases, where resilient multiqubit phase gates can be obtained in one step with an unchanged gate time as the number of qubits increases.

preprint2021arXiv

VPC-Net: Completion of 3D Vehicles from MLS Point Clouds

As a dynamic and essential component in the road environment of urban scenarios, vehicles are the most popular investigation targets. To monitor their behavior and extract their geometric characteristics, an accurate and instant measurement of vehicles plays a vital role in traffic and transportation fields. Point clouds acquired from the mobile laser scanning (MLS) system deliver 3D information of road scenes with unprecedented detail. They have proven to be an adequate data source in the fields of intelligent transportation and autonomous driving, especially for extracting vehicles. However, acquired 3D point clouds of vehicles from MLS systems are inevitably incomplete due to object occlusion or self-occlusion. To tackle this problem, we proposed a neural network to synthesize complete, dense, and uniform point clouds for vehicles from MLS data, named Vehicle Points Completion-Net (VPC-Net). In this network, we introduce a new encoder module to extract global features from the input instance, consisting of a spatial transformer network and point feature enhancement layer. Moreover, a new refiner module is also presented to preserve the vehicle details from inputs and refine the complete outputs with fine-grained information. Given sparse and partial point clouds as inputs, the network can generate complete and realistic vehicle structures and keep the fine-grained details from the partial inputs. We evaluated the proposed VPC-Net in different experiments using synthetic and real-scan datasets and applied the results to 3D vehicle monitoring tasks. Quantitative and qualitative experiments demonstrate the promising performance of the proposed VPC-Net and show state-of-the-art results.

preprint2020arXiv

Effective Rabi dynamics of Rydberg atoms and robust high-fidelity quantum gates with a resonant amplitude-modulation field

With a resonant amplitude-modulation field on two Rydberg atoms, we propose a Rydberg antiblockade (RAB) regime, where the Rabi oscillation between collective ground and excited states is induced. A controlled-Z gate can be yielded through a Rabi cycle. Further, several common issues of the RAB gates are solved by modifying the parameter relation. The gate fidelity and the gate robustness against the control error are enhanced with a shaped pulse. The requirement of control precision of the Rydberg-Rydberg interaction strength is relaxed. In addition, the atomic excitation is restrained and therefore the gate robustness against the atomic decay is enhanced.

preprint2020arXiv

Heralded atomic nonadiabatic holonomic quantum computation with Rydberg blockade

We propose a protocol to realize atomic nonadiabatic holonomic quantum computation (NHQC) with two computational atoms and an auxiliary atom. Dynamics of the system is analyzed in the regime of Rydberg blockade, and robust laser pulses are designed via reverse engineering, so that quantum gates can be easily realized with high fidelities. In addition, we also study the evolution suffering from dissipation with a master equation. The result indicates that decays of atoms can be heralded by measuring the state of the auxiliary atom, and nearly perfect unitary evolution can be obtained if the auxiliary atom remains in its Rydberg state. Therefore, the protocol may be helpful to realize NHQC in dissipative environment.

preprint2020arXiv

Noise-resistant phase gates with amplitude modulation

We propose a simple scheme for implementing fast arbitrary phase gates and employ pulse modulation to improve the gate robustness against different sources of noise. Parametric driving of a cavity is introduced to induce Rabi interactions between the cavity and qutrits, and then a two-qubit arbitrary phase gate is constructed by designing proper logical states. On this basis, we implement amplitude-shaped gates to obtain enhanced resilience to control errors in gate time and frequency detuning. By virtue of specifically designed logical states, the scheme displays intrinsic resistance to the energy relaxations of qutrits. Furthermore, we show that the gate robustness against cavity decay can be enhanced significantly with amplitude modulation.

preprint2020arXiv

Pulse reverse-engineering for strong field-matter interaction

We propose a scheme to control the evolution of a two-level quantum system in the strong coupling regime based on the idea of reverse-engineering. A coherent control field is designed to drive both closed and open two-level quantum systems along user predefined evolution trajectory without utilizing the rotating-wave approximation (RWA). As concrete examples, we show that complete population inversion, an equally weighted coherent superposition, and even oscillationlike dynamics can be achieved. As there are no limitations on the coupling strength between the control field and matter, the scheme is attractive for applications such as accelerating desired system dynamics and fast quantum information processing.

preprint2020arXiv

Robust generation of logical qubit singlet states with reverse engineering and optimal control with spin qubits

A protocol is proposed to generate singlet states of three logical qubits constructed by pairs of spins. Single and multiple operations of logical qubits are studied for the construction of an effective Hamiltonian, with which robust control fields are derived with invariant-based reverse engineering and optimal control. Moreover, systematic errors are further compensated by periodic modulation for better robustness. Furthermore, resistance to decoherence of the protocol is also shown with numerical simulations. Therefore, the protocol may provide useful perspectives for generations of logical qubit entanglement in spin systems.

preprint2020arXiv

Two-path interference for enantiomer-selective state transfer of chiral molecules

With a microwave-regime cyclic three-state configuration, an enantiomer-selective state transfer~(ESST) is carried out through the two-path interference between a direct one-photon coupling and an effective two-photon coupling. The $π$-phase difference in the one-photon process between two enantiomers makes the interference constructive for one enantiomer but destructive for the other. Therefore only one enantiomer is excited into a higher rotational state while the other remains in the ground state. The scheme is of flexibility in the pulse waveforms and the time order of two paths. We simulate the scheme in a sample of cyclohexylmethanol~(C$_7$H$_{14}$O) molecules. Simulative results show the robust and high-fidelity ESST can be obtained when experimental concerns are considered. Finally, we propose to employ the finished ESST in implementing enantio-separation and determining enantiomeric excess.

preprint2019arXiv

Robust and highly-efficient discrimination of chiral molecules through three-mode parallel paths

We propose to discriminate chiral molecules by combining one- and two-photon processes in a closed-loop configuration. The one-photon-coupling intrinsic π-phase difference between two enantiomers leads to their different superposition states, which is then followed by a two-photon process through three-mode parallel paths (3MPPs), enabling the discrimination of enantiomers by inducing their entirely-different population distributions. The 3MPPs are constructed by "chosen paths", a method of shortcuts to adiabaticity (STA), exhibiting a fast two-photon process. As an example, we propose to perform the scheme in 1, 2-propanediol molecules, which shows relatively robust and highly-efficient results under considering the experimental issues concerning unwanted transitions, imperfect initial state, pulse shaping, control errors and the effect of energy relaxations. The present work may provide help for laboratory researchers in a robust separation of chiral molecules.