Source author record

Yuan Sun

Yuan Sun appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

39works

22topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

RelayGR: Scaling Long-Sequence Generative Recommendation via Cross-Stage Relay-Race Inference

Real-time recommender systems execute multi-stage cascades (retrieval, pre-processing, fine-grained ranking) under strict tail-latency SLOs, leaving only tens of milliseconds for ranking. Generative recommendation (GR) models can improve quality by consuming long user-behavior sequences, but in production their online sequence length is tightly capped by the ranking-stage P99 budget. We observe that the majority of GR tokens encode user behaviors that are independent of the item candidates, suggesting an opportunity to pre-infer a user-behavior prefix once and reuse it during ranking rather than recomputing it on the critical path. Realizing this idea at industrial scale is non-trivial: the prefix cache must survive across multiple pipeline stages before the final ranking instance is determined, the user population implies cache footprints far beyond a single device, and indiscriminate pre-inference would overload shared resources under high QPS. We present RelayGR, a production system that enables in-HBM relay-race inference for GR. RelayGR selectively pre-infers long-term user prefixes, keeps their KV caches resident in HBM over the request lifecycle, and ensures the subsequent ranking can consume them without remote fetches. RelayGR combines three techniques: 1) a sequence-aware trigger that admits only at-risk requests under a bounded cache footprint and pre-inference load, 2) an affinity-aware router that co-locates cache production and consumption by routing both the auxiliary pre-infer signal and the ranking request to the same instance, and 3) a memory-aware expander that uses server-local DRAM to capture short-term cross-request reuse while avoiding redundant reloads. We implement RelayGR on Huawei Ascend NPUs and evaluate it with real queries. Under a fixed P99 SLO, RelayGR supports up to 1.5$\times$ longer sequences and improves SLO-compliant throughput by up to 3.6$\times$.

preprint2026arXiv

Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems

As large language model (LLM) agents are deployed in high-stakes environments, the question of how safely to delegate subtasks to specialized sub-agents becomes critical. Existing work addresses multi-agent architecture selection at design time or provides broad empirical guidelines, but neither provides a runtime mechanism that dynamically adjusts the safety-efficiency trade-off as task context changes during execution. We propose Safe Bilevel Delegation (SBD), a formal framework for runtime delegation safety in hierarchical multi-agent systems. SBD formulates task delegation as a bilevel optimization problem: an outer meta-weight network phi learns context-dependent safety-efficiency weights lambda(s) in [0,1]; an inner loop optimizes the delegation policy pi subject to a probabilistic safety constraint P(safe) >= 1-delta. The continuous delegation degree alpha in [0, 1] controls how much decision authority is transferred to each sub-agent, interpolating smoothly between full human override (alpha=0) and fully autonomous execution (alpha=1). We establish three theoretical results: (1) Safety Monotonicity--higher outer safety weight produces a weakly safer inner policy; (2) Inner Policy Convergence--projected gradient descent on the inner problem converges linearly under standard smoothness assumptions; (3) an Accountability Propagation bound that distributes responsibility across multi-hop delegation chains with a provable per-agent ceiling. We instantiate SBD in three high-stakes domains--medical AI (MIMIC-III), financial risk control (S and P 500), and educational agent supervision (ASSISTments)--specifying datasets, safety constraint sets, baselines, and evaluation protocols. This manuscript presents the formal framework and theoretical results in full; empirical validation following the protocols described herein is planned and will be reported in a forthcoming revision.

preprint2026arXiv

Semantic-Consistent Bidirectional Contrastive Hashing for Noisy Multi-Label Cross-Modal Retrieval

Cross-modal hashing (CMH) facilitates efficient retrieval across different modalities (e.g., image and text) by encoding data into compact binary representations. While recent methods have achieved remarkable performance, they often rely heavily on fully annotated datasets, which are costly and labor-intensive to obtain. In real-world scenarios, particularly in multi-label datasets, label noise is prevalent and severely degrades retrieval performance. Moreover, existing CMH approaches typically overlook the partial semantic overlaps inherent in multi-label data, limiting their robustness and generalization. To tackle these challenges, we propose a novel framework named Semantic-Consistent Bidirectional Contrastive Hashing (SCBCH). The framework comprises two complementary modules: (1) Cross-modal Semantic-Consistent Classification (CSCC), which leverages cross-modal semantic consistency to estimate sample reliability and reduce the impact of noisy labels; (2) Bidirectional Soft Contrastive Hashing (BSCH), which dynamically generates soft contrastive sample pairs based on multi-label semantic overlap, enabling adaptive contrastive learning between semantically similar and dissimilar samples across modalities. Extensive experiments on four widely-used cross-modal retrieval benchmarks validate the effectiveness and robustness of our method, consistently outperforming state-of-the-art approaches under noisy multi-label conditions.

preprint2026arXiv

UIKA: Fast Universal Head Avatar from Pose-Free Images

We present UIKA, a feed-forward animatable Gaussian head model from an arbitrary number of unposed inputs, including a single image, multi-view captures, and smartphone-captured videos. Unlike the traditional avatar method, which requires a studio-level multi-view capture system and reconstructs a human-specific model through a long-time optimization process, we rethink the task through the lenses of model representation, network design, and data preparation. First, we introduce a UV-guided avatar modeling strategy, in which each input image is associated with a pixel-wise facial correspondence estimation. Such correspondence estimation allows us to reproject each valid pixel color from screen space to UV space, which is independent of camera pose and character expression. Furthermore, we design learnable UV tokens on which the attention mechanism can be applied at both the screen and UV levels. The learned UV tokens can be decoded into canonical Gaussian attributes using aggregated UV information from all input views. To train our large avatar model, we additionally prepare a large-scale, identity-rich synthetic training dataset. Our method significantly outperforms existing approaches in both monocular and multi-view settings. See more details in our project page: https://zijian-wu.github.io/uika-page/

preprint2025arXiv

Neighbor-aware Instance Refining with Noisy Labels for Cross-Modal Retrieval

In recent years, Cross-Modal Retrieval (CMR) has made significant progress in the field of multi-modal analysis. However, since it is time-consuming and labor-intensive to collect large-scale and well-annotated data, the annotation of multi-modal data inevitably contains some noise. This will degrade the retrieval performance of the model. To tackle the problem, numerous robust CMR methods have been developed, including robust learning paradigms, label calibration strategies, and instance selection mechanisms. Unfortunately, they often fail to simultaneously satisfy model performance ceilings, calibration reliability, and data utilization rate. To overcome the limitations, we propose a novel robust cross-modal learning framework, namely Neighbor-aware Instance Refining with Noisy Labels (NIRNL). Specifically, we first propose Cross-modal Margin Preserving (CMP) to adjust the relative distance between positive and negative pairs, thereby enhancing the discrimination between sample pairs. Then, we propose Neighbor-aware Instance Refining (NIR) to identify pure subset, hard subset, and noisy subset through cross-modal neighborhood consensus. Afterward, we construct different tailored optimization strategies for this fine-grained partitioning, thereby maximizing the utilization of all available data while mitigating error propagation. Extensive experiments on three benchmark datasets demonstrate that NIRNL achieves state-of-the-art performance, exhibiting remarkable robustness, especially under high noise rates.

preprint2025arXiv

Sub-Ensemble Correlations as a Covariance Geometry

Conventional practice of spatially resolved detection in diffusion-coupled thermal atomic vapors implicitly treat localized responses as mutually independent. However, in this study, it is shown that observable correlations are governed by the intrinsic spatiotemporal covariance of a global spin-fluctuation field, such that spatial separation specifies only overlapping statistical projections rather than independent physical components. A unified field-theoretic description is established in which sub-ensembles are defined as measurement-induced statistical projections of a single stochastic field. Within this formulation, sub-ensemble correlations are determined by the covariance operator, inducing a natural geometry in which statistical independence corresponds to orthogonality of the measurement functionals. For collective spin fluctuations described by a diffusion-relaxation Ornstein-Uhlenbeck stochastic field, the covariance spectrum admits only a finite set of fluctuation modes in a bounded domain, imposing an intrinsic, field-level limit on the number of statistically distinguishable sub-ensembles. The loss of sub-ensemble independence is formalized through the notion of spatial sampling overlap, which quantifies the unavoidable statistical coupling arising from shared access to common low-order fluctuation modes. While multi-channel atomic magnetometry provides a concrete physical setting in which these constraints become explicit, the framework applies generically to diffusion-coupled stochastic fields.

preprint2024arXiv

Cross-modal Active Complementary Learning with Self-refining Correspondence

Recently, image-text matching has attracted more and more attention from academia and industry, which is fundamental to understanding the latent correspondence across visual and textual modalities. However, most existing methods implicitly assume the training pairs are well-aligned while ignoring the ubiquitous annotation noise, a.k.a noisy correspondence (NC), thereby inevitably leading to a performance drop. Although some methods attempt to address such noise, they still face two challenging problems: excessive memorizing/overfitting and unreliable correction for NC, especially under high noise. To address the two problems, we propose a generalized Cross-modal Robust Complementary Learning framework (CRCL), which benefits from a novel Active Complementary Loss (ACL) and an efficient Self-refining Correspondence Correction (SCC) to improve the robustness of existing methods. Specifically, ACL exploits active and complementary learning losses to reduce the risk of providing erroneous supervision, leading to theoretically and experimentally demonstrated robustness against NC. SCC utilizes multiple self-refining processes with momentum correction to enlarge the receptive field for correcting correspondences, thereby alleviating error accumulation and achieving accurate and stable corrections. We carry out extensive experiments on three image-text benchmarks, i.e., Flickr30K, MS-COCO, and CC152K, to verify the superior robustness of our CRCL against synthetic and real-world noisy correspondences.

preprint2023arXiv

CamPro: Camera-based Anti-Facial Recognition

The proliferation of images captured from millions of cameras and the advancement of facial recognition (FR) technology have made the abuse of FR a severe privacy threat. Existing works typically rely on obfuscation, synthesis, or adversarial examples to modify faces in images to achieve anti-facial recognition (AFR). However, the unmodified images captured by camera modules that contain sensitive personally identifiable information (PII) could still be leaked. In this paper, we propose a novel approach, CamPro, to capture inborn AFR images. CamPro enables well-packed commodity camera modules to produce images that contain little PII and yet still contain enough information to support other non-sensitive vision applications, such as person detection. Specifically, CamPro tunes the configuration setup inside the camera image signal processor (ISP), i.e., color correction matrix and gamma correction, to achieve AFR, and designs an image enhancer to keep the image quality for possible human viewers. We implemented and validated CamPro on a proof-of-concept camera, and our experiments demonstrate its effectiveness on ten state-of-the-art black-box FR models. The results show that CamPro images can significantly reduce face identification accuracy to 0.3\% while having little impact on the targeted non-sensitive vision application. Furthermore, we find that CamPro is resilient to adaptive attackers who have re-trained their FR models using images generated by CamPro, even with full knowledge of privacy-preserving ISP parameters.

preprint2023arXiv

Learning Generalizable Models for Vehicle Routing Problems via Knowledge Distillation

Recent neural methods for vehicle routing problems always train and test the deep models on the same instance distribution (i.e., uniform). To tackle the consequent cross-distribution generalization concerns, we bring the knowledge distillation to this field and propose an Adaptive Multi-Distribution Knowledge Distillation (AMDKD) scheme for learning more generalizable deep models. Particularly, our AMDKD leverages various knowledge from multiple teachers trained on exemplar distributions to yield a light-weight yet generalist student model. Meanwhile, we equip AMDKD with an adaptive strategy that allows the student to concentrate on difficult distributions, so as to absorb hard-to-master knowledge more effectively. Extensive experimental results show that, compared with the baseline neural methods, our AMDKD is able to achieve competitive results on both unseen in-distribution and out-of-distribution instances, which are either randomly synthesized or adopted from benchmark datasets (i.e., TSPLIB and CVRPLIB). Notably, our AMDKD is generic, and consumes less computational resources for inference.

preprint2023arXiv

What Do You Get from Turning on Your Video? Effects of Videoconferencing Affordances on Remote Class Experience During COVID-19

The outbreak of COVID-19 forced schools to swiftly transition from in-person classes to online or remote offerings, making educators and learners alike rely on online videoconferencing platforms. Platforms like Zoom offer audio-visual channels of communication and include features that are designed to approximate the classroom experience. However, it is not clear how students' learning experiences are affected by affordances of the videoconferencing platforms or what underlying factors explain the differential effects of these affordances on class experiences of engagement, interaction, and satisfaction. In order to find out, we conducted two online survey studies: Study 1 (N = 176) investigated the effects of three types of videoconferencing affordances (i.e., modality, interactivity, and agency affordances) on class experience during the first two months after the transition to online learning. Results showed that usage of the three kinds of affordances was positively correlated with students' class engagement, interaction, and satisfaction. Perceived anonymity, nonverbal cues, and comfort level were found to be the key mediators. In addition, students' usage of video cameras in class was influenced by their classmates. Study 2 (N = 256) tested the proposed relationships at a later stage of the pandemic and found similar results, thus serving as a constructive replication. This paper focuses on reporting the results of Study 1 since it captures the timely reactions from students when they first went online, and the second study plays a supplementary role in verifying Study 1 and thereby extending its external validity. Together, the two studies provide insights for instructors on how to leverage different videoconferencing affordances to enhance the virtual learning experience. Design implications for digital tools in online education are also discussed.

preprint2022arXiv

An end-to-end multi-scale network for action prediction in videos

In this paper, we develop an efficient multi-scale network to predict action classes in partial videos in an end-to-end manner. Unlike most existing methods with offline feature generation, our method directly takes frames as input and further models motion evolution on two different temporal scales.Therefore, we solve the complexity problems of the two stages of modeling and the problem of insufficient temporal and spatial information of a single scale. Our proposed End-to-End MultiScale Network (E2EMSNet) is composed of two scales which are named segment scale and observed global scale. The segment scale leverages temporal difference over consecutive frames for finer motion patterns by supplying 2D convolutions. For observed global scale, a Long Short-Term Memory (LSTM) is incorporated to capture motion features of observed frames. Our model provides a simple and efficient modeling framework with a small computational cost. Our E2EMSNet is evaluated on three challenging datasets: BIT, HMDB51, and UCF101. The extensive experiments demonstrate the effectiveness of our method for action prediction in videos.

preprint2022arXiv

Enhancing Column Generation by a Machine-Learning-Based Pricing Heuristic for Graph Coloring

Column Generation (CG) is an effective method for solving large-scale optimization problems. CG starts by solving a sub-problem with a subset of columns (i.e., variables) and gradually includes new columns that can improve the solution of the current subproblem. The new columns are generated as needed by repeatedly solving a pricing problem, which is often NP-hard and is a bottleneck of the CG approach. To tackle this, we propose a Machine-Learning-based Pricing Heuristic (MLPH)that can generate many high-quality columns efficiently. In each iteration of CG, our MLPH leverages an ML model to predict the optimal solution of the pricing problem, which is then used to guide a sampling method to efficiently generate multiple high-quality columns. Using the graph coloring problem, we empirically show that MLPH significantly enhancesCG as compared to six state-of-the-art methods, and the improvement in CG can lead to substantially better performance of the branch-and-price exact method.

preprint2022arXiv

Note on $T\bar{T}$ deformed matrix models and JT supergravity duals

In this work we calculate the partition functions of $\mathcal{N}=1$ type 0A and 0B JT supergravity (SJT) on 2D surfaces of arbitrary genus with multiple finite cut-off boundaries, based on the $T\bar{T}$ deformed super-Schwarzian theories. In terms of SJT/matrix model duality, we compute the corresponding correlation functions in the $T\bar{T}$ deformed matrix model side by using topological recursion relations as well as the transformation properties of topological recursion relations under $T\bar{T}$ deformation. We check that the partition functions finite cut-off 0A and 0B SJT on generic 2D surfaces match the associated correlation functions in $T\bar{T}$ deformed matrix models respectively.

preprint2022arXiv

Note on surface growth approach for bulk reconstruction

In a recent paper, a novel surface growth approach for reconstructing bulk geometry and matter fields was proposed, it was shown that this picture can be explicitly realized by the one-shot entanglement distillation tensor network and the surface state correspondence. In the present paper, we give direct analysis for the growth of the bulk minimal surfaces in asymptotically AdS 3 spacetime and show that bulk geometry can be efficiently reproduced in this way, which provides further support for the surface growth approach in entanglement wedge reconstruction.

preprint2022arXiv

Prerequisite-driven Q-matrix Refinement for Learner Knowledge Assessment: A Case Study in Online Learning Context

The ever growing abundance of learning traces in the online learning platforms promises unique insights into the learner knowledge assessment (LKA), a fundamental personalized-tutoring technique for enabling various further adaptive tutoring services in these platforms. Precise assessment of learner knowledge requires the fine-grained Q-matrix, which is generally designed by experts to map the items to skills in the domain. Due to the subjective tendency, some misspecifications may degrade the performance of LKA. Some efforts have been made to refine the small-scale Q-matrix, however, it is difficult to extend the scalability and apply these methods to the large-scale online learning context with numerous items and massive skills. Moreover, the existing LKA models employ flexible deep learning models that excel at this task, but the adequacy of LKA is still challenged by the representation capability of the models on the quite sparse item-skill graph and the learners' exercise data. To overcome these issues, in this paper we propose a prerequisite-driven Q-matrix refinement framework for learner knowledge assessment (PQRLKA) in online context. We infer the prerequisites from learners' response data and use it to refine the expert-defined Q-matrix, which enables the interpretability and the scalability to apply it to the large-scale online learning context. Based on the refined Q-matrix, we propose a Metapath2Vec enhanced convolutional representation method to obtain the comprehensive representations of the items with rich information, and feed them to the PQRLKA model to finally assess the learners' knowledge. Experiments conducted on three real-world datasets demonstrate the capability of our model to infer the prerequisites for Q-matrix refinement, and also its superiority for the LKA task.

preprint2022arXiv

Probabilistic spatial clustering based on the Self Discipline Learning (SDL) model of autonomous learning

Unsupervised clustering algorithm can effectively reduce the dimension of high-dimensional unlabeled data, thus reducing the time and space complexity of data processing. However, the traditional clustering algorithm needs to set the upper bound of the number of categories in advance, and the deep learning clustering algorithm will fall into the problem of local optimum. In order to solve these problems, a probabilistic spatial clustering algorithm based on the Self Discipline Learning(SDL) model is proposed. The algorithm is based on the Gaussian probability distribution of the probability space distance between vectors, and uses the probability scale and maximum probability value of the probability space distance as the distance measurement judgment, and then determines the category of each sample according to the distribution characteristics of the data set itself. The algorithm is tested in Laboratory for Intelligent and Safe Automobiles(LISA) traffic light data set, the accuracy rate is 99.03%, the recall rate is 91%, and the effect is achieved.

preprint2022arXiv

Ranking Constraint Relaxations for Mixed Integer Programs Using a Machine Learning Approach

Solving large-scale Mixed Integer Programs (MIP) can be difficult without advanced algorithms such as decomposition based techniques. Even if a decomposition technique might be appropriate, there are still many possible decompositions for any large MIP and it may not be obvious which will be the most effective. This paper presents a comprehensive analysis of the predictive capabilities of a Machine Learning ranking (ML) function for predicting the quality of Mixed Integer Programming (MIP) decompositions created via constraint relaxation. In this analysis, the role of instance similarity and ML prediction quality is explored, as well as the benchmarking of a ML ranking function against existing heuristic functions. For this analysis, a new dataset consisting of over 40000 unique decompositions sampled from across 24 instances from the MIPLIB2017 library has been established. These decompostions have been created by both a greedy relaxation algorithm as well as a population based multi-objective algorithm, which has previously been shown to produce high quality decompositions. In this paper, we demonstrate that a ML ranking function is able to provide state-of-the-art predictions when benchmarked against existing heuristic ranking functions. Additionally, we demonstrate that by only considering a small set of features related to the relaxed constraints in each decomposition, a ML ranking function is still able to be competitive with heuristic techniques. Such a finding is promising for future constraint relaxation approaches, as these features can be used to guide decomposition creation. Finally, we highlight where a ML ranking function would be beneficial in a decomposition creation framework.

preprint2022arXiv

The PEE aspects of entanglement islands from bit threads

We study the partial entanglement entropy (PEE) aspects of the holographic BCFT setup with an entanglement island, inspired by the holographic triality of the AdS/BCFT setup developed in the recent study on the black hole information problem, and the "PEE=CFF (component flow flux)" prescription, which is proposed recently to investigate the holographic PEE in the framework of bit thread formulation. Our study provides a bit thread description of the AdS/BCFT setup, which characterizes the specific entanglement details between the different parts of the system with an entanglement island, and may provide further insight into the black hole information problem. Furthermore, we show that in the context of island, one should distinguish between the fine-grained PEE and the semi-classical PEE. Interestingly, similar to the island rule of the fine-grained entropy in the semi-classical picture, we also propose the island rules of the fine-grained PEE.

preprint2022arXiv

The universality of islands outside the horizon

We systematically calculate the quantum extremal surface (QES) associated with Hawking radiation for general $D$-dimensional ($D\geq2$) asymptotically flat (or AdS) eternal black holes using the island formula. We collect the Hawking radiation particles by a non-gravitational bath and find that a QES exists in the near-horizon region outside the black hole when $c\cdot G_{(D)}$ is smaller enough where $c$ is the central charge of the conformal matter and $G_{(D)}$ the $D$-dimensional Newton constant. The locations of the QES in these backgrounds are obtained and the late-time radiation entropy saturates the two times of black hole entropy. Finally, we numerically check that the no island configuration exists once $c\cdot G_{(D)}$ exceeds a certain upper bound in two-dimensional generalized dilaton theories (GDT). When $c\cdot G_{(D)}$ is close to the upper bound, the backreaction of the matter field on the background can not be neglected. We also consider the conditions of existence of the island configuration with the backreaction and prove that the upper bound also exists for the Witten black hole and Weyl-related Witten black hole.

preprint2022arXiv

TiBERT: Tibetan Pre-trained Language Model

The pre-trained language model is trained on large-scale unlabeled text and can achieve state-of-the-art results in many different downstream tasks. However, the current pre-trained language model is mainly concentrated in the Chinese and English fields. For low resource language such as Tibetan, there is lack of a monolingual pre-trained model. To promote the development of Tibetan natural language processing tasks, this paper collects the large-scale training data from Tibetan websites and constructs a vocabulary that can cover 99.95$\%$ of the words in the corpus by using Sentencepiece. Then, we train the Tibetan monolingual pre-trained language model named TiBERT on the data and vocabulary. Finally, we apply TiBERT to the downstream tasks of text classification and question generation, and compare it with classic models and multilingual pre-trained models, the experimental results show that TiBERT can achieve the best performance. Our model is published in http://tibert.cmli-nlp.com/

preprint2021arXiv

Hawking radiation from nonrotating singularity-free black holes in conformal gravity

We study the sparsity of Hawking radiation from nonrotating singularity-free black holes in conformal gravity. We give a rigorous bound on the greybody factor for massless scalar field and calculate the sparsity of Hawking radiation from the black hole. Besides, we investigate the dependence of the greybody factor and the sparsity of Hawking radiation on the conformal parameters. Our study shows that the Hawking radiation from the black hole is extremely sparse. When the conformal parameters are large, the increase of conformal parameters will lead to an even more sparse Hawking radiation, while to a less sparse Hawking radiation if the conformal parameters are small.

preprint2021arXiv

High fidelity entanglement of neutral atoms via a Rydberg-mediated single-modulated-pulse controlled-PHASE gate

Neutral atom platform has become an attractive choice to study the science of quantum information and quantum simulation, where intense efforts have been devoted to the entangling processes between individual atoms. For the development of this area, two-qubit controlled-PHASE gate via Rydberg blockade is one of the most essential elements. Recent theoretical studies have suggested the advantages of introducing non-trivial waveform modulation into the gate protocol, which is anticipated to improve its performance towards the next stage. We report our recent experimental results in realizing a two-qubit controlled-PHASE($C_Z$) gate via off-resonant modulated driving(ORMD) embedded in two-photon transition for Rb atoms. It relies upon a single modulated driving pulse with a carefully calculated smooth waveform to gain the appropriate phase accumulations required by the two-qubit gate. Combining this $C_Z$ gate with global microwave pulses, two-atom entanglement is generated with the raw fidelity of 0.945(6). Accounting for state preparation and measurement (SPAM) errors, we extract the entanglement operation fidelity to be 0.980(7). Our work features completing the $C_Z$ gate operation within a single pulse to avoid shelved Rydberg population, thus demonstrate another promising route for realizing high-fidelity two-qubit gate for neutral atom platform.

preprint2021arXiv

Quasi one-dimensional diffuse laser cooling of atoms

We demonstrate experimentally the generation of one-dimensional cold gases of $^{87}$Rb atoms by diffuse laser cooling (DLC). A horizontal slender vacuum glass tube with length of 105~cm and diameter of 2~cm is used in our experiment. The diffuse laser light inside the tube, which is generated by multi-reflection of injected lasers, cools the background vapor atoms. With 250~mW of cooling light and 50~mW of repumping light, an evenly distributed meter-long profile of atom cloud is obtained. We observe a factor 4 improvement on the atomic OD for a typical cooling duration of 170~ms and a sub-Doppler atomic temperature of 25~$μ$k. The maximum number of detected cold atoms remain constant for a free-fall duration of 30~ms. Such samples are ideal for many quantum optical experiments involving electromagnetically induced transparency, electronically highly excited (Rydberg) atoms and quantum precision measurements.

preprint2020arXiv

Correlation functions of the CFTs on torus with $T\bar{T}$ deformation

In this paper, we investigate the correlation functions of the conformal field theory (CFT) with the $T\bar{T}$ deformation on torus in terms of perturbative CFT approach, which is the extension of the previous investigations on correlation functions defined on a plane. We systematically obtain the first order correction to the correlation functions of the CFTs with $T\bar{T}$ deformation in both operator formalism and path integral language. As a consistency check, we compute the deformed partition function, namely the zero-point correlation function, up to the first order, which is consistent with results in literature. Moreover, we obtain a new recursion relation for correlation functions with multiple $T$'s and $\bar{T}$'s insertion in generic CFTs on torus. Base on the recursion relations, we study some correlation functions of stress tensors up to the first order under $T\bar{T}$ deformation.

preprint2020arXiv

Generalization of Machine Learning for Problem Reduction: A Case Study on Travelling Salesman Problems

Combinatorial optimization plays an important role in real-world problem solving. In the big data era, the dimensionality of a combinatorial optimization problem is usually very large, which poses a significant challenge to existing solution methods. In this paper, we examine the generalization capability of a machine learning model for problem reduction on the classic travelling salesman problems (TSP). We demonstrate that our method can greedily remove decision variables from an optimization problem that are predicted not to be part of an optimal solution. More specifically, we investigate our model's capability to generalize on test instances that have not been seen during the training phase. We consider three scenarios where training and test instances are different in terms of: 1) problem characteristics; 2) problem sizes; and 3) problem types. Our experiments show that this machine learning based technique can generalize reasonably well over a wide range of TSP test instances with different characteristics or sizes. While the accuracy of predicting unused variables naturally deteriorates as a test instance is further away from the training set, we observe that even when tested on a different TSP problem variant, the machine learning model still makes useful predictions about which variables can be eliminated without significantly impacting solution quality.

preprint2020arXiv

Nearly nondestructive thermometry of labeled cold atoms and application to isotropic laser cooling

We have designed and implemented a straightforward method to deterministically measure the temperature of the selected segment of a cold atom ensemble, and we have also developed an upgrade in the form of nondestructive thermometry. The essence is to monitor the thermal expansion of the targeted cold atoms after labeling them through manipulating the internal states, and the nondestructive property relies upon the nearly lossless detection via driving a cycling transition. For cold atoms subject to isotropic laser cooling, this method has the unique capability of addressing only the atoms on the optical detection axis within the enclosure, which is exactly the part we care about in major applications such as atomic clock or quantum sensing. Furthermore, our results confirm the sub-Doppler cooling features in isotropic laser cooling, and we have investigated the relevant cooling properties. Meanwhile, we have applied the recently developed optical configuration with the cooling laser injection in the form of hollow beams, which helps to enhance the cooling performance and accumulate more cold atoms in the central regions.

preprint2020arXiv

On the Degree of Boolean Functions as Polynomials over $\mathbb{Z}_m$

Polynomial representations of Boolean functions over various rings such as $\mathbb{Z}$ and $\mathbb{Z}_m$ have been studied since Minsky and Papert (1969). From then on, they have been employed in a large variety of fields including communication complexity, circuit complexity, learning theory, coding theory and so on. For any integer $m\ge2$, each Boolean function has a unique multilinear polynomial representation over ring $\mathbb Z_m$. The degree of such polynomial is called modulo-$m$ degree, denoted as $\mathrm{deg}_m(\cdot)$. In this paper, we investigate the lower bound of modulo-$m$ degree of Boolean functions. When $m=p^k$ ($k\ge 1$) for some prime $p$, we give a tight lower bound that $\mathrm{deg}_m(f)\geq k(p-1)$ for any non-degenerated function $f:\{0,1\}^n\to\{0,1\}$, provided that $n$ is sufficient large. When $m$ contains two different prime factors $p$ and $q$, we give a nearly optimal lower bound for any symmetric function $f:\{0,1\}^n\to\{0,1\}$ that $\mathrm{deg}_m(f) \geq \frac{n}{2+\frac{1}{p-1}+\frac{1}{q-1}}$.

preprint2020arXiv

The correlation function of (1,1) and (2,2) supersymmetric theories with $T\bar{T}$ deformation

In the paper, based on recent studies on $T\bar{T}$ deformation of 2D field theory with supersymmetry, we investigate the deformed correlation functions in $\mathcal{N}=(1,1)$ and $\mathcal{N}=(2,2)$ 2D superconformal field theories. Up to the leading order in perturbation theory, we compute the correlation functions under $T\bar{T}$ deformation. The correlation functions in these undeformed theories are almost known, and together with the help of superconformal Ward identity in $\mathcal{N}=(1,1)$ and $\mathcal{N}=(2,2)$ theories respectively we can obtain the correlation functions with operator $T\bar{T}$ inserted. Finally, by employing dimensional regularization, we can work out the integrals in the first order perturbation. The study in this paper extends previous works on the correlation functions of $T\bar{T}$ deformed bosonic CFT to the supersymmetric case.

preprint2020arXiv

Thermodynamical property of entanglement entropy and deconfinement phase transition

We analyze the holographic entanglement entropy in a soliton background with Wilson lines and derive a relation analogous to the first law of thermodynamics. The confinement/deconfinement phase transition occurs due to the competition of two minimal surfaces. The entropic c function probes the confinement/deconfinement phase transition. It is sensitive to the degrees of freedom (DOF) smaller than the size of a spatial circle. When the Wilson line becomes large, the entropic c function becomes non-monotonic as a function of the size and does not satisfy the usual c-theorem. We analyze the entanglement entropy for a small subregion and the relation analogous to the first law of thermodynamics. For the small amount of Wilson lines, the excited amount of the entanglement entropy decreases from the ground state. It reflects that confinement decreases degrees of freedom. We finally discuss the second order correction of the holographic entanglement entropy.

preprint2019arXiv

Two-qubit controlled-PHASE Rydberg blockade gate protocol for neutral atoms via off-resonant modulated driving within a single pulse

Neutral atom array serves as an ideal platform to study the quantum logic gates, where intense efforts have been devoted to improve the two-qubit gate fidelity. We report our recent findings in constructing a different type of two-qubit controlled-PHASE quantum gate protocol with neutral atoms enabled by Rydberg blockade, which aims at both robustness and high-fidelity. It relies upon modulated driving pulse with specially tailored smooth waveform to gain appropriate phase accumulations for quantum gates. The major features include finishing gate operation within a single pulse, not necessarily requiring individual site addressing, not sensitive to the exact value of blockade shift while suppressing population leakage error and rotation error. We anticipate its fidelity to be reasonably high under realistic considerations for errors such as atomic motion, laser power fluctuation, power imbalance, spontaneous emission and so on. Moreover, we hope that such type of protocol may inspire future improvements in quantum gate designs for other categories of qubit platforms and new applications in other areas of quantum optimal control.

preprint2016arXiv

Method of Tibetan Person Knowledge Extraction

Person knowledge extraction is the foundation of the Tibetan knowledge graph construction, which provides support for Tibetan question answering system, information retrieval, information extraction and other researches, and promotes national unity and social stability. This paper proposes a SVM and template based approach to Tibetan person knowledge extraction. Through constructing the training corpus, we build the templates based the shallow parsing analysis of Tibetan syntactic, semantic features and verbs. Using the training corpus, we design a hierarchical SVM classifier to realize the entity knowledge extraction. Finally, experimental results prove the method has greater improvement in Tibetan person knowledge extraction.

preprint2016arXiv

Thermodynamics and holographic entanglement entropy for spherical black holes in 5D Gauss-Bonnet gravity

The holographic entanglement entropy is studied numerically in (4+1)-dimensional spherically symmetric Gauss-Bonnet AdS black hole spacetime with compact boundary. On the bulk side the black hole spacetime undergoes a van der Waals-like phase transition in the extended phase space, which is reviewed with emphasis on the behavior on the temperature-entropy plane. On the boundary, we calculated the regularized HEE of a disk region of different sizes. We find strong numerical evidence for the failure of equal area law for isobaric curves on the temperature-HEE plane and for the correctness of first law of entanglement entropy, and briefly give an explanation for why the latter may serve as a reason for the former, i.e. the failure of equal area law on the temperature-HEE plane.

preprint2015arXiv

Improved Algorithms for Exact and Approximate Boolean Matrix Decomposition

An arbitrary $m\times n$ Boolean matrix $M$ can be decomposed {\em exactly} as $M =U\circ V$, where $U$ (resp. $V$) is an $m\times k$ (resp. $k\times n$) Boolean matrix and $\circ$ denotes the Boolean matrix multiplication operator. We first prove an exact formula for the Boolean matrix $J$ such that $M =M\circ J^T$ holds, where $J$ is maximal in the sense that if any 0 element in $J$ is changed to a 1 then this equality no longer holds. Since minimizing $k$ is NP-hard, we propose two heuristic algorithms for finding suboptimal but good decomposition. We measure the performance (in minimizing $k$) of our algorithms on several real datasets in comparison with other representative heuristic algorithms for Boolean matrix decomposition (BMD). The results on some popular benchmark datasets demonstrate that one of our proposed algorithms performs as well or better on most of them. Our algorithms have a number of other advantages: They are based on exact mathematical formula, which can be interpreted intuitively. They can be used for approximation as well with competitive "coverage." Last but not least, they also run very fast. Due to interpretability issues in data mining, we impose the condition, called the "column use condition," that the columns of the factor matrix $U$ must form a subset of the columns of $M$.

preprint2014arXiv

Higher bottom and bottom-strange mesons

Motivated by the recent observation of the orbital excitation $B(5970)$ by CDF collaboration, we have performed a systematical study of the mass spectrum and strong decay patterns of the higher $B$ and $B_s$ mesons. Hopefully the present investigation may provide valuable clues to further experimental exploration of these intriguing excited heavy mesons.

preprint2014arXiv

Precisely resolve energy-time entanglement by dual channel Fabry-Perot interferometry

Precisely resolving the continuous variable energy-time entanglement of paired photons is an important issue in quantum optics. The Fabry-Perot interferometer provides a distinguished opportunity for this purpose if the single photon pulse's self-interference is carefully avoided. A dual channel Fabry-Perot interferometer is proposed and studied with the focus put upon higher order quantum interference effects. When the two channels are properly set up, it is capable of resolving the energy-time entanglement in detail analogously to that a usual Fabry-Perot interferometer can resolve classical light's spectrum. Variation form of the dual channel Fabry-Perot interferometry is also discussed.

preprint2014arXiv

Simulating the charged charmoniumlike structure $Z_c(4025)$

Inspired by recent observation of charged charmoniumlike structure $Z_c(4025)$, we explore the $Y(4260)\to (D^*\bar{D}^*)^- π^+$ decay through the initial-single-pion-emission mechanism, where the $D^*\bar{D}^*\to D^*\bar{D}^*$ interaction is studied by the ladder approximation including a non-interacting case. Our calculation of the differential decay width for $Y(4260)\to (D^*\bar{D}^*)^- π^+$ indicates that a charged enhancement structure around $D^*\bar{D}^*$ appears in the $D^*\bar{D}^*$ invariant mass spectrum for this process, which can correspond to newly observed $Z_c(4025)$ structure.

preprint2013arXiv

An elementary approach for the phase retrieval problem

If the phase retrieval problem can be solved by a method similar to that of solving a system of linear equations under the context of FFT, the time complexity of computer based phase retrieval algorithm would be reduced. Here I present such a method which is recursive but highly non-linear in nature, based on a close look at the Fourier spectrum of the square of the function norm. In a one dimensional problem it takes $O(N^2)$ steps of calculation to recover the phases of an N component complex vector. This method could work in 1, 2 or even higher dimensional finite Fourier analysis without changes in the behavior of time complexity. For one dimensional problem the performance of an algorithm based on this method is shown, where the limitations are discussed too, especially when subject to random noises which contains significant high frequency components.

preprint2013arXiv

Newly observed $D_J(3000)^{+,0}$ and $D_J^*(3000)^0$ as $2P$ states in $D$ meson family

In this work, we study the newly observed $D_J(3000)$ and $D_J^*(3000)$ through the analysis of mass spectrum and calculation of the corresponding two-body strong decay behaviors. Our results show that $D_J(3000)$ and $D_J^*(3000)$ are explained as the $2P(1^+)$ and $2^3P_0$ states in the $D$ meson family, respectively, which is supported by the calculated masses of these two states and their decay behaviors. As a byproduct, the decay behaviors of $3^1S_0$, $3^3S_1$, $2D(2^-)$, $2^3D_1$, $2D^\prime(2^-)$, $2^3D_3$, $2P^\prime(1^+)$, $2^3P_2$, $1F(3^+)$, $1^3F_2$, $1F^\prime(3^+)$, and $1^3F_4$ states are also given, which will be helpful to further experimentally study mixings of these $D$ mesons, too.

preprint2013arXiv

The stochastic nature induced by laser noise in narrow transitions

We use a probabilistic method to describe the effect of laser noise on the laser-atom interaction, in the case that the atom is a two level system without spontaneous emission. The stochastic differential equation for the laser-atom interaction is analyzed in the sense of perturbation approach, and we construct a stochastic process corresponding to the time evolution of the atom's wave function, whose extra randomness is induced by the laser noise. It also provides the layout of a theory for the possible experiment of measuring the laser line width by driving a narrow atomic transition.

Yuan Sun

What is connected

Connect this record

See the researcher in context

Building this map preview

39 published item(s)

RelayGR: Scaling Long-Sequence Generative Recommendation via Cross-Stage Relay-Race Inference

Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems

Semantic-Consistent Bidirectional Contrastive Hashing for Noisy Multi-Label Cross-Modal Retrieval

UIKA: Fast Universal Head Avatar from Pose-Free Images

Neighbor-aware Instance Refining with Noisy Labels for Cross-Modal Retrieval

Sub-Ensemble Correlations as a Covariance Geometry

Cross-modal Active Complementary Learning with Self-refining Correspondence

CamPro: Camera-based Anti-Facial Recognition

Learning Generalizable Models for Vehicle Routing Problems via Knowledge Distillation

What Do You Get from Turning on Your Video? Effects of Videoconferencing Affordances on Remote Class Experience During COVID-19

An end-to-end multi-scale network for action prediction in videos

Enhancing Column Generation by a Machine-Learning-Based Pricing Heuristic for Graph Coloring

Note on $T\bar{T}$ deformed matrix models and JT supergravity duals

Note on surface growth approach for bulk reconstruction

Prerequisite-driven Q-matrix Refinement for Learner Knowledge Assessment: A Case Study in Online Learning Context

Probabilistic spatial clustering based on the Self Discipline Learning (SDL) model of autonomous learning

Ranking Constraint Relaxations for Mixed Integer Programs Using a Machine Learning Approach

The PEE aspects of entanglement islands from bit threads

The universality of islands outside the horizon

TiBERT: Tibetan Pre-trained Language Model

Hawking radiation from nonrotating singularity-free black holes in conformal gravity

High fidelity entanglement of neutral atoms via a Rydberg-mediated single-modulated-pulse controlled-PHASE gate

Quasi one-dimensional diffuse laser cooling of atoms

Correlation functions of the CFTs on torus with $T\bar{T}$ deformation

Generalization of Machine Learning for Problem Reduction: A Case Study on Travelling Salesman Problems

Nearly nondestructive thermometry of labeled cold atoms and application to isotropic laser cooling

On the Degree of Boolean Functions as Polynomials over $\mathbb{Z}_m$

The correlation function of (1,1) and (2,2) supersymmetric theories with $T\bar{T}$ deformation

Thermodynamical property of entanglement entropy and deconfinement phase transition

Two-qubit controlled-PHASE Rydberg blockade gate protocol for neutral atoms via off-resonant modulated driving within a single pulse

Method of Tibetan Person Knowledge Extraction

Thermodynamics and holographic entanglement entropy for spherical black holes in 5D Gauss-Bonnet gravity

Improved Algorithms for Exact and Approximate Boolean Matrix Decomposition

Higher bottom and bottom-strange mesons

Precisely resolve energy-time entanglement by dual channel Fabry-Perot interferometry

Simulating the charged charmoniumlike structure $Z_c(4025)$

An elementary approach for the phase retrieval problem

Newly observed $D_J(3000)^{+,0}$ and $D_J^*(3000)^0$ as $2P$ states in $D$ meson family

The stochastic nature induced by laser noise in narrow transitions