Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
20works
0followers
19topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

20 published item(s)

preprint2026arXiv

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Recent progress in reasoning models has substantially advanced long-horizon mathematical and scientific problem solving, with several systems now reaching gold-medal-level performance on International Mathematical Olympiad (IMO) and International Physics Olympiad (IPhO) problems. In this paper, we introduce a simple and unified recipe for converting a post-trained reasoning backbone into a rigorous olympiad-level solver. The recipe first uses a reverse-perplexity curriculum for SFT to instill rigorous proof-search and self-checking behaviors, then scales these behaviors through a two-stage RL pipeline that progresses from RL with verifiable rewards to more delicate proof-level RL, and finally boosts solving performance with test-time scaling. Applying this recipe, we train a 30B-A3B backbone with SFT on around 340K sub-8K-token trajectories followed by 200 RL steps. The resulting model, SU-01, supports stable reasoning on difficult problems with trajectories exceeding 100K tokens, while achieving gold-medal-level performance on mathematical and physical olympiad competitions, including IMO 2025/USAMO 2026 and IPhO 2024/2025. It also demonstrates strong generalization of scientific reasoning to domains beyond mathematics and physics.

preprint2026arXiv

LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing

Large language model (LLM) routing assigns each query to the most suitable model from an ensemble. We introduce LLMRouterBench, a large-scale benchmark and unified framework for LLM routing. It comprises over 400K instances from 21 datasets and 33 models. Moreover, it provides comprehensive metrics for both performance-oriented routing and performance-cost trade-off routing, and integrates 10 representative routing baselines. Using LLMRouterBench, we systematically re-evaluate the field. While confirming strong model complementarity-the central premise of LLM routing-we find that many routing methods exhibit similar performance under unified evaluation, and several recent approaches, including commercial routers, fail to reliably outperform a simple baseline. Meanwhile, a substantial gap remains to the Oracle, driven primarily by persistent model-recall failures. We further show that backbone embedding models have limited impact, that larger ensembles exhibit diminishing returns compared to careful model curation, and that the benchmark also enables latency-aware analysis. All code and data are available at https://github.com/ynulihao/LLMRouterBench.

preprint2026arXiv

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

We introduce SciEvalKit, a unified benchmarking toolkit designed to evaluate AI models for science across a broad range of scientific disciplines and task capabilities. Unlike general-purpose evaluation platforms, SciEvalKit focuses on the core competencies of scientific intelligence, including Scientific Multimodal Perception, Scientific Multimodal Reasoning, Scientific Multimodal Understanding, Scientific Symbolic Reasoning, Scientific Code Generation, Science Hypothesis Generation and Scientific Knowledge Understanding. It supports six major scientific domains, spanning from physics and chemistry to astronomy and materials science. SciEvalKit builds a foundation of expert-grade scientific benchmarks, curated from real-world, domain-specific datasets, ensuring that tasks reflect authentic scientific challenges. The toolkit features a flexible, extensible evaluation pipeline that enables batch evaluation across models and datasets, supports custom model and dataset integration, and provides transparent, reproducible, and comparable results. By bridging capability-based evaluation and disciplinary diversity, SciEvalKit offers a standardized yet customizable infrastructure to benchmark the next generation of scientific foundation models and intelligent agents. The toolkit is open-sourced and actively maintained to foster community-driven development and progress in AI4Science.

preprint2026arXiv

scMRDR: A scalable and flexible framework for unpaired single-cell multi-omics data integration

Advances in single-cell sequencing have enabled high-resolution profiling of diverse molecular modalities, while integrating unpaired multi-omics single-cell data remains challenging. Existing approaches either rely on pair information or prior correspondences, or require computing a global pairwise coupling matrix, limiting their scalability and flexibility. In this paper, we introduce a scalable and flexible generative framework called single-cell Multi-omics Regularized Disentangled Representations (scMRDR) for unpaired multi-omics integration. Specifically, we disentangle each cell's latent representations into modality-shared and modality-specific components using a well-designed $β$-VAE architecture, which are augmented with isometric regularization to preserve intra-omics biological heterogeneity, adversarial objective to encourage cross-modal alignment, and masked reconstruction loss strategy to address the issue of missing features across modalities. Our method achieves excellent performance on benchmark datasets in terms of batch correction, modality alignment, and biological signal preservation. Crucially, it scales effectively to large-scale datasets and supports integration of more than two omics, offering a powerful and flexible solution for large-scale multi-omics data integration and downstream biological discovery.

preprint2024arXiv

Quantum entanglement and non-Hermiticity in free-fermion systems

This topical review article reports rapid progress on the generalization and application of entanglement in non-Hermitian free-fermion quantum systems. We begin by examining the realization of non-Hermitian quantum systems through the Lindblad master equation, alongside a review of typical non-Hermitian free-fermion systems that exhibit unique features. A pedagogical discussion is provided on the relationship between entanglement quantities and the correlation matrix in Hermitian systems. Building on this foundation, we focus on how entanglement concepts are extended to non-Hermitian systems from their Hermitian free-fermion counterparts, with a review of the general properties that emerge. Finally, we highlight various concrete studies, demonstrating that entanglement entropy remains a powerful diagnostic tool for characterizing non-Hermitian physics. The entanglement spectrum also reflects the topological characteristics of non-Hermitian topological systems, while unique non-Hermitian entanglement behaviors are also discussed. The review is concluded with several future directions. Through this review, we hope to provide a useful guide for researchers who are interested in entanglement in non-Hermitian quantum systems.

preprint2023arXiv

$β$-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search

Neural Architecture Search has attracted increasing attention in recent years. Among them, differential NAS approaches such as DARTS, have gained popularity for the search efficiency. However, they still suffer from three main issues, that are, the weak stability due to the performance collapse, the poor generalization ability of the searched architectures, and the inferior robustness to different kinds of proxies. To solve the stability and generalization problems, a simple-but-effective regularization method, termed as Beta-Decay, is proposed to regularize the DARTS-based NAS searching process (i.e., $β$-DARTS). Specifically, Beta-Decay regularization can impose constraints to keep the value and variance of activated architecture parameters from being too large, thereby ensuring fair competition among architecture parameters and making the supernet less sensitive to the impact of input on the operation set. In-depth theoretical analyses on how it works and why it works are provided. Comprehensive experiments validate that Beta-Decay regularization can help to stabilize the searching process and makes the searched network more transferable across different datasets. To address the robustness problem, we first benchmark different NAS methods under a wide range of proxy data, proxy channels, proxy layers and proxy epochs, since the robustness of NAS under different kinds of proxies has not been explored before. We then conclude some interesting findings and find that $β$-DARTS always achieves the best result among all compared NAS methods under almost all proxies. We further introduce the novel flooding regularization to the weight optimization of $β$-DARTS (i.e., Bi-level regularization), and experimentally and theoretically verify its effectiveness for improving the proxy robustness of differentiable NAS.

preprint2022arXiv

$β$-DARTS: Beta-Decay Regularization for Differentiable Architecture Search

Neural Architecture Search~(NAS) has attracted increasingly more attention in recent years because of its capability to design deep neural networks automatically. Among them, differential NAS approaches such as DARTS, have gained popularity for the search efficiency. However, they suffer from two main issues, the weak robustness to the performance collapse and the poor generalization ability of the searched architectures. To solve these two problems, a simple-but-efficient regularization method, termed as Beta-Decay, is proposed to regularize the DARTS-based NAS searching process. Specifically, Beta-Decay regularization can impose constraints to keep the value and variance of activated architecture parameters from too large. Furthermore, we provide in-depth theoretical analysis on how it works and why it works. Experimental results on NAS-Bench-201 show that our proposed method can help to stabilize the searching process and makes the searched network more transferable across different datasets. In addition, our search scheme shows an outstanding property of being less dependent on training time and data. Comprehensive experiments on a variety of search spaces and datasets validate the effectiveness of the proposed method.

preprint2022arXiv

Detailed study of quantum path interferences in high harmonic generation driven by chirped laser pulses

We investigate the electron quantum path interference effects during high harmonic generation in atomic gas medium driven by ultrashort chirped laser pulses. To achieve that, we identify and vary the different experimentally relevant control parameters of such a driving laser pulse influencing the high harmonic spectra. Specifically, the impact of the pulse duration, peak intensity and instantaneous frequency is studied in a self-consistent manner based on Lewenstein formalism. Simulations involving macroscopic propagation effects are also considered. The study aims to reveal the microscopic background behind a variety of interference patterns capturing important information both about the fundamental laser field and the generation process itself. The results provide guidance towards experiments with chirp control as a tool to unravel, explain and utilize the rich and complex interplay between quantum path interferences including the tuning of the periodicity of the intensity dependent oscillation of the harmonic signal, and the curvature of spectrally resolved Maker fringes.

preprint2022arXiv

Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation

Semantic segmentation is a popular research topic in computer vision, and many efforts have been made on it with impressive results. In this paper, we intend to search an optimal network structure that can run in real-time for this problem. Towards this goal, we jointly search the depth, channel, dilation rate and feature spatial resolution, which results in a search space consisting of about 2.78*10^324 possible choices. To handle such a large search space, we leverage differential architecture search methods. However, the architecture parameters searched using existing differential methods need to be discretized, which causes the discretization gap between the architecture parameters found by the differential methods and their discretized version as the final solution for the architecture search. Hence, we relieve the problem of discretization gap from the innovative perspective of solution space regularization. Specifically, a novel Solution Space Regularization (SSR) loss is first proposed to effectively encourage the supernet to converge to its discrete one. Then, a new Hierarchical and Progressive Solution Space Shrinking method is presented to further achieve high efficiency of searching. In addition, we theoretically show that the optimization of SSR loss is equivalent to the L_0-norm regularization, which accounts for the improved search-evaluation gap. Comprehensive experiments show that the proposed search scheme can efficiently find an optimal network structure that yields an extremely fast speed (175 FPS) of segmentation with a small model size (1 M) while maintaining comparable accuracy.

preprint2022arXiv

Evolution of Dynamical Signature in the X-cube Fracton Topological Order

As an unconventional realization of topological orders with an exotic interplay of topology and geometry, fracton (topological) orders feature subextensive topological ground state degeneracy and subdimensional excitations that are movable only within a certain subspace. It has been known in the exactly solvable three-dimensional X-cube model that universally represents the type-I fracton orders, that mobility constraints on subdimensional excitations originate from the absence of spatially deformable string-like operators. To unveil the interplay of topology and geometry, in this paper, we study the dynamical signature in the X-cube model in the presence of external Zeeman fields via large-scale quantum Monte Carlo simulation and stochastic analytic continuation. We compute both real-space correlation functions and dynamic structure factors of subdimensional excitations (i.e., fractons, lineons, and planons) in the fracton phase and their evolution into the trivial paramagnetic phase by increasing external fields. We find in the fracton phase, that the correlation functions and the spectral functions show clear anisotropy exactly caused by the underlying mobility constraints. On the other hand, the external fields successfully induce quantum fluctuations and offer mobility to excitations along the subspace allowed by mobility constraints. These numerical results provide the evolution of a dynamical signature of subdimensional particles in fracton orders, indicating that the mobility constraints on local dynamical properties of subdimensional excitations are deeply related to the existence of fracton topological order. The results will also be helpful in potential experimental identifications in spectroscopy measurements such as neutron scattering and nuclear magnetic resonance.

preprint2022arXiv

Fractionalizing Global Symmetry on Looplike Topological Excitations

Symmetry fractionalization (SF) on topological excitations is one of the most remarkable quantum phenomena in topological orders with symmetry, i.e., symmetry-enriched topological phases. While much progress has been theoretically and experimentally made in 2D, the understanding on SF in 3D is far from complete. A long-standing challenge is to understand SF on looplike topological excitations which are spatially extended objects. In this work, we construct a powerful topological-field-theoretical framework approach for 3D topological orders, which leads to a systematic characterization and classification of SF. For systems with Abelian gauge groups ($G_g$) and Abelian symmetry groups ($G_s$), we successfully establish equivalence classes that lead to a finite number of patterns of SF, although there are notoriously infinite number of Lagrangian-descriptions of the system. We compute topologically distinct types of fractional symmetry charges carried by particles. Then, for each type, we compute topologically distinct statistical phases of braiding processes among loop excitations and external symmetry fluxes. As a result, we are able to unambiguously list all physical observables for each pattern of SF. We present detailed calculations on many concrete examples. As an example, we find that the SF in an untwisted $\mathbb{Z}_2\times \mathbb{Z}_2$ topological order with $\mathbb{Z}_2$ symmetry is classified by $ (\mathbb{Z}_2)^6\oplus (\mathbb{Z}_2)^2\oplus (\mathbb{Z}_2)^2\oplus (\mathbb{Z}_2)^2$. If the topological order is twisted, the classification reduces to $(\mathbb{Z}_2)^6$ in which particle excitations always carry integer charge. Pure algebraic formalism of the classification is given by: $ \bigoplus_{ν_i} \mathcal H^4 ( G_g\leftthreetimes_{ν_i}G_s, U (1)) /Γ_ω({ν_i}) \,$. Several future directions are proposed.

preprint2022arXiv

Generalized Global Ranking-Aware Neural Architecture Ranker for Efficient Image Classifier Search

Neural Architecture Search (NAS) is a powerful tool for automating effective image processing DNN designing. The ranking has been advocated to design an efficient performance predictor for NAS. The previous contrastive method solves the ranking problem by comparing pairs of architectures and predicting their relative performance. However, it only focuses on the rankings between two involved architectures and neglects the overall quality distributions of the search space, which may suffer generalization issues. A predictor, namely Neural Architecture Ranker (NAR) which concentrates on the global quality tier of specific architecture, is proposed to tackle such problems caused by the local perspective. The NAR explores the quality tiers of the search space globally and classifies each individual to the tier they belong to according to its global ranking. Thus, the predictor gains the knowledge of the performance distributions of the search space which helps to generalize its ranking ability to the datasets more easily. Meanwhile, the global quality distribution facilitates the search phase by directly sampling candidates according to the statistics of quality tiers, which is free of training a search algorithm, e.g., Reinforcement Learning (RL) or Evolutionary Algorithm (EA), thus it simplifies the NAS pipeline and saves the computational overheads. The proposed NAR achieves better performance than the state-of-the-art methods on two widely used datasets for NAS research. On the vast search space of NAS-Bench-101, the NAR easily finds the architecture with top 0.01$\unicode{x2030}$ performance only by sampling. It also generalizes well to different image datasets of NAS-Bench-201, i.e., CIFAR-10, CIFAR-100, and ImageNet-16-120 by identifying the optimal architectures for each of them.

preprint2022arXiv

High-flux 100-kHz attosecond pulse source driven by a high average power annular laser beam

High-repetition-rate attosecond pulse sources are indispensable tools of time-resolved studies of electron dynamics, such as coincidence spectroscopy and experiments with high demands on statistics or signal-to-noise ratio, especially in case of solid and big molecule samples in chemistry and biology. Although with the high-repetition-rate lasers such attosecond pulses in a pump-probe configuration are possible to achieve, until now only a few such light sources have been demonstrated. Here, by shaping the driving laser to an annular beam, a 100-kHz attosecond pulse train (APT) is reported with the highest energy so far (51 pJ/shot) on target (269 pJ at generation) among the high-repetition-rate systems (> 10 kHz) in which the attosecond pulses were temporally characterized. The on-target pulse energy is maximized by reducing the losses from the reflections and filtering of the high harmonics, and an unprecedented 19% transmission rate from the generation point to the target position is achieved. At the same time, the probe beam is also annular, and low loss of this beam is reached by using another holey mirror to combine with the APT. The advantages of using an annular beam to generate attosecond pulses with a high average power laser is demonstrated experimentally and theoretically. The effect of nonlinear propagation in the generation medium on the annular-beam generation concept is also analyzed in detail.

preprint2022arXiv

Liquid-cooled modular gas cell system for high-order harmonic generation using high average power laser systems

We present the design and implementation of a new, modular gas target suitable for high-order harmonic generation using high average power lasers. To ensure thermal stability in this high heat load environment, we implement an appropriate liquid cooling system. The system can be used in multiple-cell configurations allowing to control the cell length and aperture size. The cell design was optimized with heat and flow simulations for thermal characteristics, vacuum compatibility and generation medium properties. Finally, the cell system was experimentally validated by conducting high-order harmonic generation measurements using the 100 kHz high average power HR-1 laser system at the Extreme Light Infrastructure Attosecond Light Pulse Source (ELI ALPS) facility. Such a robust, versatile and stackable gas cell arrangement can easily be adapted to different experimental geometries in both table-top laboratory systems and user-oriented facilities, such as ELI ALPS.

preprint2022arXiv

Quantum Entanglement of Non-Hermitian Quasicrystals

As a hallmark of pure quantum effect, quantum entanglement has provided unconventional routes to condensed matter systems. Here, from the perspective of quantum entanglement, we disclose exotic quantum physics in non-Hermitian quasicrystals. We present a class of experimentally realizable models for non-Hermitian quasicrystal chains, in which asymmetric hopping and complex potential coexist. We diagnose global phase diagram by means of entanglement from both real-space and momentum-space partition. By measuring entanglement entropy, we numerically determine the metal-insulator transition point. We combine real-space and momentum-space entanglement spectra to complementarily characterize the delocalization phase and the localization phase. Inspired by entanglement spectrum, we further analytically prove that a duality exists between the two phase regions. The transition point is self-dual and exact, further validating the numerical result from diagonalizing non-Hermitian matrices. Finally, we identify mobility edge by means of entanglement.

preprint2022arXiv

Quantum Hydrodynamics of Fractonic Superfluids with Lineon Condensate: from Navier-Stokes-like Equations to Landau-like Criterion

Fractonic superfluids are exotic states of matter with spontaneously broken higher-rank $U(1)$ symmetry. The latter is associated with conserved quantities that include not only particle number (i.e. charge) but also higher moments, such as dipoles, quadrupoles, and angular moments. Due to the presence of such conserved quantities, the mobility of particles is restricted either completely or partially. In this work, we systematically study hydrodynamical properties of fractonic superfluids, especially focusing on the fractonic superfluids with conserved angular moments. The constituent bosons are called "lineons" with $d$-components in $d$-dimensional space. From Euler-Lagrange equation, we derive the continuity equation and Navier-Stokes-like equations, in which the angular moment conservation introduces extra terms. Furthermore, we discuss the current configurations that are related to the defects. Like the conventional superfluid, we study the critical values of velocity fields and density currents, which gives rise to a Landau-like criterion. At the end of this work, several future directions are discussed.

preprint2022arXiv

Ruleformer: Context-aware Differentiable Rule Mining over Knowledge Graph

Rule mining is an effective approach for reasoning over knowledge graph (KG). Existing works mainly concentrate on mining rules. However, there might be several rules that could be applied for reasoning for one relation, and how to select appropriate rules for completion of different triples has not been discussed. In this paper, we propose to take the context information into consideration, which helps select suitable rules for the inference tasks. Based on this idea, we propose a transformer-based rule mining approach, Ruleformer. It consists of two blocks: 1) an encoder extracting the context information from subgraph of head entities with modified attention mechanism, and 2) a decoder which aggregates the subgraph information from the encoder output and generates the probability of relations for each step of reasoning. The basic idea behind Ruleformer is regarding rule mining process as a sequence to sequence task. To make the subgraph a sequence input to the encoder and retain the graph structure, we devise a relational attention mechanism in Transformer. The experiment results show the necessity of considering these information in rule mining task and the effectiveness of our model.

preprint2022arXiv

Topological Orders, Braiding Statistics, and Mixture of Two Types of Twisted $BF$ Theories in Five Dimensions

Topological orders are a prominent paradigm for describing quantum many-body systems without symmetry-breaking orders. We present a topological quantum field theoretical (TQFT) study on topological orders in five-dimensional spacetime ($5$D) in which \textit{topological excitations} include not only point-like \textit{particles}, but also two types of spatially extended objects: closed string-like \textit{loops} and two-dimensional closed \textit{membranes}. Especially, membranes have been rarely explored in the literature of topological orders. By introducing higher-form gauge fields, we construct exotic TQFT actions that include mixture of two distinct types of $BF$ topological terms and many twisted topological terms. The gauge transformations are properly defined and utilized to compute level quantization and classification of TQFTs. Among all TQFTs, some are not in Dijkgraaf-Witten cohomological classification. To characterize topological orders, we concretely construct all braiding processes among topological excitations, which leads to very exotic links formed by closed spacetime trajectories of particles, loops, and membranes. For each braiding process, we construct gauge-invariant Wilson operators and calculate the associated braiding statistical phases. As a result, we obtain expressions of link invariants all of which have manifest geometric interpretation. Following Wen's definition, the boundary theory of a topological order exhibits gravitational anomaly. We expect that the characterization and classification of 5D topological orders in this paper encode information of 4D gravitational anomaly. Further consideration, e.g., putting TQFTs on 5D manifolds with boundaries, is left to future work.

preprint2020arXiv

Fracton physics of spatially extended excitations

Fracton topological order hosts fractionalized point-like excitations (e.g., fractons) that have restricted mobility. In this article, we explore even more bizarre realization of fracton phases that admit spatially extended excitations with restriction on both mobility and deformability. First, we present exactly solvable lattice quantum frustrated spin models and study their ground states and excited states analytically. We construct a family tree in which parent models and descendent models share excitation DNA. Second, with the help of solvability and novel excitation spectrum of these models, we initiate the first-step of general discussions on quantitative and qualitative properties of spatially extended excitations whose mobility and deformability are restricted to some extent. Especially, as a useful viewpoint for understanding such fracton-physics, all excitations are divided into four mutually distinct sectors, namely, simple excitations, complex excitations, intrinsically disconnected excitations, and trivial excitations. Several implications in, e.g., condensed matter physics and gravity are briefly discussed.

preprint2020arXiv

Generation of high-order harmonics with tunable photon energy and spectral width using double pulses

This work theoretically investigates high-order harmonic generation in rare gas atoms driven by two temporally delayed ultrashort laser pulses. Apart from their temporal delay, the two pulses are identical. Using a single-atom model of the laser-matter interaction it is shown that the photon energy of the generated harmonics is controllable within the range of one eV -- a bandwidth comparable to the photon energy of the fundamental field -- by varying the time delay between the generating laser pulses. It is also demonstrated that high-order harmonics generated by double pulses have advantageous characteristics, which mimick certain properties of an extreme ultraviolet (XUV) monochromator. With the proposed method, a simpler setup at a much lower cost and comparatively higher spectral yield can be implemented in contrast to other approaches.