Researcher profile

Yong Hu

Yong Hu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

Common sublattice-pure van Hove singularities in the kagome superconductors $\textit{A}$V$_{3}$Sb$_{5}$ ($\textit{A}$ = K, Rb, Cs)

Kagome materials offer a versatile platform for exploring correlated and topological quantum states, where van Hove singularities (VHSs) play a pivotal role in driving electronic instabilities, exhibiting distinct behaviors depending on electron filling and interaction settings. In the recently discovered kagome superconductors $\textit{A}$V$_{3}$Sb$_{5}$ ($\textit{A}$ = K, Rb, Cs), unconventional charge density wave order, superconductivity, and electronic chirality emerge, yet the nature of VHSs near the Fermi level ($\textit{E}$$_{F}$) and their connection to these exotic orders remain elusive. Here, using high-resolution polarization-dependent angle-resolved photoemission spectroscopy, we uncover a universal electronic structure across $\textit{A}$V$_{3}$Sb$_{5}$ that is distinct from density-functional theory predictions that show noticeable discrepancies. We identify multiple common sublattice-pure VHSs near $\textit{E}$$_{F}$, arising from strong V-$\textit{d}$/Sb-$\textit{p}$ hybridization, which significantly promote bond-order fluctuations and likely drive the observed charge density wave order. These findings provide direct spectroscopic evidence for hybridization-driven VHS formation in kagome metals and establish a unified framework for understanding the intertwined electronic instabilities in $\textit{A}$V$_{3}$Sb$_{5}$.

preprint2026arXiv

SCOUT: Active Information Foraging for Long-Text Understanding with Decoupled Epistemic States

Long-Text Understanding (LTU) at million-token scale requires balancing reasoning fidelity with computational efficiency. Frontier long-context LLMs can process millions of token contexts end-to-end, but they suffer from high token consumption and attention dilution. In parallel, specialized LTU agents often sacrifice fidelity through task-agnostic abstractions like graph construction or indexing. We identify a key insight for LTU: query-relevant information is typically sparse relative to the full document, so effective reasoning should rely on a query-sufficient subset rather than the entire context. To address this, we propose SCOUT, a new paradigm for LTU that shifts from passive processing to active information foraging. It treats the document as an explorable environment and answers from a compact, provenance-grounded epistemic state. Guided by state-level gap diagnosis, SCOUT adaptively alternates between coarse-to-fine exploration and anchored state updates that progressively contract its epistemic state toward query sufficiency. Experiments show that SCOUT matches state-of-the-art proprietary models while reducing token consumption by up to 8x. Moreover, SCOUT remains stable as context length scales, substantially alleviating the practical cost-performance trade-off.

preprint2025arXiv

SagaScale: A Realistic, Scalable, and High-Quality Long-Context Benchmark Built from Full-Length Novels

Large Language Models (LLMs) have shown significant progress, but understanding long and complex documents remains challenging. Many long-context benchmarks have been proposed, but they face several limitations, including task realism, data scalability, and data quality. To this end, we introduce SagaScale, a realistic, scalable, and high-quality long-context benchmark built from full-length novels. The entire benchmark is constructed using an automated data collection pipeline that utilizes external resources (e.g., Wikipedia pages) to curate question-answer pairs. Critically, these external resources are provided only for benchmark construction and not during evaluation, which allows LLMs to curate complex questions that go beyond what they can answer during evaluation. SagaScale is also bilingual and offers the largest context length to date, with average token counts exceeding 250K for English novels and 320K for Chinese novels. Our evaluation across 12 frontier LLMs and three long-context methods -- Naïve RAG, Agentic RAG, and Long Context -- yields key insights, including: (1) Directly supplying the full context to the LLM can outperform other methods by a large margin; (2) Most LLMs still struggle with lengthy contexts, but Gemini-2.5-Pro stands out as an exception; and (3) Agentic RAG effectively addresses the retrieval bottleneck in Naïve RAG. Finally, we publicly release the SagaScale benchmark and our data collection codebase to facilitate future research.

preprint2022arXiv

Algebraic threefolds of general type with small volume

It is known that the optimal Noether inequality $\mathrm{vol}(X) \ge \frac{4}{3}p_g(X) - \frac{10}{3}$ holds for every $3$-fold $X$ of general type with $p_g(X) \ge 11$. In this paper, we give a complete classification of $3$-folds $X$ of general type with $p_g(X) \ge 11$ satisfying the above equality by giving the explicit structure of a relative canonical model of $X$. This model coincides with the canonical model of $X$ when $p_g(X) \ge 23$. We also establish the second and third optimal Noether inequalities for $3$-folds $X$ of general type with $p_g(X) \ge 11$. These results answer two open questions raised by J. Chen, M. Chen and C. Jiang, and in dimension three an open question raised by J. Chen and C. Lai. A novel phenomenon shows that there is a one-to-one correspondence between the three Noether inequalities and three possible residues of $p_g(X)$ modulo $3$.

preprint2022arXiv

giMLPs: Gate with Inhibition Mechanism in MLPs

This paper presents a new model architecture, gate with inhibition MLP (giMLP).The gate with inhibition on CycleMLP (gi-CycleMLP) can produce equal performance on the ImageNet classification task, and it also improves the BERT, Roberta, and DeBERTaV3 models depending on two novel techniques. The first is the gating MLP, where matrix multiplications between the MLP and the trunk Attention input in further adjust models' adaptation. The second is inhibition which inhibits or enhances the branch adjustment, and with the inhibition levels increasing, it offers models more muscular features restriction. We show that the giCycleMLP with a lower inhibition level can be competitive with the original CycleMLP in terms of ImageNet classification accuracy. In addition, we also show through a comprehensive empirical study that these techniques significantly improve the performance of fine-tuning NLU downstream tasks. As for the gate with inhibition MLPs on DeBERTa (giDeBERTa) fine-tuning, we find it can achieve appealing results on most parts of NLU tasks without any extra pretraining again. We also find that with the use of Gate With Inhibition, the activation function should have a short and smooth negative tail, with which the unimportant features or the features that hurt models can be moderately inhibited. The experiments on ImageNet and twelve language downstream tasks demonstrate the effectiveness of Gate With Inhibition, both for image classification and for enhancing the capacity of nature language fine-tuning without any extra pretraining.

preprint2022arXiv

Noether-Severi inequality and equality for irregular threefolds of general type

We prove the optimal Noether-Severi inequality that $\mathrm{vol}(X) \ge \frac{4}{3} χ(ω_{X})$ for all smooth and irregular $3$-folds $X$ of general type over $\mathbb{C}$. For those $3$-folds $X$ attaining the equality, we completely describe their canonical models and show that the topological fundamental group $π_1(X) \simeq \mathbb{Z}^2$. As a corollary, we obtain for the same $X$ another optimal inequality that $\mathrm{vol}(X) \ge \frac{4}{3}h^0_a(X, K_X)$ where $h^0_a(X, K_X)$ stands for the continuous rank of $K_X$, and we show that $X$ attains this equality if and only if $\mathrm{vol}(X) = \frac{4}{3}χ(ω_{X})$.

preprint2022arXiv

Relative Severi inequality for fibrations of maximal Albanese dimension over curves

Let $f: X \to B$ be a relatively minimal fibration of maximal Albanese dimension from a variety $X$ of dimension $n \ge 2$ to a curve $B$ defined over an algebraically closed field of characteristic zero. We prove that $K_{X/B}^n \ge 2n! χ_f$, which was conjectured by Barja in [2]. Via the strategy outlined in [5], it also leads to a new proof of the Severi inequality for varieties of maximal Albanese dimension. Moreover, when the equality holds and $χ_f > 0$, we prove that the general fiber $F$ of $f$ has to satisfy the Severi equality that $K_F^{n-1} = 2(n-1)! χ(F, ω_F)$. We also prove some sharper results of the same type under extra assumptions.

preprint2021arXiv

Charge-order-assisted topological surface states and flat bands in the kagome superconductor CsV$_3$Sb$_5$

The diversity of emergent phenomena in quantum materials often arises from the interplay between different physical energy scales or broken symmetries. Cooperative interactions among them are rare; however, when they do occur, they often stabilize fundamentally new ground states or phase behaviors. For instance, a pair density wave can form when the superconducting order parameter borrows spatial periodical variation from charge order; a topological superconductor can arise when topologically nontrivial electronic states proximitize with or participate in the formation of the superconducting condensate. Here, we report spectroscopic evidence for a unique synergy of topology and correlation effects in the kagome superconductor CsV$_3$Sb$_5$ - one where topologically nontrivial surface states are pushed below the Fermi energy (E$_F$) by charge order, making the topological physics active near E$_F$ upon entering the superconducting state. Flat bands are observed, indicating that electron correlation effects are also at play in this system. Our results reveal the peculiar electronic structure of CsV$_3$Sb$_5$, which holds the potential for realizing Majorana zero modes and anomalous superconducting states in kagome lattices. They also establish CsV$_3$Sb$_5$ as a unique platform for exploring the cooperation between the charge order, topology, correlation effects and superconductivity.

preprint2021arXiv

Rich Nature of Van Hove Singularities in Kagome Superconductor CsV$_3$Sb$_5$

The recently discovered layered kagome metals AV$_3$Sb$_5$ (A=K, Rb, Cs) exhibit diverse correlated phenomena, which are intertwined with a topological electronic structure with multiple van Hove singularities (VHSs) in the vicinity of the Fermi level. As the VHSs with their large density of states enhance correlation effects, it is of crucial importance to determine their nature and properties. Here, we combine polarization-dependent angle-resolved photoemission spectroscopy with density functional theory to directly reveal the sublattice properties of 3d-orbital VHSs in CsV$_3$Sb$_5$. Four VHSs are identified around the M point and three of them are close to the Fermi level, with two having sublattice-pure and one sublattice-mixed nature. Remarkably, the VHS just below the Fermi level displays an extremely flat dispersion along MK, establishing the experimental discovery of higher-order VHS. The characteristic intensity modulation of Dirac cones around K further demonstrates the sublattice interference embedded in the electronic structure. The crucial insights into the electronic structure, revealed by our work, provide a solid starting point for the understanding of the intriguing correlation phenomena in the kagome metals AV$_3$Sb$_5$.

preprint2020arXiv

CsV$_3$Sb$_5$: a $\mathbb{Z}_2$ topological kagome metal with a superconducting ground state

Recently discovered alongside its sister compounds KV$_3$Sb$_5$ and RbV$_3$Sb$_5$, CsV$_3$Sb$_5$ crystallizes with an ideal kagome network of vanadium and antimonene layers separated by alkali metal ions. This work presents the electronic properties of CsV$_3$Sb$_5$, demonstrating bulk superconductivity in single crystals with a T$_{c} = 2.5$K. The normal state electronic structure is studied via angle-resolved photoemission spectroscopy (ARPES) and density functional theory (DFT), which categorize CsV$_3$Sb$_5$ as a $\mathbb{Z}_2$ topological metal. Multiple protected Dirac crossings are predicted in close proximity to the Fermi level ($E_F$), and signatures of normal state correlation effects are also suggested by a high temperature charge density wave-like instability. The implications for the formation of unconventional superconductivity in this material are discussed.

preprint2020arXiv

Multi-task Learning for Low-resource Second Language Acquisition Modeling

Second language acquisition (SLA) modeling is to predict whether second language learners could correctly answer the questions according to what they have learned. It is a fundamental building block of the personalized learning system and has attracted more and more attention recently. However, as far as we know, almost all existing methods cannot work well in low-resource scenarios due to lacking of training data. Fortunately, there are some latent common patterns among different language-learning tasks, which gives us an opportunity to solve the low-resource SLA modeling problem. Inspired by this idea, in this paper, we propose a novel SLA modeling method, which learns the latent common patterns among different language-learning datasets by multi-task learning and are further applied to improving the prediction performance in low-resource scenarios. Extensive experiments show that the proposed method performs much better than the state-of-the-art baselines in the low-resource scenario. Meanwhile, it also obtains improvement slightly in the non-low-resource scenario.

preprint2020arXiv

Non-Abelian Aharonov-Bohm Caging in Photonic Lattices

Aharonov-Bohm (AB) caging is the localization effect in translational-invariant lattices due to destructive interference induced by penetrated magnetic fields. While current research focuses mainly on the case of Abelian AB caging, here we go beyond and develop the non-Abelian AB caging concept by considering the particle localization in a 1D multi-component rhombic lattice with non-Abelian background gauge field. In contrast to its Abelian counterpart, the non-Abelian AB cage depends on both the form of the nilpotent interference matrix and the initial state of the lattice. This phenomena is the consequence of the non-Abelian nature of the gauge potential and thus has no Abelian analog. We further propose a circuit quantum electrodynamics realization of the proposed physics, in which the required non-Abelian gauge field can be synthesized by the parametric conversion method, and the non-Abelian AB caging can be unambiguously demonstrated through the pumping and the steady-state measurements of only a few sites on the lattice. Requiring only currently available technique, our proposal can be readily tested in experiment and may pave a new route towards the investigation of exotic photonic quantum fluids.

preprint2020arXiv

Out-of-time-order correlators and quantum phase transitions in the Rabi and Dicke model

The out-of-time-order correlators (OTOCs) is used to study the quantum phase transitions (QPTs) between the normal phase and the superradiant phase in the Rabi and few-body Dicke models with large frequency ratio of theatomic level splitting to the single-mode electromagnetic radiation field frequency. The focus is on the OTOC thermally averaged with infinite temperature, which is an experimentally feasible quantity. It is shown that thecritical points can be identified by long-time averaging of the OTOC via observing its local minimum behavior. More importantly, the scaling laws of the OTOC for QPTs are revealed by studying the experimentally accessible conditions with finite frequency ratio and finite number of atoms in the studied models. The critical exponents extracted from the scaling laws of OTOC indicate that the QPTs in the Rabi and Dicke models belong to the same universality class.

preprint2019arXiv

Insulating Parent Phase and Distinct Doping Evolution to Superconductivity in Single-Layer FeSe/SrTiO3 Films

The single-layer FeSe/SrTiO3 (FeSe/STO) films have attracted much attention because of their simple crystal structure, distinct electronic structure and record high superconducting transition temperature (Tc). The origin of the dramatic Tc enhancement in single-layer FeSe/STO films and the dichotomy of superconductivity between single-layer and multiple-layer FeSe/STO films are still under debate. Here we report a comprehensive high resolution angle-resolved photoemission spectroscopy and scanning tunneling microscopy/spectroscopy measurements on the electronic structure evolution with doping in single-layer and multiple-layer FeSe/STO films. We find that the single-layer FeSe/STO films have a distinct parent phase and a route of doping evolution to superconductivity that are fundamentally different from multiple-layer FeSe/STO films. The parent phase of the single-layer FeSe/STO films is insulating, and its doping evolution is very similar to that of doping a Mott insulator in cuprate superconductors. In multiple-layer FeSe/STO films, high-Tc superconductivity occurs by suppressing the nematic order in the parent compound with electron doping. The single-layer FeSe/STO films represent the first clear case in the iron-based superconductors that the parent compound is an insulator. Our observations of the unique parent state and doping evolution in the single-layer FeSe/STO films provide key insight in understanding its record high-Tc superconductivity. They also provide a new route of realizing superconductivity in iron-based superconductors that is common in high temperature cuprate superconductors.

preprint2019arXiv

Reduced norms of division algebras over complete discrete valuation fields of local-global type

Let $F$ be a complete discrete valuation field whose residue field $k$ is a global field of positive characteristic $p$. Let $D$ be a central division $F$-algebra of $p$-power degree. We prove that the subgroup of $F^*$ consisting of reduced norms of $D$ is exactly the kernel of the cup product map $λ\in F^*\mapsto (D)\cup(λ)\in H^3(F,\,\mathbb{Q}_{p}/\Z_{p}(2))$, if either $D$ is tamely ramified or of period $p$. This gives a $p$-torsion counterpart of a recent theorem of Parimala, Preeti and Surech, where the same result is proved for division algebras of prime-to-$p$ degree.

preprint2019arXiv

Universal gapless Dirac cone and tunable topological states in (MnBi$_2$Te$_4$)$_m$(Bi$_2$Te$_3$)$_n$ heterostructures

In the newly discovered magnetic topological insulator MnBi$_2$Te$_4$, both axion insulator state and quantized anomalous Hall effect (QAHE) have been observed by tuning the magnetic structure. The related (MnBi$_2$Te$_4$)$_m$(Bi$_2$Te$_3$)$_n$ heterostructures with increased tuning knobs, are predicted to be a more versatile platform for exotic topological states. Here, we report angle-resolved photoemission spectroscopy (ARPES) studies on a series of the heterostructures (MnBi$_2$Te$_4$, MnBi$_4$Te$_7$ and MnBi$_6$Te$_{10}$). A universal gapless Dirac cone is observed at the MnBi$_2$Te$_4$ terminated (0001) surfaces in all systems. This is in sharp contrast to the expected gap from the original antiferromagnetic ground state, indicating an altered magnetic structure near the surface, possibly due to the surface termination. In the meantime, the electron band dispersion of the surface states, presumably dominated by the top surface, is found to be sensitive to different stackings of the underlying MnBi$_2$Te$_4$ and Bi$_2$Te$_3$ layers. Our results suggest the high tunability of both magnetic and electronic structures of the topological surface states in (MnBi$_2$Te$_4$)$_m$(Bi$_2$Te$_3$)$_n$ heterostructures, which is essential in realizing various novel topological states.