Source author record

Yi Yao

Yi Yao appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.DG Artificial Intelligence cond-mat.mtrl-sci Computer Vision cond-mat.str-el cond-mat.supr-con Computation and Language Computational Geometry Human-Computer Interaction Machine Learning Neural and Evolutionary Computing physics.chem-ph physics.comp-ph

Catalog footprint

What is connected

14works

13topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL

The performance gap between closed-source and open-source large language models (LLMs) is largely attributed to disparities in access to high-quality training data. To bridge this gap, we introduce a novel framework for the automated synthesis of sophisticated, research-grade instructional data. Our approach centers on a multi-agent workflow where collaborative AI agents simulate complex tool-integrated reasoning to generate diverse and high-fidelity data end-to-end. Leveraging this synthesized data, we develop a two-stage training strategy that integrates supervised fine-tuning with a novel reinforcement learning method, designed to maximize model alignment and capability. Extensive experiments demonstrate that our framework empowers open-source models across multiple scales, enabling them to achieve new state-of-the-art performance on the major deep research benchmark. This work provides a scalable and effective pathway for advancing open-source LLMs without relying on proprietary data or models.

preprint2022arXiv

An Electronic Nematic Liquid in BaNi$_2$As$_2$

Understanding the organizing principles of interacting electrons and the emergence of novel electronic phases is a central endeavor of condensed matter physics. Electronic nematicity, in which the discrete rotational symmetry in the electron fluid is broken while the translational one remains unaffected, is a prominent example of such a phase. It has proven ubiquitous in correlated electron systems, and is of prime importance to understand Fe-based superconductors. Here, we find that fluctuations of such broken symmetry are exceptionally strong over an extended temperature range above phase transitions in \bnap, the nickel homologue to the Fe-based systems. This provides evidence for a type of electronic nematicity, dynamical in nature, which exhibits an unprecedented coupling to the underlying crystal lattice. Fluctuations between degenerate nematic configurations cause a splitting of phonon lines, without lifting degeneracies nor breaking symmetries, akin to spin liquids in magnetic systems.

preprint2022arXiv

Dynamics of Collective Modes in an unconventional Charge Density Wave system BaNi$_{2}$As$_{2}$

BaNi$_{2}$As$_{2}$ is a non-magnetic analogue of BaFe$_{2}$As$_{2}$, the parent compound of a prototype pnictide high-temperature superconductor, displaying superconductivity already at ambient pressure. Recent diffraction studies demonstrated the existence of two types of periodic lattice distortions above and below the triclinic phase transition, suggesting the existence of an unconventional charge-density-wave (CDW) order. The suppression of CDW order upon doping results in a sixfold increase in the superconducting transition temperature and enhanced nematic fluctuations, suggesting CDW is competing with superconductivity. Here, we apply time-resolved optical spectroscopy to investigate collective dynamics in BaNi$_{2}$As$_{2}$. We demonstrate the existence of several CDW amplitude modes. Their smooth evolution through the structural phase transition implies the commensurate CDW order in the triclinic phase evolves from the high-temperature unidirectional incommensurate CDW, and may indeed trigger the structural phase transition. Excitation density dependence reveals exceptional resilience of CDW against perturbation, implying an unconventional origin of the underlying electronic instability.

preprint2022arXiv

Electron dynamics in extended systems within real-time time-dependent density functional theory

Due to a beneficial balance of computational cost and accuracy, real-time time-dependent density functional theory has emerged as a promising first-principles framework to describe electron real-time dynamics. Here we discuss recent implementations around this approach, in particular in the context of complex, extended systems. Results include an analysis of the computational cost associated with numerical propagation and when using absorbing boundary conditions. We extensively explore the shortcomings for describing electron-electron scattering in real time and compare to many-body perturbation theory. Modern improvements of the description of exchange and correlation are reviewed. In this work, we specifically focus on the Qb@ll code, which we have mainly used for these types of simulations over the last years, and we conclude by pointing to further progress needed going forward.

preprint2022arXiv

Relative Ding Stability and an Obstruction to the Existence of Mabuchi Solitons

Mabuchi solitons generalize Kähler-Einstein metrics on Fano manifolds, which constitute a Yau-Tian-Donaldson type correspondence with relative Ding stability. Comparing with Kähler-Ricci solitons, there is a distinct necessary condition for the existence. We show this condition can be implied by the uniformly relative Ding stability. For this we study the inner product of $\mathbb{C}^{*}$-actions on equivariant test-configurations and obtain an integration formula over the total space. To analyze the uniform stability, by adapting Okounkov body construction to the setting of torus action, we give a convex-geometry description for the reduced non-Archimedean J-functionals.

preprint2022arXiv

Trigger Hunting with a Topological Prior for Trojan Detection

Despite their success and popularity, deep neural networks (DNNs) are vulnerable when facing backdoor attacks. This impedes their wider adoption, especially in mission critical applications. This paper tackles the problem of Trojan detection, namely, identifying Trojaned models -- models trained with poisoned data. One popular approach is reverse engineering, i.e., recovering the triggers on a clean image by manipulating the model's prediction. One major challenge of reverse engineering approach is the enormous search space of triggers. To this end, we propose innovative priors such as diversity and topological simplicity to not only increase the chances of finding the appropriate triggers but also improve the quality of the found triggers. Moreover, by encouraging a diverse set of trigger candidates, our method can perform effectively in cases with unknown target labels. We demonstrate that these priors can significantly improve the quality of the recovered triggers, resulting in substantially improved Trojan detection accuracy as validated on both synthetic and publicly available TrojAI benchmarks.

preprint2021arXiv

All-electron periodic $G_0W_0$ implementation with numerical atomic orbital basis functions: algorithm and benchmarks

We present an all-electron, periodic {\GnWn} implementation within the numerical atomic orbital (NAO) basis framework. A localized variant of the resolution-of-the-identity (RI) approximation is employed to significantly reduce the computational cost of evaluating and storing the two-electron Coulomb repulsion integrals. We demonstrate that the error arising from localized RI approximation can be reduced to an insignificant level by enhancing the set of auxiliary basis functions, used to expand the products of two single-particle NAOs. An efficient algorithm is introduced to deal with the Coulomb singularity in the Brillouin zone sampling that is suitable for the NAO framework. We perform systematic convergence tests and identify a set of computational parameters, which can serve as the default choice for most practical purposes. Benchmark calculations are carried out for a set of prototypical semiconductors and insulators, and compared to independent reference values obtained from an independent $G_0W_0$ implementation based on linearized augmented plane waves (LAPW) plus high-energy localized orbitals (HLOs) basis set, as well as experimental results. With a moderate (FHI-aims \textit{tier} 2) NAO basis set, our $G_0W_0$ calculations produce band gaps that typically lie in between the standard LAPW and the LAPW+HLO results. Complementing \textit{tier} 2 with highly localized Slater-type orbitals (STOs), we find that the obtained band gaps show an overall convergence towards the LAPW+HLO results. The algorithms and techniques developed in this work pave the way for efficient implementations of correlated methods within the NAO framework.

preprint2020arXiv

A Study on Multimodal and Interactive Explanations for Visual Question Answering

Explainability and interpretability of AI models is an essential factor affecting the safety of AI. While various explainable AI (XAI) approaches aim at mitigating the lack of transparency in deep networks, the evidence of the effectiveness of these approaches in improving usability, trust, and understanding of AI systems are still missing. We evaluate multimodal explanations in the setting of a Visual Question Answering (VQA) task, by asking users to predict the response accuracy of a VQA agent with and without explanations. We use between-subjects and within-subjects experiments to probe explanation effectiveness in terms of improving user prediction accuracy, confidence, and reliance, among other factors. The results indicate that the explanations help improve human prediction accuracy, especially in trials when the VQA system's answer is inaccurate. Furthermore, we introduce active attention, a novel method for evaluating causal attentional effects through intervention by editing attention maps. User explanation ratings are strongly correlated with human prediction accuracy and suggest the efficacy of these explanations in human-machine AI collaboration tasks.

preprint2020arXiv

Progressive Growing of Neural ODEs

Neural Ordinary Differential Equations (NODEs) have proven to be a powerful modeling tool for approximating (interpolation) and forecasting (extrapolation) irregularly sampled time series data. However, their performance degrades substantially when applied to real-world data, especially long-term data with complex behaviors (e.g., long-term trend across years, mid-term seasonality across months, and short-term local variation across days). To address the modeling of such complex data with different behaviors at different frequencies (time spans), we propose a novel progressive learning paradigm of NODEs for long-term time series forecasting. Specifically, following the principle of curriculum learning, we gradually increase the complexity of data and network capacity as training progresses. Our experiments with both synthetic data and real traffic data (PeMS Bay Area traffic data) show that our training methodology consistently improves the performance of vanilla NODEs by over 64%.

preprint2020arXiv

The Impact of Explanations on AI Competency Prediction in VQA

Explainability is one of the key elements for building trust in AI systems. Among numerous attempts to make AI explainable, quantifying the effect of explanations remains a challenge in conducting human-AI collaborative tasks. Aside from the ability to predict the overall behavior of AI, in many applications, users need to understand an AI agent's competency in different aspects of the task domain. In this paper, we evaluate the impact of explanations on the user's mental model of AI agent competency within the task of visual question answering (VQA). We quantify users' understanding of competency, based on the correlation between the actual system performance and user rankings. We introduce an explainable VQA system that uses spatial and object features and is powered by the BERT language model. Each group of users sees only one kind of explanation to rank the competencies of the VQA model. The proposed model is evaluated through between-subject experiments to probe explanations' impact on the user's perception of competency. The comparison between two VQA models shows BERT based explanations and the use of object features improve the user's prediction of the model's competencies.

preprint2016arXiv

Diffusion Quantum Monte Carlo Study of Martensitic Phase Transition: The Case of Phosphorene

Recent technical advances in dealing with finite-size errors make quantum Monte Carlo methods quite appealing for treating extended systems in electronic structure calculations, especially when commonly-used density functional theory (DFT) methods might not be satisfactory. We present a theoretical study of martensitic phase transition of a two-dimensional phosphorene by employing diffusion Monte Carlo (DMC) approach to investigate the energetics of this phase transition. The DMC calculation supports DFT prediction of having a rather diffusive barrier that is characterized by having two transition states, in addition to confirming that the so-called black and blue phases of phosphorene are essentially degenerate. At the same time, the calculation shows the importance of treating correlation energy accurately for describing the energy changes in the martensitic phase transition, as is already widely appreciated for chemical bond formation/dissociation. Building on the atomistic characterization of the phase transition process, we also discuss how mechanical strain influences the stabilities of the two phases of phosphorene.

preprint2016arXiv

Greatest lower bounds on Ricci curvature of homogeneous toric bundles

For Fano homogeneous toric bundles, we obtain a formula of the greatest lower bound on Ricci curvature. We also give a criteria for the ampleness of a kind of line bundles over general homogeneous toric bundles.

preprint2014arXiv

A criterion for the properness of the K-energy in a general Kahler class

In this paper, we give a criterion for the properness of the K-energy in a general Kahler class of a compact Kahler manifold by using Song-Weinkove's result. As applications, we give some Kahler classes on $\mathbb{C}\mathbb{P}^2\#3\overline {\mathbb{C}\mathbb{P}^2}$ and $\mathbb{C}\mathbb{P}^2\#8\overline {\mathbb{C}\mathbb{P}^2}$ in which the K-energy is proper. Finally, we prove Song-Weinkove's result on the existence of critical points of $\hat J$ functional by the continuity method.

preprint2014arXiv

The J-flow On Toric Manifolds

We study the J-flow on the toric manifolds, through study the transition map between the moment maps induced by two Kähler metrics, which is a diffeomorphism between polytopes. This is similar to the work of Fang-Lai, under the assumption of Calabi symmetry, they study the monotone map between two intervals. We get a partial bound of the derivatives of transition map.

Yi Yao

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL

An Electronic Nematic Liquid in BaNi$_2$As$_2$

Dynamics of Collective Modes in an unconventional Charge Density Wave system BaNi$_{2}$As$_{2}$

Electron dynamics in extended systems within real-time time-dependent density functional theory

Relative Ding Stability and an Obstruction to the Existence of Mabuchi Solitons

Trigger Hunting with a Topological Prior for Trojan Detection

All-electron periodic $G_0W_0$ implementation with numerical atomic orbital basis functions: algorithm and benchmarks

A Study on Multimodal and Interactive Explanations for Visual Question Answering

Progressive Growing of Neural ODEs

The Impact of Explanations on AI Competency Prediction in VQA

Diffusion Quantum Monte Carlo Study of Martensitic Phase Transition: The Case of Phosphorene

Greatest lower bounds on Ricci curvature of homogeneous toric bundles

A criterion for the properness of the K-energy in a general Kahler class

The J-flow On Toric Manifolds