Source author record

Yi Sui

Yi Sui appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language Machine Learning cond-mat.mtrl-sci cond-mat.soft Multiagent Systems physics.ao-ph physics.comp-ph physics.space-ph

Catalog footprint

What is connected

5works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge

Retrieval-augmented generation (RAG) aims to mitigate the hallucination of Large Language Models (LLMs) by retrieving and incorporating relevant external knowledge into the generation process. However, the external knowledge may contain noise and conflict with the parametric knowledge of LLMs, leading to degraded performance. Current LLMs lack inherent mechanisms for resolving such conflicts. To fill this gap, we propose a Dual-Stream Knowledge-Augmented Framework for Shared-Private Semantic Synergy (DSSP-RAG). Central to it is the refinement of the traditional self-attention into a mixed-attention that distinguishes shared and private semantics for a controlled knowledge integration. An unsupervised hallucination detection method that captures the LLMs' intrinsic cognitive uncertainty ensures that external knowledge is introduced only when necessary. To reduce noise in external knowledge, an Energy Quotient (EQ), defined by attention difference matrices between task-aligned and task-misaligned layers, is proposed. Extensive experiments show that DSSP-RAG achieves a superior performance over strong baselines.

preprint2026arXiv

Classifying and Addressing the Diversity of Errors in Retrieval-Augmented Generation Systems

Retrieval-augmented generation (RAG) is a prevalent approach for building LLM-based question-answering systems that can take advantage of external knowledge databases. Due to the complexity of real-world RAG systems, there are many potential causes for erroneous outputs. Understanding the range of errors that can occur in practice is crucial for robust deployment. We present a new taxonomy of the error types that can occur in realistic RAG systems, examples of each, and practical advice for addressing them. Additionally, we curate a dataset of erroneous RAG responses annotated by error types. We then propose an auto-evaluation method aligned with our taxonomy that can be used in practice to track and address errors during development. Code and data are available at https://github.com/layer6ai-labs/rag-error-classification.

preprint2026arXiv

Conformal Agent Error Attribution

When multi-agent systems (MAS) fail, identifying where the decisive error occurred is the first step for automated recovery to an earlier state. Error attribution remains a fundamental challenge due to the long interaction traces that large language model-based MAS generate. This paper presents a framework for error attribution based on conformal prediction (CP) which provides finite-sample, distribution-free coverage guarantees. We introduce new algorithms for filtration-based CP designed for sequential data such as agent trajectories. Unlike existing CP algorithms, our approach predicts sets that are contiguous sequences to enable efficient recovery and debugging. We verify our theoretical guarantees on a variety of agents and datasets, show that errors can be precisely isolated, then use prediction sets to rollback MAS to correct their own errors. Our overall approach is model-agnostic, and offers a principled uncertainty layer for MAS error attribution. We release code at https://github.com/layer6ai-labs/conformal-agent-error-attribution.

preprint2015arXiv

An Eulerian projection method for quasi-static elastoplasticity

A well-established numerical approach to solve the Navier--Stokes equations for incompressible fluids is Chorin's projection method, whereby the fluid velocity is explicitly updated, and then an elliptic problem for the pressure is solved, which is used to orthogonally project the velocity field to maintain the incompressibility constraint. In this paper, we develop a mathematical correspondence between Newtonian fluids in the incompressible limit and hypo-elastoplastic solids in the slow, quasi-static limit. Using this correspondence, we formulate a new fixed-grid, Eulerian numerical method for simulating quasi-static hypo-elastoplastic solids, whereby the stress is explicitly updated, and then an elliptic problem for the velocity is solved, which is used to orthogonally project the stress to maintain the quasi-staticity constraint. We develop a finite-difference implementation of the method and apply it to an elasto-viscoplastic model of a bulk metallic glass based on the shear transformation zone theory. We show that in a two-dimensional plane strain simple shear simulation, the method is in quantitative agreement with an explicit method. Like the fluid projection method, it is efficient and numerically robust, making it practical for a wide variety of applications. We also demonstrate that the method can be extended to simulate objects with evolving boundaries. We highlight a number of correspondences between incompressible fluid mechanics and quasi-static elastoplasticity, creating possibilities for translating other numerical methods between the two classes of physical problems.

preprint2013arXiv

Buoyancy storms in a zonal stream on the polar beta-plane: experiments with altimetry

Results from a new series of experiments on flows generated by localized heating in the presence of a background zonal current on the polar beta-plane are presented. The flow induced by a heater without the background zonal flow is in the form of a beta-plume. Zonal jets of alternating directions are formed within the plume. The westward transport velocity in the plume is proportional to the upwelling velocity above the heater in agreement with linear theory. When the background flow in the form of the eastward zonal current is present, the beta-plume can be overwhelmed by the eastward current. The main control parameters of the experiment are the strength of the heater and strength of the sink which is used to create the background flow. The regime diagram shows the area where a beta-plume can exist in the parameter space. The critical value of the velocity of the zonal flow below which the beta-plume can exist is obtained by considering barotropic Rossby waves emitted by the baroclinic eddies in the heated area.

Yi Sui

What is connected

Connect this record

See the researcher in context

Building this map preview

5 published item(s)

Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge

Classifying and Addressing the Diversity of Errors in Retrieval-Augmented Generation Systems

Conformal Agent Error Attribution

An Eulerian projection method for quasi-static elastoplasticity

Buoyancy storms in a zonal stream on the polar beta-plane: experiments with altimetry