Researcher profile

Rui Xu

Rui Xu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

Attribution-Guided Continual Learning for Large Language Models

Large language models (LLMs) often suffer from catastrophic forgetting in continual learning: after learning new tasks sequentially, they perform worse on earlier tasks. Existing methods mitigate catastrophic forgetting by data replay, parameter freezing, or regularization. However, these methods lack semantic awareness of internal knowledge distribution in LLMs. As a result, they cannot distinguish parameters that should be preserved or updated. We propose an attribution-guided continual fine-tuning framework for LLMs. Our method estimates task-specific, element-wise parameter importance in each Transformer layer and uses these scores to modulate gradients. Parameters important to previous tasks receive smaller updates, while less relevant ones remain plastic for learning new tasks. Experiments on continual learning benchmarks show that our method consistently outperforms baselines, achieving better retention of old tasks while maintaining competitive performance on new tasks.

preprint2026arXiv

CoSER: A Comprehensive Literary Dataset and Framework for Training and Evaluating LLM Role-Playing and Persona Simulation

Role-playing language agents (RPLAs) have emerged as promising applications of large language models (LLMs). However, simulating established characters presents a challenging task for RPLAs, due to the lack of authentic character datasets and nuanced evaluation methods using such data. In this paper, we present CoSER, a collection of a high-quality dataset, open models, and an evaluation protocol towards effective RPLAs of established characters. The CoSER dataset covers 17,966 characters from 771 renowned books. It provides authentic dialogues with real-world intricacies, as well as diverse data types such as conversation setups, character experiences and internal thoughts. Drawing from acting methodology, we introduce given-circumstance acting for training and evaluating role-playing LLMs, where LLMs sequentially portray multiple characters in book scenes. Using our dataset, we develop CoSER 8B and CoSER 70B, i.e., advanced open role-playing LLMs built on LLaMA-3.1 models. Extensive experiments demonstrate the value of the CoSER dataset for RPLA training, evaluation and retrieval. Moreover, CoSER 70B exhibits state-of-the-art performance surpassing or matching GPT-4o on our evaluation and three existing benchmarks, i.e., achieving 75.80% and 93.47% accuracy on the InCharacter and LifeChoice benchmarks respectively.

preprint2026arXiv

EgoReAct: Egocentric Video-Driven 3D Human Reaction Generation

Humans exhibit adaptive, context-sensitive responses to egocentric visual input. However, faithfully modeling such reactions from egocentric video remains challenging due to the dual requirements of strictly causal generation and precise 3D spatial alignment. To tackle this problem, we first construct the Human Reaction Dataset (HRD) to address data scarcity and misalignment by building a spatially aligned egocentric video-reaction dataset, as existing datasets (e.g., ViMo) suffer from significant spatial inconsistency between the egocentric video and reaction motion, e.g., dynamically moving motions are always paired with fixed-camera videos. Leveraging HRD, we present EgoReAct, the first autoregressive framework that generates 3D-aligned human reaction motions from egocentric video streams in real-time. We first compress the reaction motion into a compact yet expressive latent space via a Vector Quantised-Variational AutoEncoder and then train a Generative Pre-trained Transformer for reaction generation from the visual input. EgoReAct incorporates 3D dynamic features, i.e., metric depth, and head dynamics during the generation, which effectively enhance spatial grounding. Extensive experiments demonstrate that EgoReAct achieves remarkably higher realism, spatial consistency, and generation efficiency compared with prior methods, while maintaining strict causality during generation. We will release code, models, and data upon acceptance.

preprint2026arXiv

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Financial agents powered by large language models (LLMs) are increasingly deployed for investment analysis, risk assessment, and automated decision-making, where their abilities to plan, invoke tools, and manipulate mutable state introduce new security risks in high-stakes and highly regulated financial environments. However, existing safety evaluations largely focus on language-model-level content compliance or abstract agent settings, failing to capture execution-grounded risks arising from real operational workflows and state-changing actions. To bridge this gap, we propose FinVault, the first execution-grounded security benchmark for financial agents, comprising 31 regulatory case-driven sandbox scenarios with state-writable databases and explicit compliance constraints, together with 107 real-world vulnerabilities and 963 test cases that systematically cover prompt injection, jailbreaking, financially adapted attacks, as well as benign inputs for false-positive evaluation. Experimental results reveal that existing defense mechanisms remain ineffective in realistic financial agent settings, with average attack success rates (ASR) still reaching up to 50.0\% on state-of-the-art models and remaining non-negligible even for the most robust systems (ASR 6.7\%), highlighting the limited transferability of current safety designs and the need for stronger financial-specific defenses. Our code can be found at https://github.com/aifinlab/FinVault.

preprint2026arXiv

Intertwined atomic-nanoscale-microscale structures via intralayer anisotropic Fe-chains in the layered ferromagnet FePd2Te2

Controlling mesoscale and nanoscale material structures and properties through self-organized atomic behavior is essential for atomic-scale manufacturing. However, direct and visual studies on the cross-scale effects of such atomic self-organization on mesoscopic structures remain scarce. Here, we report the intertwined atomic-nanoscale-mesoscale structures via the intralayer Fe-chains in the sandwich-like layered FePd2Te2 crystal by scanning tunneling microscopy (STM) and atomic force microscopy (AFM). The hierarchical orthogonal corrugated morphologies are directly revealed and attributed to its chain-orientation-determined twinning-domain effect. Both Fe-chains of middle-sublayer and two kinds of Te atoms of top-sublayer are further atomically resolved at the sub-Å level, indicating the critical effects of Pd-atoms/voids on the intra-layer anisotropic Fe-chains and the interlayer structural alignment. The thermal-induced and strain-related structural transitions of surface layer are further investigated and discussed based on the proposed filling model of Pd-voids by the intralayer Pd-atoms. Our work not only provides deep understanding of this exotic layered magnetic material, and will inspire more perspectives for tailoring its anisotropic atomic-to-mesoscale structures and properties.

preprint2026arXiv

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

While autonomous software engineering (SWE) agents are reshaping programming paradigms, they currently suffer from a "closed-world" limitation: they attempt to fix bugs from scratch or solely using local context, ignoring the immense historical human experience available on platforms like GitHub. Accessing this open-world experience is hindered by the unstructured and fragmented nature of real-world issue-tracking data. In this paper, we introduce MemGovern, a framework designed to govern and transform raw GitHub data into actionable experiential memory for agents. MemGovern employs experience governance to convert human experience into agent-friendly experience cards and introduces an agentic experience search strategy that enables logic-driven retrieval of human expertise. By producing 135K governed experience cards, MemGovern achieves a significant performance boost, improving resolution rates on the SWE-bench Verified by 4.65%. As a plug-in approach, MemGovern provides a solution for agent-friendly memory infrastructure.

preprint2025arXiv

Enhanced Stability and Linearly Polarized Emission from CsPbI$_3$ Perovskite Nanoplatelets through A-site Cation Engineering

The anisotropy of perovskite nanoplatelets (PeNPLs) opens up many opportunities in optoelectronics, including enabling the emission of linearly polarized light. But the limited stability of PeNPLs is a pressing challenge, especially for red-emitting CsPbI$_3$. Herein, we address this limitation by alloying FA into the perovskite cuboctahedral site. Unlike Cs/FA alloying in bulk thin films or nonconfined nanocubes, FA incorporation in nanoplatelets requires meticulous control over the reaction conditions, given that nanoplatelets are obtained in kinetically-driven growth regimes instead of thermodynamically-driven conditions. Through in-situ photoluminescence (PL) measurements, we find that excess FA leads to uncontrolled growth, where phase-impurities and nanoplatelets of multiple thicknesses co-exist. Restricting the FA content to up to 25% Cs substitution enables monodisperse PeNPLs, and increases the PL quantum yield (from 53% to 61%), exciton lifetime (from 18 ns to 27 ns), and stability in ambient air (from ~2 days to >7 days) compared to CsPbI$_3$. This arises due to hydrogen bonding between FA and the oleate and oleylammonium ligands, anchoring them to the surface to improve optoelectronic properties and stability. The reduction in non-radiative recombination, improvement in the nanoplatelet aspect ratio, and higher ligand density lead to FA-containing PeNPLs more effectively forming edge-up superlattices, enhancing the PL degree of linear polarization from 5.1% (CsPbI$_3$) to 9.4% (Cs$_{0.75}$FA$_{0.25}$PbI$_3$). These fundamental insights show how the stability limitations of PeNPLs could be addressed, and these materials grown more precisely to improve their performance as polarized light emitters, critical for utilizing them in next-generation display, bioimaging and communications applications.