Source author record

Derong Xu

Derong Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language physics.acc-ph Machine Learning

Catalog footprint

What is connected

5works

3topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Align-GRAG: Anchor and Rationale Guided Dual Alignment for Graph Retrieval-Augmented Generation

Despite the strong abilities, large language models (LLMs) still suffer from hallucinations and reliance on outdated knowledge, raising concerns in knowledge-intensive tasks. Graph-based retrieval-augmented generation (GRAG) enriches LLMs with knowledge by retrieving graphs leveraging relational evidence, but it faces two challenges: structure-coupled irrelevant knowledge introduced by neighbor expansion and structure-reasoning discrepancy between graph embeddings and LLM semantics. We propose \ourmodel, an anchor-and-rationale guided refinement framework to address these challenges. It prompts an LLM to extract anchors and rationale chains, which provide intermediate supervision for \textbf{(1) node-level alignment} that identifies critical nodes and prunes noisy evidence, and \textbf{(2) graph-level alignment} that bridges graph and language semantic spaces via contrastive learning. Extensive experiments on commonsense reasoning, scene graph understanding, and knowledge graph reasoning demonstrate consistent gains over 18 strong baselines, validating the effectiveness of \ourmodel for improving graph-grounded generation. The code can be found in https://anonymous.4open.science/r/Align-GRAG-F3D8/.

preprint2026arXiv

Learning How and What to Memorize: Cognition-Inspired Two-Stage Optimization for Evolving Memory

Large language model (LLM) agents require long-term user memory for consistent personalization, but limited context windows hinder tracking evolving preferences over long interactions. Existing memory systems mainly rely on static, hand-crafted update rules; although reinforcement learning (RL)-based agents learn memory updates, sparse outcome rewards provide weak supervision, resulting in unstable long-horizon optimization. Drawing on memory schema theory and the functional division between prefrontal regions and hippocampus regions, we introduce MemCoE, a cognition-inspired two-stage optimization framework that learns how memory should be organized and what information to update. In the first stage, we propose Memory Guideline Induction to optimize a global guideline via contrastive feedback interpreted as textual gradients; in the second stage, Guideline-Aligned Memory Policy Optimization uses the induced guideline to define structured process rewards and performs multi-turn RL to learn a guideline-following memory evolution policy. We evaluate on three personalization memory benchmarks, covering explicit/implicit preference and different sizes and noise, and observe consistent improvements over strong baselines with favorable robustness, transferability, and efficiency.

preprint2026arXiv

More Edits, More Stable: Understanding the Lifelong Normalization in Sequential Model Editing

Lifelong Model Editing aims to continuously update evolving facts in Large Language Models while preserving unrelated knowledge and general capabilities, yet it remains plagued by catastrophic forgetting and model collapse. Empirically, we find that recent editors resilient over long horizons share the same core strategy: Lifelong Normalization (LN), which normalizes value gradients using running statistics. Removing LN causes immediate performance collapse, and we observe a counter-intuitive positive cumulative effect where early edits can promote the success of future edits. Yet the mechanism of LN remains a "black box", leaving its precise role in lifelong stability poorly understood. In this work, we provide the first theoretical account of LN in the lifelong regime. Our analysis reveals a self-reinforcing stability loop and proves that, when combined with ridge-regularized regression, LN yields parameter updates with asymptotic orthogonality and bounded norms, directly mitigating forgetting and systemic collapse. Based on these insights, we derive StableEdit, which strengthens this stability loop via an explicit warm-up stage and full whitening, improving long-horizon stability at minimal overhead. Extensive experiments validate our theory and demonstrate competitive performance. Our code is available at https://github.com/MINE-USTC/StableEdit.

preprint2022arXiv

Combined effects of Crab Dispersion and Momentum Dispersion in Colliders with Local Crab Crossing Scheme

In this paper, we present the effects of linear transverse-longitudinal coupling on beam size at Interaction Point (IP) of a collider with local crab crossing scheme, when time dependent transverse deflection (crab kicks) and dispersive orbit intertwine near IP. The analytic propagation formula and the closed orbit form of the crab dispersion and momentum dispersion are derived. The non-zero momentum dispersion at crab cavities and the non-ideal phase from crab cavities to IP are detailed with the derived propagation formula to predict the beam size distortion at IP with or without the beam-beam interaction. The linear results are compared with nonlinear simulation using the weak-strong beam-beam code.

preprint2022arXiv

Synchro-betatron Resonance of Crab Crossing Scheme with Large Crossing Angle and Finite Bunch Length

Crab crossing scheme is an essential collision scheme to achieve high luminosity for the future colliders with large crossing angles. However, when bunch length of one or both colliding beams is comparable with the wavelength of the crab cavity voltage, the nonlinear dependence of the crabbing kick may present a challenge to the beam dynamics of the colliding beams and impact the beam quality as well as the luminosity lifetime. In this paper, the results of nonlinear dynamics in the crab crossing scheme are presented, using both analytical and numerical studies. The result indicates that higher-order synchro-betatron resonances may be excited in the crab crossing scheme with large crossing angle, which causes the beam quality deterioration and luminosity degradation. The studies also reveal possible countermeasures to suppress the synchro-beta resonance, hence mitigate the degradation of beam quality and luminosity.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint