Source author record

Yuxi Chen

Yuxi Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher | Unclaimed source record

Artificial Intelligence, astro-ph.CO, Machine Learning, physics.space-ph, Computation and Language, Computer Vision, Cryptography and Security, Information Retrieval

Catalog footprint

What is connected

9 works

8 topics

4 close collaborators

preprint | 2026 | arXiv

Practical Poisoning Attacks against Retrieval-Augmented Generation

Large language models (LLMs) have demonstrated impressive natural language processing abilities but face challenges such as hallucination and outdated knowledge. Retrieval-Augmented Generation (RAG) has emerged as a state-of-the-art approach to mitigate these issues. While RAG enhances LLM outputs, it remains vulnerable to poisoning attacks. Recent studies show that injecting poisoned text into the knowledge database can compromise RAG systems, but most existing attacks assume that the attacker can insert a sufficient number of poisoned texts per query to outnumber correct-answer texts in retrieval, an assumption that is often unrealistic. To address this limitation, we propose CorruptRAG, a practical poisoning attack against RAG systems in which the attacker injects only a single poisoned text, enhancing both feasibility and stealth. Extensive experiments conducted on multiple large-scale datasets demonstrate that CorruptRAG achieves higher attack success rates than existing baselines.

preprint | 2026 | arXiv

TOC-Bench: A Temporal Object Consistency Benchmark for Video Large Language Models

Video large language models (Video-LLMs) have made strong progress in general video understanding, but their ability to maintain temporal object consistency remains underexplored. Existing benchmarks often emphasize event recognition, action understanding, or coarse temporal reasoning, while rarely testing whether models can preserve the identity, state, and continuity of the same object across occlusion, disappearance, reappearance, state transitions, and cross-object interactions. We introduce TOC-Bench, a diagnostic benchmark for evaluating temporal object consistency in Video-LLMs. TOC-Bench is object-track grounded: each queried subject is linked to a per-frame trajectory and a structured temporal event timeline. To ensure that questions require temporally ordered visual evidence rather than language priors, single-frame shortcuts, or unordered frame cues, we design a three-layer temporal-necessity filtering protocol, which removes 60.7% of candidate QA pairs and retains 17,900 temporally dependent items across 10 diagnostic dimensions. From this pool, we construct a human-verified benchmark with 2,323 high-quality QA pairs over 1,951 videos. Experiments on representative Video-LLMs show that temporal object consistency remains a major unsolved challenge, with notable weaknesses in event counting, event ordering, identity-sensitive reasoning, and hallucination-aware verification, even when models perform well on general video understanding benchmarks. These results suggest that object-centric temporal coherence is a key bottleneck for current Video-LLMs, and that TOC-Bench provides a focused platform for diagnosing and improving object-aware temporal reasoning. The resource is available at https://github.com/cjzcjz666/toc_bench.git.

preprint | 2026 | arXiv

Token Economics for LLM Agents: A Dual-View Study from Computing and Economics

As LLM agents evolve, tokens have emerged as the core economic primitives of Agentic AI. However, their exponential consumption introduces severe computational, collaborative, and security bottlenecks. Current surveys remain fragmented across system optimization, architecture design, and trust, lacking a unified framework to evaluate the fundamental trade-off between output quality and economic cost. To bridge this gap, this paper presents the first comprehensive survey of Token Economics. By unifying computer science and economics, we conceptualize tokens as production factors, exchange mediums, and units of account. We synthesize existing literature across a four-dimensional taxonomy: (1) Micro-level (Single Agent): Optimizing budget-constrained factor substitution via neoclassical firm theory. (2) Meso-level (Multi-Agent Systems): Minimizing collaboration friction using transaction cost and principal-agent theories. (3) Macro-level (Agent Ecosystems): Addressing congestion externalities and pricing via mechanism design. (4) Security: Internalizing adversarial threats as endogenous economic constraints. Finally, we outline frontier directions, including differentiable token budgets and dynamic markets, to lay the theoretical foundation for scalable next-generation agent systems.

preprint | 2022 | arXiv

DearFSAC: An Approach to Optimizing Unreliable Federated Learning via Deep Reinforcement Learning

In federated learning (FL), model aggregation has been widely adopted for data privacy. In recent years, assigning different weights to local models has been used to alleviate the FL performance degradation caused by differences between local datasets. However, when various defects make the FL process unreliable, most existing FL approaches exhibit weak robustness. In this paper, we propose the DEfect-AwaRe federated soft actor-critic (DearFSAC) to dynamically assign weights to local models to improve the robustness of FL. The deep reinforcement learning algorithm soft actor-critic is adopted for near-optimal performance and stable convergence. In addition, an auto-encoder is trained to output low-dimensional embedding vectors that are further utilized to evaluate model quality. In the experiments, DearFSAC outperforms three existing approaches on four datasets for both independent and identically distributed (IID) and non-IID settings under defective scenarios.
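The weighting idea in this abstract can be illustrated with a minimal sketch of weighted model aggregation. This is not the DearFSAC method: in the paper the weights come from a soft actor-critic policy, whereas here they are hand-chosen. The function name `aggregate`, the flat-list representation of model parameters, and the example values are all assumptions made for illustration.

```python
def aggregate(local_models, weights):
    """Weighted average of local parameter vectors (hypothetical helper).

    local_models: list of equal-length lists of floats, one per client.
    weights: one non-negative number per client; normalized internally.
    """
    if len(local_models) != len(weights):
        raise ValueError("need exactly one weight per local model")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    total = sum(weights)
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    dim = len(local_models[0])
    if any(len(m) != dim for m in local_models):
        raise ValueError("all local models must have the same length")
    # Normalize so the weights sum to 1, then average coordinate-wise.
    return [
        sum((w / total) * m[i] for m, w in zip(local_models, weights))
        for i in range(dim)
    ]


# Illustrative values only: two healthy clients and one defective update.
clients = [[1.0, 2.0], [3.0, 4.0], [100.0, -100.0]]
uniform = aggregate(clients, [1, 1, 1])        # defective client skews the result
downweighted = aggregate(clients, [1, 1, 0])   # defective client is ignored
print(uniform, downweighted)
```

Setting a client's weight to zero removes its influence entirely, which is the simplest case of the defect-aware weighting the abstract describes.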

preprint | 2022 | arXiv

Protum: A New Method For Prompt Tuning Based on "[MASK]"

Recently, prompt tuning (Lester et al., 2021) has gradually become a new paradigm for NLP: by freezing the parameters of pre-trained language models (PLMs), it depends only on word representations to obtain remarkable performance on downstream tasks. It stays consistent with the masked language model (MLM) task (Devlin et al., 2018) used in pre-training and avoids some issues that may arise during fine-tuning. Naturally, we consider that the "[MASK]" tokens carry more useful information than other tokens, because the model draws on context to predict the masked tokens. Current prompt tuning methods suffer from a serious problem when predicting multiple words: the answer tokens are composed randomly, so the methods must map tokens to labels with the help of a verbalizer. In response, we propose a new Prompt Tuning based on "[MASK]" (Protum) method, which constructs a classification task from the information carried by the hidden layer of the "[MASK]" tokens and then predicts the labels directly rather than the answer tokens. We also explore how different hidden layers under "[MASK]" affect our classification model on many different datasets. Finally, we find that Protum can achieve much better performance than fine-tuning after continuous pre-training, with less time consumption. Our model facilitates the practical application of large models in NLP.

preprint | 2020 | arXiv

Magnetohydrodynamic with embedded particle-in-cell simulation of the Geospace Environment Modeling dayside kinetic processes challenge event

We use the MHD with embedded particle-in-cell model (MHD-EPIC) to study the Geospace Environment Modeling (GEM) dayside kinetic processes challenge event at 01:50-03:00 UT on 2015-11-18, when the magnetosphere was driven by a steady southward IMF. In the MHD-EPIC simulation, the dayside magnetopause is covered by a PIC code so that the dayside reconnection is properly handled. We compare the magnetic fields and the plasma profiles of the magnetopause crossing with the MMS3 spacecraft observations. Most variables match the observations well in the magnetosphere, in the magnetosheath, and also during the current sheet crossing. The MHD-EPIC simulation produces flux ropes, and we demonstrate that some magnetic field and plasma features observed by the MMS3 spacecraft can be reproduced by a flux rope crossing event. We use an algorithm to automatically identify the reconnection sites from the simulation results. It turns out that there are usually multiple X-lines at the magnetopause. By tracing the locations of the X-lines, we find the typical moving speed of the X-line endpoints is about 70 km/s, which is higher than but still comparable with the ground-based observations.

preprint | 2019 | arXiv

Studying dawn-dusk asymmetries of Mercury's magnetotail using MHD-EPIC simulations

MESSENGER has observed many dawn-dusk asymmetries in Mercury's magnetotail, such as the asymmetries of the cross-tail current sheet thickness and the occurrence of flux ropes, dipolarization events and energetic electron injections. In order to obtain a global picture of Mercury's magnetotail dynamics and the relationship between these asymmetries, we perform global simulations with the magnetohydrodynamics with embedded particle-in-cell (MHD-EPIC) model, where Mercury's magnetotail region is covered by a PIC code. Our simulations show that the dawnside current sheet is thicker, the plasma density is larger, and the electron pressure is higher than on the duskside. Under a strong IMF driver, the simulated reconnection sites prefer the dawnside. We also find that the dipolarization events and the planetward electron jets drift dawnward as they move towards the planet, so that almost all dipolarization events and high-speed plasma flows concentrate in the dawn sector. The simulation results are consistent with MESSENGER observations.

preprint | 2011 | arXiv

Deep 1.1 mm-wavelength imaging of the GOODS-South field by AzTEC/ASTE -- II. Redshift distribution and nature of the submillimetre galaxy population

We report the results of the counterpart identification and a detailed analysis of the physical properties of the 48 sources discovered in our deep 1.1mm wavelength imaging survey of the GOODS-South field using the AzTEC instrument on the Atacama Submillimeter Telescope Experiment (ASTE). At least one robust or tentative counterpart candidate is found for 27 and 14 AzTEC sources, respectively, by employing deep radio continuum, Spitzer MIPS & IRAC, and LABOCA 870 micron data. Five of the sources (10%) have two robust counterparts each, supporting the idea that these galaxies are strongly clustered and/or heavily confused. Photometric redshifts and star formation rates (SFRs) are derived by analyzing UV-to-optical and IR-to-radio SEDs. The median redshift of z~2.6 is similar to other earlier estimates, but we show that 80% of the AzTEC-GOODS sources are at z>2, with a significant high redshift tail (20% at z>3.3). Rest-frame UV and optical properties of AzTEC sources are extremely diverse, spanning 10 magnitudes in the i- and K-band photometry with median values of i=25.3 and K=22.6 and a broad range of red colour (i-K=0-6). These AzTEC sources are some of the most luminous galaxies in the rest-frame optical bands at z>2, with inferred stellar masses of (1-30) x 10^{10} solar masses and UV-derived star formation rates of SFR(UV) > 10-1000 solar masses per year. The IR-derived SFR, 200-2000 solar masses per year, is independent of redshift or stellar mass. The resulting specific star formation rates, SSFR = 1-100 per Gyr, are 10-100 times higher than similar mass galaxies at z=0, and they extend the previously observed rapid rise in the SSFR with redshift to z=2-5. These galaxies have an SFR high enough to have built up their entire stellar mass within their Hubble time. We find only marginal evidence for an AGN contribution to the near-IR and mid-IR SEDs. (abridged)
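The specific star formation rate quoted in this abstract is the star formation rate divided by the stellar mass, and its inverse gives a rough mass build-up time. The sketch below works through that arithmetic. The function names and the pairing of one SFR with one stellar mass are illustrative assumptions, not values reported for any individual source, and the build-up time assumes a constant SFR with no stellar mass loss.

```python
YEARS_PER_GYR = 1e9


def ssfr_per_gyr(sfr_msun_per_yr, stellar_mass_msun):
    """Specific star formation rate, SFR / M*, expressed per Gyr."""
    return sfr_msun_per_yr / stellar_mass_msun * YEARS_PER_GYR


def buildup_time_gyr(sfr_msun_per_yr, stellar_mass_msun):
    """Time to form the current stellar mass at a constant SFR (assumption)."""
    return 1.0 / ssfr_per_gyr(sfr_msun_per_yr, stellar_mass_msun)


# Hypothetical pairing drawn from the quoted ranges: an IR-derived SFR of
# 200 solar masses per year and a stellar mass of 1e10 solar masses.
example_ssfr = ssfr_per_gyr(200.0, 1e10)
example_time = buildup_time_gyr(200.0, 1e10)
print(f"SSFR = {example_ssfr:.1f} per Gyr, build-up time = {example_time:.3f} Gyr")
```

For this pairing the SSFR is about 20 per Gyr, inside the quoted 1-100 per Gyr range, and the build-up time is about 0.05 Gyr.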

preprint | 2010 | arXiv

Color Magnitude Relation and Morphology of Low-Redshift ULIRGs in SDSS

We present color-magnitude and morphological analysis of 54 low-redshift ULIRGs, a subset of the IRAS 1Jy sample (Kim & Sanders, 1998), in the SDSS. The ULIRGs are on average 1 magnitude brighter in M0.1r than the SDSS galaxies within the same redshift range. The majority of the ULIRGs (~87%) have the colors typical of the blue cloud, and only 4 sources (~7%) are located in the red sequence. While ULIRGs are popularly thought to be precursors to a QSO phase, we find few (~6%) in the "green valley" where the majority of the X-ray and IR selected AGNs are found, and none of these harbors an AGN. For the 14 previously spectroscopically identified AGNs (~28%), we perform PSF subtractions and find that on average the central point sources contribute less than one third to the total luminosity, and that their high optical luminosities and overall blue colors are apparently the result of star formation activity of the host galaxies. Visual inspection of the SDSS images reveals a wide range of disturbed morphologies. A detailed morphology analysis using Gini and M20 coefficients shows that slightly less than one half (~42% in g band) of the ULIRGs are located in the region where most local mergers are found. The heterogeneous distribution of ULIRGs in the G-M20 space is qualitatively consistent with the results found by numerical simulations of disk-disk mergers. Our study also shows that the measured morphological parameters are systematically affected by the SNR and thus the merging galaxies can appear in various regions of the G-M20 space. In general, our results reinforce the view that ULIRGs contain young stellar populations and are mergers in progress. Our study provides a uniform comparison sample for studying ULIRGs at higher redshifts such as Spitzer mid-IR selected ULIRGs at z=1~2 and submm galaxies.
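The Gini coefficient mentioned in this abstract measures how unevenly a galaxy's flux is spread over its pixels. The sketch below uses a form of the statistic that is common in galaxy morphology work: sort the absolute pixel fluxes and weight each by its rank. Whether the paper uses exactly this estimator is an assumption, the function name `gini` is hypothetical, and M20 is not covered here.

```python
def gini(pixel_fluxes):
    """Gini coefficient of a list of pixel fluxes (hypothetical helper).

    Returns 0 when every pixel carries the same flux and 1 when all of the
    flux sits in a single pixel.
    """
    values = sorted(abs(v) for v in pixel_fluxes)
    n = len(values)
    if n < 2:
        raise ValueError("need at least two pixels")
    mean = sum(values) / n
    if mean == 0:
        raise ValueError("total flux must be non-zero")
    # Rank-weighted sum over the sorted fluxes, with ranks i = 1..n.
    ranked = sum((2 * i - n - 1) * v for i, v in enumerate(values, start=1))
    return ranked / (mean * n * (n - 1))


# Illustrative pixel lists only, not data from the paper.
smooth = gini([1.0, 1.0, 1.0, 1.0])   # flux spread evenly
compact = gini([0.0, 0.0, 0.0, 8.0])  # flux concentrated in one pixel
print(smooth, compact)
```

The two cases mark the ends of the scale. Real galaxy images fall between them, and, as the abstract notes, the measured values shift with the signal-to-noise ratio.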
