Source author record

Jie Li

Jie Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Computer Vision Machine Learning physics.optics Computation and Language cond-mat.mes-hall Cryptography and Security quant-ph Sound Biological Physics cond-mat.stat-mech cs.CY eess.AS Graphics math.OC math.RA math.RT Multiagent Systems

Catalog footprint

What is connected

32works

18topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Unified Spoken Language Model with Injected Emotional-Attribution Thinking for Human-like Interaction

This paper presents a unified spoken language model for emotional intelligence, enhanced by a novel data construction strategy termed Injected Emotional-Attribution Thinking (IEAT). IEAT incorporates user emotional states and their underlying causes into the model's internal reasoning process, enabling emotion-aware reasoning to be internalized rather than treated as explicit supervision. The model is trained with a two-stage progressive strategy. The first stage performs speech-text alignment and emotional attribute modeling via self-distillation, while the second stage conducts end-to-end cross-modal joint optimization to ensure consistency between textual and spoken emotional expressions. Experiments on the Human-like Spoken Dialogue Systems Challenge (HumDial) Emotional Intelligence benchmark demonstrate that the proposed approach achieves top-ranked performance across emotional trajectory modeling, emotional reasoning, and empathetic response generation under both LLM-based and human evaluations.

preprint2026arXiv

An Enigmatic PeVatron in an Area around HII Region G35.6$-$0.5

Identifying Galactic PeVatrons (PeV particle accelerators) from the ultra-high-energy (UHE, >100 TeV) $γ$-ray sources plays a crucial role in revealing the origin of Galactic cosmic rays. The UHE source 1LHAASO J1857+0203u is suggested to be associated with HESS J1858+020, which may be attributed to the possible PeVatron candidate supernova remnant (SNR) G35.6$-$0.4 or HII region G35.6$-$0.5. We perform detailed analysis on the very-high-energy and UHE $γ$-ray emissions towards this region with data from the Large High Altitude Air Shower Observatory (LHAASO). 1LHAASO J1857+0203u is detected with a significance of 11.6$σ$ above 100 TeV, indicating the presence of a PeVatron. It has an extension of $\sim 0.18^\circ$ with a power-law (PL) spectral index of $\sim$2.5 in 1-25 TeV and a point-like emission with a PL spectral index of $\sim$3.2 above 25 TeV. Using the archival CO and HI data, we identify some molecular and atomic clouds that may be associated with the TeV $γ$-ray emissions. Our modelling indicates that the TeV $γ$-ray emissions are unlikely to arise from the clouds illuminated by the protons that escaped from SNR G35.6$-$0.4. In the scenario that HII region G35.6$-$0.5 could accelerate particles to the UHE band, the observed GeV-TeV $γ$-ray emission could be well explained by a hadronic model with a PL spectral index of $\sim$2.0 and cutoff energy of $\sim$450 TeV. However, an evolved pulsar wind nebula origin cannot be ruled out.

preprint2026arXiv

An Ultrahigh-energy $γ$-ray Bubble Powered by a Super PeVatron

We report the detection of a $γ$-ray bubble spanning at least 100$\rm deg^2$ in ultra high energy (UHE) up to a few PeV in the direction of the star-forming region Cygnus X, implying the presence Super PeVatron(s) accelerating protons to at least 10 PeV. A log-parabola form with the photon index $Γ(E) = (2.71 \pm 0.02) + (0.11 \pm 0.02) \times \log_{10} (E/10 \ {\rm TeV})$ is found fitting the gamma-ray energy spectrum of the bubble well. UHE sources, `hot spots' correlated with very massive molecular clouds, and a quasi-spherical amorphous $γ$-ray emitter with a sharp central brightening are observed in the bubble. In the core of $\sim 0.5^{\circ}$, spatially associating with a region containing massive OB association (Cygnus OB2) and a microquasar (Cygnus X-3), as well as previously reported multi-TeV sources, an enhanced concentration of UHE $γ$-rays are observed with 2 photons at energies above 1 PeV. The general feature of the bubble, the morphology and the energy spectrum, are reasonably reproduced by the assumption of a particle accelerator in the core, continuously injecting protons into the ambient medium.

preprint2026arXiv

Bridging Photon Statistics and Phase Transitions in Random Fiber Lasers

Complex systems exhibit rich equilibrium states, yet the universal principles governing these systems remain unrevealed, motivating the search for novel experimental platforms. Random fiber lasers (RFLs), which generate partially-coherent light-wave through feedback from Rayleigh scattering, provide a photonic realization of such systems. Here we report a comprehensive theoretical and experimental investigation of photon statistics for RFLs based on classical second-order temporal correlation function $ g^{(2)}(τ) $, revealing unique statistical properties and introduce a two-dimensional framework for controlling photon statistics. Remarkably, we establish a unified landscape between photon correlation, intensity statistics governed by Levy statistics, and phase transitions with replica symmetry breaking. This multifaceted relationship, observed for the first time, bridges disordered photonics with statistical physics of complex system. Our results offer new pathways for engineering laser emission with controllable photon statistics, and more broadly, this work positions RFLs as a fertile land for exploring emergent behaviors in disordered systems.

preprint2026arXiv

Constraining the Cosmic-ray Energy Based on Observations of Nearby Galaxy Clusters by LHAASO

Galaxy clusters act as reservoirs of high-energy cosmic rays (CRs). As CRs propagate through the intracluster medium, they generate diffuse $γ$-rays detectable by arrays such as LHAASO. These $γ$-rays result from proton-proton ($pp$) collisions of very high-energy cosmic rays (VHECRs) or inverse Compton (IC) scattering of positron-electron pairs created by $pγ$ interactions of ultra-high-energy cosmic rays (UHECRs). We analyzed diffuse $γ$-ray emission from the Coma, Perseus, and Virgo clusters using LHAASO data. Diffuse emission was modeled as a disk of radius $R_{500}$ for each cluster while accounting for point sources. No significant diffuse emission was detected, yielding 95\% confidence level (C.L.) upper limits on the $γ$-ray flux: for WCDA (1-25~TeV) and KM2A ($>25$~TeV), less than $(49.4, 13.7, 54.0)$ and $(1.34, 1.14, 0.40) \times 10^{-14}$~ph~cm$^{-2}$~s$^{-1}$ for Coma, Perseus, and Virgo, respectively. The $γ$-ray upper limits can be used to derive model-independent constraints on the integral energy of CRp above 10~TeV (corresponding to the LHAASO observational range $>1$~TeV under the $pp$ scenario) to be less than $(1.96, 0.59, 0.08) \times 10^{61}$~erg. The absence of detectable annuli/ring-like structures, indicative of cluster accretion or merging shocks, imposes further constraints on models in which the UHECRs are accelerated in the merging shocks of galaxy clusters.

preprint2026arXiv

Constraints on heavy decaying dark matter from 570 days of LHAASO observations

The Kilometer Square Array~(KM2A) of the Large High Altitude Air Shower Observatory (LHAASO) aims at surveying the northern gamma-ray sky at energies above 10 TeV with unprecedented sensitivity. Gamma-ray observations have long been one of the most powerful tools for dark matter searches, as e.g., high-energy gamma-rays could be produced by the decays of heavy dark matter particles. In this letter, we present the first dark matter analysis with LHAASO-KM2A, using the first 340~days of data from 1/2-KM2A and 230~days of data from 3/4-KM2A. Several regions of interest are used to search for a signal and account for the residual cosmic-ray background after gamma/hadron separation. We find no excess of dark matter signals, and thus place some of the strongest gamma-ray constraints on the lifetime of heavy dark matter particles with mass between 10^5 and 10^9~GeV. Our results with LHAASO are robust, and have important implications for dark matter interpretations of the diffuse astrophysical high-energy neutrino emission.

preprint2026arXiv

CS-GBA: A Critical Sample-based Gradient-guided Backdoor Attack for Offline Reinforcement Learning

Offline Reinforcement Learning (RL) enables policy optimization from static datasets but is inherently vulnerable to backdoor attacks. Existing attack strategies typically struggle against safety-constrained algorithms (e.g., CQL) due to inefficient random poisoning and the use of easily detectable Out-of-Distribution (OOD) triggers. In this paper, we propose CS-GBA (Critical Sample-based Gradient-guided Backdoor Attack), a novel framework designed to achieve high stealthiness and destructiveness under a strict budget. Leveraging the theoretical insight that samples with high Temporal Difference (TD) errors are pivotal for value function convergence, we introduce an adaptive Critical Sample Selection strategy that concentrates the attack budget on the most influential transitions. To evade OOD detection, we propose a Correlation-Breaking Trigger mechanism that exploits the physical mutual exclusivity of state features (e.g., 95th percentile boundaries) to remain statistically concealed. Furthermore, we replace the conventional label inversion with a Gradient-Guided Action Generation mechanism, which searches for worst-case actions within the data manifold using the victim Q-network's gradient. Empirical results on D4RL benchmarks demonstrate that our method significantly outperforms state-of-the-art baselines, achieving high attack success rates against representative safety-constrained algorithms with a minimal 5% poisoning budget, while maintaining the agent's performance in clean environments.

preprint2026arXiv

Deep view of Composite SNR CTA1 with LHAASO in $γ$-rays up to 300 TeV

The ultra-high-energy (UHE) gamma-ray source 1LHAASO J0007+7303u is positionally associated with the composite SNR CTA1 that is located at high Galactic Latitude $b\approx 10.5^\circ$. This provides a rare opportunity to spatially resolve the component of the pulsar wind nebula (PWN) and supernova remnant (SNR) at UHE. This paper conducted a dedicated data analysis of 1LHAASO J0007+7303u using the data collected from December 2019 to July 2023. This source is well detected with significances of 21$σ$ and 17$σ$ at 8$-$100 TeV and $>$100 TeV, respectively. The corresponding extensions are determined to be 0.23$^{\circ}\pm$0.03$^{\circ}$ and 0.17$^{\circ}\pm$0.03$^{\circ}$. The emission is proposed to originate from the relativistic electrons and positrons accelerated within the PWN of PSR J0007+7303. The energy spectrum is well described by a power-law with an exponential cutoff function $dN/dE = (42.4\pm4.1)(\frac{E}{20\rm\ TeV})^{-2.31\pm0.11}\exp(-\frac{E}{110\pm25\rm\ TeV})$ $\rm\ TeV^{-1}\ cm^{-2}\ s^{-1}$in the energy range from 8 TeV to 300 TeV, implying a steady-state parent electron spectrum $dN_e/dE_e\propto (\frac{E_e}{100\rm\ TeV})^{-3.13\pm0.16}\exp[(\frac{-E_e}{373\pm70\rm\ TeV})^2]$ at energies above $\approx 50 \rm\ TeV$. The cutoff energy of the electron spectrum is roughly equal to the expected current maximum energy of particles accelerated at the PWN terminal shock. Combining the X-ray and gamma-ray emission, the current space-averaged magnetic field can be limited to $\approx 4.5\rm\ μG$. To satisfy the multi-wavelength spectrum and the $γ$-ray extensions, the transport of relativistic particles within the PWN is likely dominated by the advection process under the free-expansion phase assumption.

preprint2026arXiv

Energy calibration of LHAASO-KM2A using the cosmic ray Moon shadow

We present a precise measurement of the westward, rigidity-dependent shift of the Moon's shadow using three and a half years of cosmic-ray data collected by the Kilometer Square Array (KM2A) of the Large High Altitude Air Shower Observatory (LHAASO). These measurements enable us to calibrate the detector energy response in the range 20-260 TeV, with results showing excellent agreement with the response derived from Monte Carlo (MC) simulations of the KM2A detector. We also measure a best-fit parameter $ε= 0.015 \pm 0.08$, corresponding to a 95% confidence interval of [-14%, +17%] for the energy-scale estimation. This result establishes the exceptional accuracy of the KM2A-MC in simulating the detector's response within this energy range.

preprint2026arXiv

Energy-Dependent Shifts of Medium-Scale Anisotropies in Very-High-Energy Cosmic Rays Observed by LHAASO-KM2A

Small deviations from isotropy in the arrival directions of Galactic cosmic rays serve as a unique probe of the local magnetic environment. In this Letter, we report observations of medium-scale anisotropies (MSA) at energies above 10 TeV using the LHAASO-KM2A array. Our analysis identifies four regions of excess and four regions of deficit, each spanning angular scales of approximately ten degrees. Crucially, we detect significant energy-dependent shifts in the centroids of two excess regions: Region B and the newly identified Region $\mathrm{\widetilde{D}}$. We also characterize the energy evolution of the fractional relative intensity across both excess and deficit regions. These findings imply that the observed anisotropies are shaped by the specific realization of the local turbulent magnetic field within the cosmic ray scattering length. Such energy-dependent behaviors impose strict constraints on local turbulence models and cosmic ray propagation theories.

preprint2026arXiv

Evidence for particle acceleration approaching PeV energies in the W51 complex

The $γ$-ray emission from the W51 complex is widely acknowledged to be attributed to the interaction between the cosmic rays (CRs) accelerated by the shock of supernova remnant (SNR) W51C and the dense molecular clouds in the adjacent star-forming region, W51B. However, the maximum acceleration capability of W51C for CRs remains elusive. Based on observations conducted with the Large High Altitude Air Shower Observatory (LHAASO), we report a significant detection of $γ$ rays emanating from the W51 complex, with energies from 2 TeV to 200 TeV. The LHAASO measurements, for the first time, extend the $γ$-ray emission from the W51 complex beyond 100 TeV and reveal a significant spectrum bending at tens of TeV. By combining the ``$π^0$-decay bump" featured data from Fermi-LAT, the broadband $γ$-ray spectrum of the W51 region can be well-characterized by a simple pp-collision model. The observed spectral bending feature suggests an exponential cutoff at $\sim400$~TeV or a power-law break at $\sim200$~TeV in the CR proton spectrum, most likely providing the first evidence of SNRs serving as CR accelerators approaching the PeV regime. Additionally, two young star clusters within W51B could also be theoretically viable to produce the most energetic $γ$ rays observed by LHAASO. Our findings strongly support the presence of extreme CR accelerators within the W51 complex and provide new insights into the origin of Galactic CRs.

preprint2026arXiv

Exploring Lorentz Invariance Violation from Ultra-high-energy Gamma Rays Observed by LHAASO

Recently the LHAASO Collaboration published the detection of 12 ultra-high-energy gamma-ray sources above 100 TeV, with the highest energy photon reaching 1.4 PeV. The first detection of PeV gamma rays from astrophysical sources may provide a very sensitive probe of the effect of the Lorentz invariance violation (LIV), which results in decay of high-energy gamma rays in the superluminal scenario and hence a sharp cutoff of the energy spectrum. Two highest energy sources are studied in this work. No signature of the existence of LIV is found in their energy spectra, and the lower limits on the LIV energy scale are derived. Our results show that the first-order LIV energy scale should be higher than about 10^5 times the Planck scale M_{pl} and that the second-order LIV scale is >10^{-3}M_{pl}. Both limits improve by at least one order of magnitude the previous results.

preprint2026arXiv

Extended Very-High-Energy Gamma-Ray Emission Surrounding PSR J0622 + 3749 Observed by LHAASO-KM2A

We report the discovery of an extended very-high-energy (VHE) gamma-ray source around the location of the middle-aged (207.8 kyr) pulsar PSR J0622+3749 with the Large High Altitude Air Shower Observatory (LHAASO). The source is detected with a significance of $8.2σ$ for $E>25$~TeV assuming a Gaussian template. The best-fit location is (R.A., Dec.)$=(95^{\circ}\!.47\pm0^{\circ}\!.11,\,37^{\circ}\!.92 \pm0^{\circ}\!.09)$, and the extension is $0^{\circ}\!.40\pm0^{\circ}\!.07$. The energy spectrum can be described by a power-law spectrum with an index of ${-2.92 \pm 0.17_{\rm stat} \pm 0.02_{\rm sys} }$. No clear extended multi-wavelength counterpart of the LHAASO source has been found from the radio to sub-TeV bands. The LHAASO observations are consistent with the scenario that VHE electrons escaped from the pulsar, diffused in the interstellar medium, and scattered the interstellar radiation field. If interpreted as the pulsar halo scenario, the diffusion coefficient, inferred for electrons with median energies of $\sim160$~TeV, is consistent with those obtained from the extended halos around Geminga and Monogem and much smaller than that derived from cosmic ray secondaries. The LHAASO discovery of this source thus likely enriches the class of so-called pulsar halos and confirms that high-energy particles generally diffuse very slowly in the disturbed medium around pulsars.

preprint2026arXiv

FastFLUX: Pruning FLUX with Block-wise Replacement and Sandwich Training

Recent advancements in text-to-image (T2I) generation have led to the emergence of highly expressive models such as diffusion transformers (DiTs), exemplified by FLUX. However, their massive parameter sizes lead to slow inference, high memory usage, and poor deployability. Existing acceleration methods (e.g., single-step distillation and attention pruning) often suffer from significant performance degradation and incur substantial training costs. To address these limitations, we propose FastFLUX, an architecture-level pruning framework designed to enhance the inference efficiency of FLUX. At its core is the Block-wise Replacement with Linear Layers (BRLL) method, which replaces structurally complex residual branches in ResBlocks with lightweight linear layers while preserving the original shortcut connections for stability. Furthermore, we introduce Sandwich Training (ST), a localized fine-tuning strategy that leverages LoRA to supervise neighboring blocks, mitigating performance drops caused by structural replacement. Experiments show that our FastFLUX maintains high image quality under both qualitative and quantitative evaluations, while significantly improving inference speed, even with 20\% of the hierarchy pruned. Our code will be available soon.

preprint2026arXiv

Fluctuation-Dissipation Limits in Quantum Thermoelectric Transport

As a fundamental measure of stability in nonequilibrium thermodynamics, fluctuations provide critical insight into the performance and reliability of heat engines. In this work, we establish universal fluctuation-dissipation bounds that directly link energy-current fluctuations to both the entropy production rate and steady-state transport currents. Our results are applicable to arbitrary temperature and chemical potential gradients and hold for all steady states within the framework of quantum scattering theory. These bounds remain robust even in regimes where quantum effects break classical thermodynamic uncertainty relations. We demonstrate their validity by using boxcar transmission functions and further derive constraints on the power output from the perspective of fluctuations and dissipation, offering a unified thermodynamic guideline for the design and evaluation of nanoscale and quantum thermal devices.

preprint2026arXiv

Fusion of Multiscale Features Via Centralized Sparse-attention Network for EEG Decoding

Electroencephalography (EEG) signal decoding is a key technology that translates brain activity into executable commands, laying the foundation for direct brain-machine interfacing and intelligent interaction. To address the inherent spatiotemporal heterogeneity of EEG signals, this paper proposes a multi-branch parallel architecture, where each temporal scale is equipped with an independent spatial feature extraction module. To further enhance multi-branch feature fusion, we propose a Fusion of Multiscale Features via Centralized Sparse-attention Network (EEG-CSANet), a centralized sparse-attention network. It employs a main-auxiliary branch architecture, where the main branch models core spatiotemporal patterns via multiscale self-attention, and the auxiliary branch facilitates efficient local interactions through sparse cross-attention. Experimental results show that EEG-CSANet achieves state-of-the-art (SOTA) performance across five public datasets (BCIC-IV-2A, BCIC-IV-2B, HGD, SEED, and SEED-VIG), with accuracies of 88.54%, 91.09%, 97.15%, 96.03%, and 90.56%, respectively. Such performance demonstrates its strong adaptability and robustness across various EEG decoding tasks. Moreover, extensive ablation studies are conducted to enhance the interpretability of EEG-CSANet. In the future, we hope that EEG-CSANet could serve as a promising baseline model in the field of EEG signal decoding. The source code is publicly available at: https://github.com/Xiangrui-Cai/EEG-CSANet

preprint2026arXiv

GSAlign: Geometric and Semantic Alignment Network for Aerial-Ground Person Re-Identification

Aerial-Ground person re-identification (AG-ReID) is an emerging yet challenging task that aims to match pedestrian images captured from drastically different viewpoints, typically from unmanned aerial vehicles (UAVs) and ground-based surveillance cameras. The task poses significant challenges due to extreme viewpoint discrepancies, occlusions, and domain gaps between aerial and ground imagery. While prior works have made progress by learning cross-view representations, they remain limited in handling severe pose variations and spatial misalignment. To address these issues, we propose a Geometric and Semantic Alignment Network (GSAlign) tailored for AG-ReID. GSAlign introduces two key components to jointly tackle geometric distortion and semantic misalignment in aerial-ground matching: a Learnable Thin Plate Spline (LTPS) Module and a Dynamic Alignment Module (DAM). The LTPS module adaptively warps pedestrian features based on a set of learned keypoints, effectively compensating for geometric variations caused by extreme viewpoint changes. In parallel, the DAM estimates visibility-aware representation masks that highlight visible body regions at the semantic level, thereby alleviating the negative impact of occlusions and partial observations in cross-view correspondence. A comprehensive evaluation on CARGO with four matching protocols demonstrates the effectiveness of GSAlign, achieving significant improvements of +18.8\% in mAP and +16.8\% in Rank-1 accuracy over previous state-of-the-art methods on the aerial-ground setting.

preprint2026arXiv

Leak Proof PDBBind: A Reorganized Dataset of Protein-Ligand Complexes for More Generalizable Binding Affinity Prediction

The majority of machine learning scoring functions used in drug discovery for predicting protein-ligand binding poses and affinities have been trained on the PDBBind dataset. However, it is unclear whether these new scoring functions are actually an improvement over traditional models since often the training and test sets are cross-contaminated with proteins and ligands with high similarity, and hence they may not perform comparably well in binding prediction of unrelated protein-ligand complexes. In this work we have carefully prepared a new split of the PDBBind data set to control for data leakage, defined as proteins and ligands with high sequence and structural similarity. The resulting leak-proof (LP)-PDBBind data is used to retrain four popular SFs: AutoDock Vina, Random Forest (RF)-Score, InteractionGraphNet (IGN), and DeepDTA, to better test their capabilities when applied to new protein-ligand complexes. In particular we have formulated a new independent data set, BDB2020+, by matching high quality binding free energies from BindingDB with co-crystalized ligand-protein complexes from the PDB that have been deposited since 2020. Based on all the benchmark results, the retrained models using LP-PDBBind consistently perform better, with IGN especially being recommended for scoring and ranking applications for new protein-ligand systems.

preprint2026arXiv

LHAASO Detection of Ultra-High-Energy Gamma-Ray Emission toward the Giant Molecular Clouds

The $γ$-ray from Giant molecular clouds (GMCs) is regarded as the most ideal tool to perform in-situ measurement of cosmic ray (CR) density and spectra in our Galaxy. We report the first detection of $γ$-ray emissions in the very-high-energy (VHE) domain from the five nearby GMCs with a stacking analysis based on a 4.5-year $γ$-ray observation with the Large High Altitude Air Shower Observatory (LHAASO) experiment. The spectral energy distributions derived from the GMCs are consistent with the expected $γ$-ray flux produced via CR interacting with the ISM in the energy interval 1 - 100 $~\rm$ TeV. In addition, we investigate the presence of the CR spectral `knee' by introducing a spectral break in the $γ$-ray data. While no significant evidence for the CR knee is found, the current KM2A measurements from GMCs strongly favor a proton CR knee located above 0.9$~\rm$ PeV, which is consistent with the latest measurement of the CR spectrum by ground-based experiments.

preprint2026arXiv

Long-range optomechanical interactions in SiN membrane arrays

Optomechanical systems using a membrane-in-the-middle configuration can exhibit a long-range type of interaction similar to how atoms show collective motion in an optical potential. Photons bounce back and forth inside a high-finesse Fabry-Pérot cavity and mediate the interaction between multiple membranes over a significant distance compared to the wavelength. Recently, it has been demonstrated that off-resonant coupling between light and the inter-membrane cavity can lead to coherent mechanical noise cancellation. On-resonance coupling of light with both the Fabry-Pérot and inter-membrane cavities, predicted to enhance the single photon optomechanical coupling, have to date not been experimentally demonstrated, however. In our experiment, a double-membrane system inside a Fabry-Pérot cavity resonantly enhances the cavity field, resulting in a stronger optomechanical coupling strength from the increased radiation pressure. The resonance condition is first identified by analyzing the slope of the dispersion relation. Then, the optomechanical coupling is determined at various chip positions over one wavelength range. The optimum coupling conditions are obtained and enhancement is demonstrated for double membrane arrays with three different reflectivites, reaching nearly four-fold enhancement for the collective motion of $R=65\%$ double membranes. The cavity losses at the optimum coupling are also characterized and the potential of reaching the single-photon strong coupling regime is discussed.

preprint2026arXiv

LooC: Effective Low-Dimensional Codebook for Compositional Vector Quantization

Vector quantization (VQ) is a prevalent and fundamental technique that discretizes continuous feature vectors by approximating them using a codebook. As the diversity and complexity of data and models continue to increase, there is an urgent need for high-capacity, yet more compact VQ methods. This paper aims to reconcile this conflict by presenting a new approach called LooC, which utilizes an effective Low-dimensional codebook for Compositional vector quantization. Firstly, LooC introduces a parameter-efficient codebook by reframing the relationship between codevectors and feature vectors, significantly expanding its solution space. Instead of individually matching codevectors with feature vectors, LooC treats them as lower-dimensional compositional units within feature vectors and combines them, resulting in a more compact codebook with improved performance. Secondly, LooC incorporates a parameter-free extrapolation-by-interpolation mechanism to enhance and smooth features during the VQ process, which allows for better preservation of details and fidelity in feature approximation. The design of LooC leads to full codebook usage, effectively utilizing the compact codebook while avoiding the problem of collapse. Thirdly, LooC can serve as a plug-and-play module for existing methods for different downstream tasks based on VQ. Finally, extensive evaluations on different tasks, datasets, and architectures demonstrate that LooC outperforms existing VQ methods, achieving state-of-the-art performance with a significantly smaller codebook.

preprint2026arXiv

MDReID: Modality-Decoupled Learning for Any-to-Any Multi-Modal Object Re-Identification

Real-world object re-identification (ReID) systems often face modality inconsistencies, where query and gallery images come from different sensors (e.g., RGB, NIR, TIR). However, most existing methods assume modality-matched conditions, which limits their robustness and scalability in practical applications. To address this challenge, we propose MDReID, a flexible any-to-any image-level ReID framework designed to operate under both modality-matched and modality-mismatched scenarios. MDReID builds on the insight that modality information can be decomposed into two components: modality-shared features that are predictable and transferable, and modality-specific features that capture unique, modality-dependent characteristics. To effectively leverage this, MDReID introduces two key components: the Modality Decoupling Learning (MDL) and Modality-aware Metric Learning (MML). Specifically, MDL explicitly decomposes modality features into modality-shared and modality-specific representations, enabling effective retrieval in both modality-aligned and mismatched scenarios. MML, a tailored metric learning strategy, further enforces orthogonality and complementarity between the two components to enhance discriminative power across modalities. Extensive experiments conducted on three challenging multi-modality ReID benchmarks (RGBNT201, RGBNT100, MSVR310) consistently demonstrate the superiority of MDReID. Notably, MDReID achieves significant mAP improvements of 9.8\%, 3.0\%, and 11.5\% in general modality-matched scenarios, and average gains of 3.4\%, 11.8\%, and 10.9\% in modality-mismatched scenarios, respectively. The code is available at: \textcolor{magenta}{https://github.com/stone96123/MDReID}.

preprint2026arXiv

Measurement of attenuation length of the muon content in extensive air showers from 0.3 to 30 PeV with LHAASO

The attenuation length of the muon content in extensive air showers provides important information regarding the generation and development of air showers. This information can be used not only to improve the description of such showers but also to test fundamental models of hadronic interactions. Using data from the LHAASO-KM2A experiment, the development of the muon content in high-energy air showers was studied. The attenuation length of muon content in the air showers was measured from experimental data in the energy range from 0.3 to 30 PeV using the constant intensity cut method. By comparing the attenuation length of the muon content with predictions from high-energy hadronic interaction models (QGSJET-II-04, SIBYLL 2.3d, and EPOS-LHC), it is evident that LHAASO results are significantly shorter than those predicted by the first two models (QGSJET-II-04 and SIBYLL 2.3d) but relatively close to those predicted by the third model (EPOS-LHC). Thus, the LHAASO data favor the EPOS-LHC model over the other two models. The three interaction models confirmed an increasing trend in the attenuation length as the cosmic-ray energy increases.

preprint2026arXiv

Measurement of Very-high-energy Diffuse Gamma-ray Emissions from the Galactic Plane with LHAASO-WCDA

The diffuse Galactic gamma-ray emission is a very important tool used to study the propagation and interaction of cosmic rays in the Milky Way. In this work, we report the measurements of the diffuse emission from the Galactic plane, covering Galactic longitudes from $15^{\circ}$ to $235^{\circ}$ and latitudes from $-5^{\circ}$ to $+5^{\circ}$, in an energy range of 1 TeV to 25 TeV, with the Water Cherenkov Detector Array (WCDA) of the Large High Altitude Air Shower Observatory (LHAASO). After masking the sky regions of known sources, the diffuse emission is detected with $24.6σ$ and $9.1σ$ significance in the inner Galactic plane and outer Galactic plane, respectively. The WCDA spectra in both regions can be well described by a power-law function, with spectral indices of $-2.67\pm0.05_{\rm stat}$ in the inner region and $-2.83\pm0.19_{\rm stat}$ in the outer region, respectively. Combined with the Square Kilometer Array (KM2A) measurements at higher energies, a clear softening of the spectrum is found in the inner region, with change of spectral indices by $\sim0.5$ at a break energy around $30$ TeV. The fluxes of the diffuse emission are higher by a factor of $1.5-2.7$ than the model prediction assuming local CR spectra and the gas column density, which are consistent with those measured by the KM2A. Along Galactic longitude, the spatial distribution of the diffuse emission shows deviation from that of the gas column density. The spectral shape of the diffuse emission are possibly variation in different longitude region. The WCDA measurements bridge the gap between the low-energy measurements by space detectors and the ultra-high-energy observations by LHAASO-KM2A and other experiments. These results suggest that improved modeling of the wide-band diffuse emission is required.

preprint2026arXiv

OEP: Poisoning Self-Evolving LLM Agents via Locally Correct but Non-Transferable Experiences

Memory-augmented large language model (LLM) agents use iterative reflection and self-evolution to solve complex tasks, but these mechanisms introduce security risks. Existing agentic memory attacks require privileged access or explicit malicious content, making them detectable by advanced safety filters. This leaves a subtler attack surface underexplored: whether adversaries can induce agent to generate experiences that appear locally correct and semantically plausible yet induce harmful generalization during reflection. We find that reflective agents are vulnerable to such clean experiences, especially when paired with severe but plausible hypothetical consequences. Based on this observation, we introduce Obsessive Experience Poisoning (OEP), a low-privilege black-box attack requiring no direct control over the system prompt or memory database. OEP constructs adversarial clean edge-cases that combine locally correct solutions, non-transferable methods, and severe consequences, biasing reflection toward risk-averse rule formation. During memory consolidation, agents may over-trust self-generated reflections and distill localized experiences into high-priority but over-generalized rules, causing downstream failures. Evaluations across three domains show that OEP achieves ASR above 50\% with GPT-4o agents, and outperforms existing attacks under LLM auditing defense.

preprint2026arXiv

OpenRT: An Open-Source Red Teaming Framework for Multimodal LLMs

The rapid integration of Multimodal Large Language Models (MLLMs) into critical applications is increasingly hindered by persistent safety vulnerabilities. However, existing red-teaming benchmarks are often fragmented, limited to single-turn text interactions, and lack the scalability required for systematic evaluation. To address this, we introduce OpenRT, a unified, modular, and high-throughput red-teaming framework designed for comprehensive MLLM safety evaluation. At its core, OpenRT architects a paradigm shift in automated red-teaming by introducing an adversarial kernel that enables modular separation across five critical dimensions: model integration, dataset management, attack strategies, judging methods, and evaluation metrics. By standardizing attack interfaces, it decouples adversarial logic from a high-throughput asynchronous runtime, enabling systematic scaling across diverse models. Our framework integrates 37 diverse attack methodologies, spanning white-box gradients, multi-modal perturbations, and sophisticated multi-agent evolutionary strategies. Through an extensive empirical study on 20 advanced models (including GPT-5.2, Claude 4.5, and Gemini 3 Pro), we expose critical safety gaps: even frontier models fail to generalize across attack paradigms, with leading models exhibiting average Attack Success Rates as high as 49.14%. Notably, our findings reveal that reasoning models do not inherently possess superior robustness against complex, multi-turn jailbreaks. By open-sourcing OpenRT, we provide a sustainable, extensible, and continuously maintained infrastructure that accelerates the development and standardization of AI safety.

preprint2026arXiv

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Spoken language models (SLMs) have advanced rapidly in recent years, accompanied by a growing number of evaluation benchmarks. However, most existing benchmarks emphasize task completion and capability scaling, while remaining poorly aligned with how users interact with SLMs in real-world spoken conversations. Effective spoken interaction requires not only accurate understanding of user intent and content, but also the ability to respond with appropriate interactional strategies. In this paper, we present TELEVAL, a dynamic, user-centered benchmark for evaluating SLMs in realistic Chinese spoken interaction scenarios. TELEVAL consolidates evaluation into two core aspects. Reliable Content Fulfillment assesses whether models can comprehend spoken inputs and produce semantically correct responses. Interactional Appropriateness evaluates whether models act as socially capable interlocutors, requiring them not only to generate human-like, colloquial responses, but also to implicitly incorporate paralinguistic cues for natural interaction. Experiments reveal that, despite strong performance on semantic and knowledge-oriented tasks, current SLMs still struggle to produce natural and interactionally appropriate responses, highlighting the need for more interaction-faithful evaluation.

preprint2026arXiv

The locally complexified-gentle algebras

We call an $\mathbb{R}$-algebra locally complexified-gentle if it becomes a locally gentle $\mathbb{C}$-algebra up to Morita equivalence after complexification. We use modulated quivers to introduce two types of locally complexified-gentle algebras and show that they are Morita equivalent to some semilinear clannish algebras.

preprint2026arXiv

Transient Large-Scale Anisotropy in TeV Cosmic Rays due to an Interplanetary Coronal Mass Ejection

Large- or medium-scale cosmic ray anisotropy at TeV energies has not previously been confirmed to vary with time. Transient anisotropy changes have been observed below 150 GeV, especially near the passage of an interplanetary shock and coronal mass ejection containing a magnetic flux rope ejected by a solar storm, which can trigger a geomagnetic storm with practical consequences. In such events, cosmic rays provide remote sensing of the magnetic field properties. Here we report the observation of transient large-scale anisotropy in TeV cosmic ray ions using data from the Large High Altitude Air Shower Observatory (LHAASO). We analyze hourly skymaps of the transient cosmic ray intensity excess or deficit, the gradient of which indicates the direction and magnitude of transient large-scale anisotropy across the field of view. We observe enhanced anisotropy above typical hourly fluctuations with $>$5$σ$ significance during some hours of November 4, 2021, in separate data sets for four primary cosmic ray energy ranges of median energy from $E$=0.7 to 3.1 TeV. The gradient varies with energy as $E^γ$, where $γ\approx-0.5$. At a median energy $\leq$1.0 TeV, this gradient corresponds to a dipole anisotropy of at least 1\%, or possibly a weaker anisotropy of higher order. This new type of observation opens the opportunity to study interplanetary magnetic structures using air shower arrays around the world, complementing existing in situ and remote measurements of plasma properties.

preprint2025arXiv

AgentTutor: Empowering Personalized Learning with Multi-Turn Interactive Teaching in Intelligent Education Systems

The rapid advancement of large-scale language models (LLMs) has shown their potential to transform intelligent education systems (IESs) through automated teaching and learning support applications. However, current IESs often rely on single-turn static question-answering, which fails to assess learners' cognitive levels, cannot adjust teaching strategies based on real-time feedback, and is limited to providing simple one-off responses. To address these issues, we introduce AgentTutor, a multi-turn interactive intelligent education system to empower personalized learning. It features an LLM-powered generative multi-agent system and a learner-specific personalized learning profile environment that dynamically optimizes and delivers teaching strategies based on learners' learning status, personalized goals, learning preferences, and multimodal study materials. It includes five key modules: curriculum decomposition, learner assessment, dynamic strategy, teaching reflection, and knowledge & experience memory. We conducted extensive experiments on multiple benchmark datasets, AgentTutor significantly enhances learners' performance while demonstrating strong effectiveness in multi-turn interactions and competitiveness in teaching quality among other baselines.

preprint2025arXiv

Cooling mechanical motion with polaritons

The strong coupling between light and matter gives rise to polaritons. Further coupling polaritons to phonons leads to the formation of hybrid polaromechanical systems. Recent experiments have achieved the strong coupling between polaritons and phonons in two configurations, namely, the magnon-photon-phonon and exciton-photon-phonon systems, which enables the control of mechanical motion via manipulating polaritons. Here, we present a polaromechanical cooling theory and explicitly show how two polaritons can be used to simultaneously cool two mechanical modes. The unique advantage of our protocol lies in the fact that the continuous tunability of the polariton frequencies over a wide range allows for the cooling of any two mechanical modes with their frequency difference falling within this range. We further discuss how to extend the theory to cool multiple mechanical modes. The protocol is designed for cooling mechanical motion in various emerging polaromechanical platforms, such as magnon-, exciton-, and plasmon-polaromechanical systems, which is the first step towards quantum states generation in these hybrid systems.

preprint2022arXiv

Homotopy Continuation Enhanced Branch and Bound Algorithms for Strongly Nonconvex Mixed-Integer Nonlinear Programming Problems

Large-scale strongly nonlinear and nonconvex mixed-integer nonlinear programming (MINLP) models frequently appear in optimisation-based process synthesis, integration, intensification, and process control. However, they are usually difficult to solve by existing algorithms within an acceptable time. In this work, we propose two robust homotopy continuation enhanced branch and bound (HCBB) algorithms (denoted as HCBB-FP and HCBB-RB) where the homotopy continuation method is employed to gradually approach the optimum of the NLP subproblem at a node from the solution at its parent node. A variable step length is adapted to effectively balance feasibility and computational efficiency. The computational results from solving four existing process synthesis problems demonstrate that the proposed HCBB algorithms can find the same optimal solution from different initial points, while the existing MINLP algorithms fail or find much worse solutions. In addition, HCBB-RB is superior to HCBB-FP due to the much lower computational effort required for the same locally optimal solution.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint

Fields this researcher appears in

Source provenance