Researcher profile

Xinyu Li

Xinyu Li contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
18works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

18 published item(s)

preprint2026arXiv

Gradients with Respect to Semantics Preserving Embeddings Tell the Uncertainty of Large Language Models

Uncertainty quantification (UQ) is an important technique for ensuring the trustworthiness of LLMs, given their tendency to hallucinate. Existing state-of-the-art UQ approaches for free-form generation rely heavily on sampling, which incurs high computational cost and variance. In this work, we propose the first gradient-based UQ method for free-form generation, SemGrad, which is sampling-free and computationally efficient. Unlike prior gradient-based methods developed for classification tasks that operates in parameter space, we propose to consider gradients in semantic space. Our method builds on the key intuition that a confident LLM should maintain stable output distributions under semantically equivalent input perturbations. We interpret the stability as the gradients in semantic space and introduce a Semantic Preservation Score (SPS) to identify embeddings that best capture semantics, with respect to which gradients are computed. We further propose HybridGrad, which combines the strengths of SemGrad and parameter gradients. Experiments demonstrate that both of our methods provide efficient and effective uncertainty estimates, achieving superior performance than state-of-the-art methods, particularly in settings with multiple valid responses.

preprint2026arXiv

Signature Approach for Contextual Bandits with Nonlinear and Path-dependent Rewards

We study contextual bandits with nonlinear and path-dependent rewards through a novel signature-transform-based approach. Leveraging the universal nonlinearity property of signatures, we approximate continuous path-dependent reward functionals by linear functionals in the signature space. This representation enables the use of efficient linear contextual bandit methods while preserving expressive sequential structure. Building on this framework, we propose \texttt{DisSigUCB}, a signature-based disjoint upper confidence bound (UCB) algorithm. Under boundedness and non-degeneracy assumptions, we prove a high-probability data-dependent sublinear regret bound of order \(\tilde{\mathcal O}(\sqrt{(d+m)KT})\) where \(d\) is the context dimension and \(m\) is the signature feature dimension. Synthetic experiments and numerical applications on temperature sensor monitoring, sleep-stage classification, and hospital nurse staffing demonstrate that \texttt{DisSigUCB} consistently outperforms classical linear and kernelized contextual bandit baselines in nonlinear and path-dependent settings.

preprint2023arXiv

Construction of Wave Dark Matter Halos: Numerical Algorithm and Analytical Constraints

We present a wave generalization of the classic Schwarzschild method for constructing self-consistent halos -- such a halo consists of a suitable superposition of waves instead of particle orbits, chosen to yield a desired mean density profile. As an illustration, the method is applied to spherically symmetric halos. We derive an analytic relation between the particle distribution function and the wave superposition amplitudes, and show how it simplifies in the high energy (WKB) limit. We verify the stability of such constructed halos by numerically evolving the Schrödinger-Poisson system. The algorithm provides an efficient and accurate way to simulate the time-dependent halo substructures from wave interference. We use this method to construct halos with a variety of density profiles, all of which have a core from the ground-state wave function, though the core-halo relation need not be the standard one.

preprint2022arXiv

A New Knowledge Distillation Network for Incremental Few-Shot Surface Defect Detection

Surface defect detection is one of the most essential processes for industrial quality inspection. Deep learning-based surface defect detection methods have shown great potential. However, the well-performed models usually require large training data and can only detect defects that appeared in the training stage. When facing incremental few-shot data, defect detection models inevitably suffer from catastrophic forgetting and misclassification problem. To solve these problems, this paper proposes a new knowledge distillation network, called Dual Knowledge Align Network (DKAN). The proposed DKAN method follows a pretraining-finetuning transfer learning paradigm and a knowledge distillation framework is designed for fine-tuning. Specifically, an Incremental RCNN is proposed to achieve decoupled stable feature representation of different categories. Under this framework, a Feature Knowledge Align (FKA) loss is designed between class-agnostic feature maps to deal with catastrophic forgetting problems, and a Logit Knowledge Align (LKA) loss is deployed between logit distributions to tackle misclassification problems. Experiments have been conducted on the incremental Few-shot NEU-DET dataset and results show that DKAN outperforms other methods on various few-shot scenes, up to 6.65% on the mean Average Precision metric, which proves the effectiveness of the proposed method.

preprint2022arXiv

Dynamical Effects of Colliding Outflows in Binary Systems

The outflow of an object traveling in a fluid can shape the fluid morphology by forming a forward bow shock which accelerates the object via gravitational feedback. This dynamical effect, namely "dynamical anti-friction", has been studied in idealized infinite uniform media, which suffers from the convergence problem due to the long-range nature of gravitation. In this work, we conduct global 3D hydrodynamic simulations to study this effect in the scenario of a binary system, where the collision of outflows from both stars creates a suitable configuration. We demonstrate with simulations that a dense and slow outflow can give rise to a positive torque on the binary and lead to the expansion of the orbit. As an application, we show that binaries consisting of an AGB star and an outflowing pulsar can experience $\sim 10~\%$ orbit expansion during the AGB stage, in addition to the contribution from mass-loss. We also prove that the gravitational force drops as $O(r^{-3})$ from the center of mass in the binary scenarios, which guarantees a quick converge of the overall effect.

preprint2022arXiv

Improving short-term bike sharing demand forecast through an irregular convolutional neural network

As an important task for the management of bike sharing systems, accurate forecast of travel demand could facilitate dispatch and relocation of bicycles to improve user satisfaction. In recent years, many deep learning algorithms have been introduced to improve bicycle usage forecast. A typical practice is to integrate convolutional (CNN) and recurrent neural network (RNN) to capture spatial-temporal dependency in historical travel demand. For typical CNN, the convolution operation is conducted through a kernel that moves across a "matrix-format" city to extract features over spatially adjacent urban areas. This practice assumes that areas close to each other could provide useful information that improves prediction accuracy. However, bicycle usage in neighboring areas might not always be similar, given spatial variations in built environment characteristics and travel behavior that affect cycling activities. Yet, areas that are far apart can be relatively more similar in temporal usage patterns. To utilize the hidden linkage among these distant urban areas, the study proposes an irregular convolutional Long-Short Term Memory model (IrConv+LSTM) to improve short-term bike sharing demand forecast. The model modifies traditional CNN with irregular convolutional architecture to extract dependency among "semantic neighbors". The proposed model is evaluated with a set of benchmark models in five study sites, which include one dockless bike sharing system in Singapore, and four station-based systems in Chicago, Washington, D.C., New York, and London. We find that IrConv+LSTM outperforms other benchmark models in the five cities. The model also achieves superior performance in areas with varying levels of bicycle usage and during peak periods. The findings suggest that "thinking beyond spatial neighbors" can further improve short-term travel demand prediction of urban bike sharing systems.

preprint2022arXiv

Integrated Human Activity Sensing and Communications

Advances in wireless communication and signal processing facilitate integrated sensing and communication a compelling technology that intrinsically combines sensing and communication functionalities for the dual purpose exploitation of wireless hardware resources and pursues mutual benefits. Consequently, the next generation communications network will be perceptive. In this article, we provide a review of human related sensing in the context of ISAC. We first present a general ISAC receiver signal processing framework, with a focus on human activity recognition. Based on its specific spatial deployments, we then categorize ISAC HAR into monostatic, bistatic, and distributed deployments, and discuss their properties, critical research problems and solutions. To facilitate the system's realization and improve its recognition performance, we then explore the inherent connections between the physical layer system parameters and HAR performance metrics. Experimental results are presented for characterizing the sensing potentials of different ISAC systems. Finally, we review the technical challenges and identify the open research problems.

preprint2022arXiv

Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models

We propose a memory efficient method, named Stochastic Backpropagation (SBP), for training deep neural networks on videos. It is based on the finding that gradients from incomplete execution for backpropagation can still effectively train the models with minimal accuracy loss, which attributes to the high redundancy of video. SBP keeps all forward paths but randomly and independently removes the backward paths for each network layer in each training step. It reduces the GPU memory cost by eliminating the need to cache activation values corresponding to the dropped backward paths, whose amount can be controlled by an adjustable keep-ratio. Experiments show that SBP can be applied to a wide range of models for video tasks, leading to up to 80.0% GPU memory saving and 10% training speedup with less than 1% accuracy drop on action recognition and temporal action detection.

preprint2022arXiv

TubeR: Tubelet Transformer for Video Action Detection

We propose TubeR: a simple solution for spatio-temporal video action detection. Different from existing methods that depend on either an off-line actor detector or hand-designed actor-positional hypotheses like proposals or anchors, we propose to directly detect an action tubelet in a video by simultaneously performing action localization and recognition from a single representation. TubeR learns a set of tubelet-queries and utilizes a tubelet-attention module to model the dynamic spatio-temporal nature of a video clip, which effectively reinforces the model capacity compared to using actor-positional hypotheses in the spatio-temporal space. For videos containing transitional states or scene changes, we propose a context aware classification head to utilize short-term and long-term context to strengthen action classification, and an action switch regression head for detecting the precise temporal action extent. TubeR directly produces action tubelets with variable lengths and even maintains good results for long video clips. TubeR outperforms the previous state-of-the-art on commonly used action detection datasets AVA, UCF101-24 and JHMDB51-21.

preprint2022arXiv

What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions

We propose a novel one-stage Transformer-based semantic and spatial refined transformer (SSRT) to solve the Human-Object Interaction detection task, which requires to localize humans and objects, and predicts their interactions. Differently from previous Transformer-based HOI approaches, which mostly focus at improving the design of the decoder outputs for the final detection, SSRT introduces two new modules to help select the most relevant object-action pairs within an image and refine the queries' representation using rich semantic and spatial features. These enhancements lead to state-of-the-art results on the two most popular HOI benchmarks: V-COCO and HICO-DET.

preprint2021arXiv

Multi-Label Activity Recognition using Activity-specific Features and Activity Correlations

Multi-label activity recognition is designed for recognizing multiple activities that are performed simultaneously or sequentially in each video. Most recent activity recognition networks focus on single-activities, that assume only one activity in each video. These networks extract shared features for all the activities, which are not designed for multi-label activities. We introduce an approach to multi-label activity recognition that extracts independent feature descriptors for each activity and learns activity correlations. This structure can be trained end-to-end and plugged into any existing network structures for video classification. Our method outperformed state-of-the-art approaches on four multi-label activity recognition datasets. To better understand the activity-specific features that the system generated, we visualized these activity-specific features in the Charades dataset.

preprint2020arXiv

Directional Temporal Modeling for Action Recognition

Many current activity recognition models use 3D convolutional neural networks (e.g. I3D, I3D-NL) to generate local spatial-temporal features. However, such features do not encode clip-level ordered temporal information. In this paper, we introduce a channel independent directional convolution (CIDC) operation, which learns to model the temporal evolution among local features. By applying multiple CIDC units we construct a light-weight network that models the clip-level temporal evolution across multiple spatial scales. Our CIDC network can be attached to any activity recognition backbone network. We evaluate our method on four popular activity recognition datasets and consistently improve upon state-of-the-art techniques. We further visualize the activation map of our CIDC network and show that it is able to focus on more meaningful, action related parts of the frame.

preprint2020arXiv

Dual-comb delay spectroscopy with attometer resolution

Spectroscopy has attracted much attention in molecular detection, biomolecular identification, and chemical analysis for providing accurate measurement. However, it is almost unable to distinguish different sources with overlapped resonances in mixed analytes. Here, we present dual-comb delay spectroscopy to overcome this problem. The introduction of group delay spectroscopy provides a new tool to identify sources that would lead to overlapped resonances in intensity or phase spectroscopy. To obtain sufficiently high spectral resolution and signal-to-noise ratio for achieving reliable group delay spectrum, a probe comb with the wavelengths precisely scaned by a microwave source is applied, leading to attometer-level resolution and million-level signal-to-noise ratio. In an experiment, spectroscopy with an optional resolution up to 1 kHz (8 attometer), an average signal-to-noise ratio surpassing 2,000,000, and a span exceeding 33 nm is demonstrated. Two overlapped resonances from two different sources are clearly differentiated. Our work offers a new perspective for exploring the interaction between matter and light.

preprint2020arXiv

Oscillations and Random Walk of the Soliton Core in a Fuzzy Dark Matter Halo

A Fuzzy Dark Matter (FDM) halo consists of a soliton core close to the center and an NFW-like density profile in the outer region. Previous investigations found that the soliton core exhibits temporal oscillations and random walk excursions around the halo center. Analyzing a set of numerical simulations, we show that both phenomena can be understood as the results of wave interference -- a suitable superposition of the ground (solitonic) state and excited states in a fixed potential suffices to account for the main features of these phenomena. Such an eigenmode analysis can shed light on the evolution of a satellite halo undergoing tidal disruption. As the outer halo is stripped away, reducing the amplitudes of the excited states, the ground state evolves adiabatically. This suggests diminished soliton oscillations and random walk excursions, an effect to consider in deducing constraints from stellar heating.

preprint2020arXiv

Relativistic Alfvén Waves Entering Charge Starvation in the Magnetospheres of Neutron Stars

Instabilities in a neutron star can generate Alfvén waves in its magnetosphere. Propagation along the curved magnetic field lines strongly shears the wave, boosting its electric current $j_{\rm A}$. We derive an analytic expression for the evolution of the wave vector $\boldsymbol{k}$ and the growth of $j_{\rm A}$. In the strongly sheared regime, $j_{\rm A}$ may exceed the maximum current $j_{0}$ that can be supported by the background $e^{\pm}$ plasma. We investigate these "charge-starved" waves, first using a simplified two-fluid analytic model, then with first-principles kinetic simulations. We find that the Alfvén wave continues to propagate successfully even when $κ\equiv j_{\rm A}/j_{0} \gg 1$. It sustains $j_{\rm A}$ by compressing and advecting the plasma along the magnetic field lines with particle Lorentz factors $\sim κ^{1/2}$. The simulations show how plasma instabilities lead to gradual dissipation of the wave energy, giving a dissipation power $L_{\rm diss}\sim 10^{35}(κ/100)^{1/2} (B_w/10^{11}\,{\rm G})\,\mathrm{erg/s}$, where $B_w$ is the wave amplitude. Our results imply that dissipation due to charge starvation is not sufficient to power observed fast radio bursts (FRBs), in contrast to recent proposals.

preprint2020arXiv

Simulation of a Compact Object with Outflows Moving Through a Gaseous Background

A compact object moving relative to surrounding gas accretes material and perturbs the density of gas in its vicinity. In the classical picture of Bondi-Hoyle-Lyttleton accretion, the perturbation takes the form of an overdense wake behind the object, which exerts a dynamical friction drag. We use hydrodynamic simulations to investigate how the accretion rate and strength of dynamical friction are modified by the presence of outflow from the compact object. We show that the destruction of the wake by an outflow reduces dynamical friction, and reverses its sign when the outflow is strong enough, in good quantitative agreement with analytic calculations. For a strong isotropic outflow, the outcome on scales that we have simulated is a negative dynamical friction, i.e., net acceleration. For jet-like outflows driven by reprocessed accretion, both the rate of accretion and the magnitude of dynamical friction drop for more powerful jets. The accretion rate is strongly intermittent when the jet points to the same direction as the motion of the compact object. The dynamical effects of outflows may be important for the evolution of compact objects during the common envelope phase of binary systems, and for accreting compact objects and massive stars encountering AGN discs.

preprint2020arXiv

Vortices and waves in light dark matter

In a galactic halo like the Milky Way, bosonic dark matter particles lighter than about $30$ eV have a de Broglie wavelength larger than the average inter-particle separation and are therefore well described as a set of classical waves. This applies to, for instance, the QCD axion as well as to lighter axion-like particles such as fuzzy dark matter. We show that the interference of waves inside a halo inevitably leads to vortices, locations where chance destructive interference takes the density to zero. The phase of the wavefunction has non-trivial winding around these points. This can be interpreted as a non-zero velocity circulation, so that vortices are sites where the fluid velocity has a non-vanishing curl. Using analytic arguments and numerical simulations, we study the properties of vortices and show they have a number of universal features: (1) In three spatial dimensions, the generic defects take the form of vortex rings. (2) On average there is about one vortex ring per de Broglie volume and (3) generically only single winding ($\pm 1$) vortices are found in a realistic halo. (4) The density near a vortex scales as $r^2$ while the velocity goes as $1/r$, where $r$ is the distance to vortex. (5) A vortex segment moves at a velocity inversely proportional to its curvature scale so that smaller vortex rings move faster, allowing momentary motion exceeding escape velocity. We discuss observational/experimental signatures from vortices and, more broadly, wave interference. In the ultra-light regime, gravitational lensing by interference substructures leads to flux anomalies of $5-10 \%$ in strongly lensed systems. For QCD axions, vortices lead to a diminished signal in some detection experiments but not in others. We advocate the measurement of correlation functions by axion detection experiments as a way to probe and capitalize on the expected interference substructures.

preprint2019arXiv

Consequences of Energetic Magnetar-Like Outbursts of Nearby Neutron Stars: $^{14}$C Events and the Cosmic Electron Spectrum

Four significant events of rapid $^{14}$C increase have taken place within the past several thousand years. The physical origin of these rapid increases is still a mystery but must be associated with extremely energetic cosmic processes. Pulsars are highly magnetized neutron stars that emit a beam of electromagnetic radiations. Any sudden release of the energy stored in the magnetic multipole field will trigger outbursts similar to the giant flares of magnetars. Here we show that the relativistic outflow from the outbursts of a nearby pulsar interacting with the interstellar medium generates a shock, which accelerates electrons to trillions of electron volts. The high-energy photons from synchrotron emission of the shock interact with Earth's atmosphere, producing the cosmogenic nuclide $^{14}$C, which can cause the rapid $^{14}$C increases discovered in tree rings. These same relativistic electrons can account for a significant fraction of the cosmic electron spectrum in the trillion electron volts energy range, as observed by space-borne satellites. Since these outburst events can significantly affect our environment, monitoring nearby pulsars for such outbursts may be important in the future.