Researcher profile

Xuewen Liu

Xuewen Liu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

Arena as Offline Reward: Efficient Fine-Grained Preference Optimization for Diffusion Models

Reinforcement learning from human feedback (RLHF) effectively promotes preference alignment of text-to-image (T2I) diffusion models. To improve computational efficiency, direct preference optimization (DPO), which avoids explicit reward modeling, has been widely studied. However, its reliance on binary feedback limits it to coarse-grained modeling on chosen-rejected pairs, resulting in suboptimal optimization. In this paper, we propose ArenaPO, which leverages Arena scores as offline rewards to provide refined feedback, thus achieving efficient and fine-grained optimization without a reward model. This enables ArenaPO to benefit from both the rich rewards of traditional RLHF and the efficiency of DPO. Specifically, we first construct a model Arena in which each model's capability is represented as a Gaussian distribution, and infer these capabilities by traversing the annotated pairwise preferences. Each output image is treated as a sample from the corresponding capability distribution. Then, for a image pair, conditioned on the two capability distributions and the observed pairwise preference, the absolute quality gap is estimated using latent-variable inference based on truncated normal distribution, which serves as fine-grained feedback during training. It does not require a reward model and can be computed offline, thus introducing no additional training overhead. We conduct ArenaPO training on Pick-a-Pic v2 and HPD v3 datasets, showing that ArenaPO consistently outperforms existing baselines.

preprint2026arXiv

OSAQ: Outlier Self-Absorption for Accurate Low-bit LLM Quantization

Large Language Models (LLMs) have demonstrated remarkable capabilities. However, their massive parameter scale leads to significant resource consumption and latency during inference. Post-training weight-only quantization offers a promising solution by reducing model size and accelerating token generation through alleviating the memory-bound issue. Nevertheless, the presence of inherent systematic outliers in weights continues to be a major obstacle. While existing methods, such as scaling and rotation, attempt to address this issue, the performance remains unsatisfactory. In this paper, we propose Outlier Self-Absorption Quantization (OSAQ), which performs additive weight suppression guided by the second-order low-rank property for low-bit weight-only quantization of LLMs. Specifically, we observe that the Hessian exhibits low-rank consistency across different inputs, with certain directions consistently showing vanishing curvature. Leveraging this property, we identify a stable null space of the Hessian and then construct an additive weight transformation by linearly combining the vectors within this null space, thereby suppressing weight outliers without affecting the task loss. This additive transformation can be absorbed into the weights offline, requiring no inter-layer transformations and introducing no inference overhead. Moreover, the construction is efficiently achieved by a closed-form solution, without resource-intensive training or iterative procedures. Extensive experiments demonstrate that OSAQ effectively suppresses outliers and enhances low-bit quantization performance. For instance, in 2-bit quantization, OSAQ, when integrated with GPTQ, achieves over 40% lower perplexity compared to vanilla GPTQ.

preprint2022arXiv

Correlating Gravitational Waves with $W$-boson Mass, FIMP Dark Matter, and Majorana Seesaw Mechanism

We study a minimal extension of the Standard Model by introducing three right-handed neutrinos and a new scotogenic scalar doublet, in which the mass splittings between neutral and charged components are responsible for the $W$-boson mass newly measured by the CDF collaboration. This model can not only generate non-vanishing Majorana neutrino masses via the interaction of right-handed neutrinos and scotogenic scalars, but also explain the Universe's missing matter in the form of FIMP dark matter. We also study the influence of the mass splitting on the first order electroweak phase transition, and find that it can further enhance the transition strength and thus induce gravitational waves during the phase transition, which may be detected in the forthcoming detectors such as U-DECIGO.

preprint2022arXiv

Photosphere Recession and Luminosity of Homologous Explosions Revisited

By assuming the photosphere located at the outmost edge of the ejecta, Arnett et al. (1980, 1982, 1989) presented the light curves of homologous explosions in supernovae analytically and numerically to include recombination effects. Actually as homologous expansion proceeds, the photosphere recedes deeper into the ejecta. In this situation, the photosphere radius increases at early times and decreases later on which can be described by a simple method proposed by Liu et al. (2018). To study how the photosphere recession effect the luminosity evolution, we impose a boundary condition on the photosphere to determine the spatial and time distribution of the temperature of the ejecta which is clarified to be reasonable. We find that the photosphere recession reduce the luminosity compared with the previous result without the recession, which can be tested with observations of Type-IIP supernovae.

preprint2021arXiv

Probing the flavor-specific scalar mediator for the muon $(g-2)$ deviation, the proton radius puzzle and the light dark matter production

Flavor-specific scalar bosons exist in various Standard Model extensions and couple to a single generation of fermions via a global flavor symmetry breaking mechanism. Given this strategy, we propose a MeV flavor-specific scalar model in dimension-$5$ operator series, which explains the muon g-2 anomaly and proton radius puzzle by coupling with the muon and down-quark at the same time. The framework is consistent with the null result of high-intensity searches. Specifically, the supernova constraints for muon couplings become weakened by including the contribution of down-quark interaction. The parameter space for explaining muon $g-2$ discrepancy is available when $10\%$ energy deposition is required in the energy explosion process in the supernova, but this is ruled out by the $1\%$ energy deposition requirement. We also investigate the searches for mediator and dark matter and the resulting constraints on viable parameter space such as nuclear physics constraints, direct detection for light boosted dark matter, and possible CMB constraints. When compared to conventional dark matter production, light dark matter production has two additional modifications: bound state formation and early kinetic equilibrium decoupling. We are now looking into the implications of these effects on the relic density of light dark matter.

preprint2020arXiv

$λ$-Differential operators and $λ$-differential modules for the Virasoro algebra

The concept of $λ$-differential operators is a natural generalization of differential operators and difference operators. In this paper, we determine the $λ$-differential Lie algebraic structure on the Witt algebra and the Virasoro algebra for invertible $λ$. Then we consider several families of modules over the Virasoro algebra with explicit module actions and determine the $λ$-differential module structures on them.

preprint2020arXiv

Wavefront prediction using artificial neural networks for open-loop Adaptive Optics

Latency in the control loop of adaptive optics (AO) systems can severely limit performance. Under the frozen flow hypothesis linear predictive control techniques can overcome this, however identification and tracking of relevant turbulent parameters (such as wind speeds) is required for such parametric techniques. This can complicate practical implementations and introduce stability issues when encountering variable conditions. Here we present a nonlinear wavefront predictor using a Long Short-Term Memory (LSTM) artificial neural network (ANN) that assumes no prior knowledge of the atmosphere and thus requires no user input. The ANN is designed to predict the open-loop wavefront slope measurements of a Shack-Hartmann wavefront sensor (SH-WFS) one frame in advance to compensate for a single-frame delay in a simulated $7\times7$ single-conjugate adaptive optics (SCAO) system operating at 150 Hz. We describe how the training regime of the LSTM ANN affects prediction performance and show how the performance of the predictor varies under various guide star magnitudes. We show that the prediction remains stable when both wind speed and direction are varying. We then extend our approach to a more realistic two-frame latency system. AO system performance when using the LSTM predictor is enhanced for all simulated conditions with prediction errors within 19.9 to 40.0 nm RMS of a latency-free system operating under the same conditions compared to a bandwidth error of $78.3\pm4.4$ nm RMS.