Source author record

Xuewen Liu

Xuewen Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

hep-ph math.RT astro-ph.HE astro-ph.IM Computer Vision hep-ex Machine Learning math.QA

Catalog footprint

What is connected

12works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Arena as Offline Reward: Efficient Fine-Grained Preference Optimization for Diffusion Models

Reinforcement learning from human feedback (RLHF) effectively promotes preference alignment of text-to-image (T2I) diffusion models. To improve computational efficiency, direct preference optimization (DPO), which avoids explicit reward modeling, has been widely studied. However, its reliance on binary feedback limits it to coarse-grained modeling on chosen-rejected pairs, resulting in suboptimal optimization. In this paper, we propose ArenaPO, which leverages Arena scores as offline rewards to provide refined feedback, thus achieving efficient and fine-grained optimization without a reward model. This enables ArenaPO to benefit from both the rich rewards of traditional RLHF and the efficiency of DPO. Specifically, we first construct a model Arena in which each model's capability is represented as a Gaussian distribution, and infer these capabilities by traversing the annotated pairwise preferences. Each output image is treated as a sample from the corresponding capability distribution. Then, for a image pair, conditioned on the two capability distributions and the observed pairwise preference, the absolute quality gap is estimated using latent-variable inference based on truncated normal distribution, which serves as fine-grained feedback during training. It does not require a reward model and can be computed offline, thus introducing no additional training overhead. We conduct ArenaPO training on Pick-a-Pic v2 and HPD v3 datasets, showing that ArenaPO consistently outperforms existing baselines.

preprint2026arXiv

OSAQ: Outlier Self-Absorption for Accurate Low-bit LLM Quantization

Large Language Models (LLMs) have demonstrated remarkable capabilities. However, their massive parameter scale leads to significant resource consumption and latency during inference. Post-training weight-only quantization offers a promising solution by reducing model size and accelerating token generation through alleviating the memory-bound issue. Nevertheless, the presence of inherent systematic outliers in weights continues to be a major obstacle. While existing methods, such as scaling and rotation, attempt to address this issue, the performance remains unsatisfactory. In this paper, we propose Outlier Self-Absorption Quantization (OSAQ), which performs additive weight suppression guided by the second-order low-rank property for low-bit weight-only quantization of LLMs. Specifically, we observe that the Hessian exhibits low-rank consistency across different inputs, with certain directions consistently showing vanishing curvature. Leveraging this property, we identify a stable null space of the Hessian and then construct an additive weight transformation by linearly combining the vectors within this null space, thereby suppressing weight outliers without affecting the task loss. This additive transformation can be absorbed into the weights offline, requiring no inter-layer transformations and introducing no inference overhead. Moreover, the construction is efficiently achieved by a closed-form solution, without resource-intensive training or iterative procedures. Extensive experiments demonstrate that OSAQ effectively suppresses outliers and enhances low-bit quantization performance. For instance, in 2-bit quantization, OSAQ, when integrated with GPTQ, achieves over 40% lower perplexity compared to vanilla GPTQ.

preprint2022arXiv

Correlating Gravitational Waves with $W$-boson Mass, FIMP Dark Matter, and Majorana Seesaw Mechanism

We study a minimal extension of the Standard Model by introducing three right-handed neutrinos and a new scotogenic scalar doublet, in which the mass splittings between neutral and charged components are responsible for the $W$-boson mass newly measured by the CDF collaboration. This model can not only generate non-vanishing Majorana neutrino masses via the interaction of right-handed neutrinos and scotogenic scalars, but also explain the Universe's missing matter in the form of FIMP dark matter. We also study the influence of the mass splitting on the first order electroweak phase transition, and find that it can further enhance the transition strength and thus induce gravitational waves during the phase transition, which may be detected in the forthcoming detectors such as U-DECIGO.

preprint2022arXiv

Photosphere Recession and Luminosity of Homologous Explosions Revisited

By assuming the photosphere located at the outmost edge of the ejecta, Arnett et al. (1980, 1982, 1989) presented the light curves of homologous explosions in supernovae analytically and numerically to include recombination effects. Actually as homologous expansion proceeds, the photosphere recedes deeper into the ejecta. In this situation, the photosphere radius increases at early times and decreases later on which can be described by a simple method proposed by Liu et al. (2018). To study how the photosphere recession effect the luminosity evolution, we impose a boundary condition on the photosphere to determine the spatial and time distribution of the temperature of the ejecta which is clarified to be reasonable. We find that the photosphere recession reduce the luminosity compared with the previous result without the recession, which can be tested with observations of Type-IIP supernovae.

preprint2021arXiv

Probing the flavor-specific scalar mediator for the muon $(g-2)$ deviation, the proton radius puzzle and the light dark matter production

Flavor-specific scalar bosons exist in various Standard Model extensions and couple to a single generation of fermions via a global flavor symmetry breaking mechanism. Given this strategy, we propose a MeV flavor-specific scalar model in dimension-$5$ operator series, which explains the muon g-2 anomaly and proton radius puzzle by coupling with the muon and down-quark at the same time. The framework is consistent with the null result of high-intensity searches. Specifically, the supernova constraints for muon couplings become weakened by including the contribution of down-quark interaction. The parameter space for explaining muon $g-2$ discrepancy is available when $10\%$ energy deposition is required in the energy explosion process in the supernova, but this is ruled out by the $1\%$ energy deposition requirement. We also investigate the searches for mediator and dark matter and the resulting constraints on viable parameter space such as nuclear physics constraints, direct detection for light boosted dark matter, and possible CMB constraints. When compared to conventional dark matter production, light dark matter production has two additional modifications: bound state formation and early kinetic equilibrium decoupling. We are now looking into the implications of these effects on the relic density of light dark matter.

preprint2020arXiv

$λ$-Differential operators and $λ$-differential modules for the Virasoro algebra

The concept of $λ$-differential operators is a natural generalization of differential operators and difference operators. In this paper, we determine the $λ$-differential Lie algebraic structure on the Witt algebra and the Virasoro algebra for invertible $λ$. Then we consider several families of modules over the Virasoro algebra with explicit module actions and determine the $λ$-differential module structures on them.

preprint2020arXiv

Wavefront prediction using artificial neural networks for open-loop Adaptive Optics

Latency in the control loop of adaptive optics (AO) systems can severely limit performance. Under the frozen flow hypothesis linear predictive control techniques can overcome this, however identification and tracking of relevant turbulent parameters (such as wind speeds) is required for such parametric techniques. This can complicate practical implementations and introduce stability issues when encountering variable conditions. Here we present a nonlinear wavefront predictor using a Long Short-Term Memory (LSTM) artificial neural network (ANN) that assumes no prior knowledge of the atmosphere and thus requires no user input. The ANN is designed to predict the open-loop wavefront slope measurements of a Shack-Hartmann wavefront sensor (SH-WFS) one frame in advance to compensate for a single-frame delay in a simulated $7\times7$ single-conjugate adaptive optics (SCAO) system operating at 150 Hz. We describe how the training regime of the LSTM ANN affects prediction performance and show how the performance of the predictor varies under various guide star magnitudes. We show that the prediction remains stable when both wind speed and direction are varying. We then extend our approach to a more realistic two-frame latency system. AO system performance when using the LSTM predictor is enhanced for all simulated conditions with prediction errors within 19.9 to 40.0 nm RMS of a latency-free system operating under the same conditions compared to a bandwidth error of $78.3\pm4.4$ nm RMS.

preprint2016arXiv

Exploring open-charm decay mode $Λ_c\barΛ_c$ of charmonium-like state $Y(4630)$

The newly observed $X, Y, Z$ exotic states are definitely not in the standard $Q\bar Q'$ structures, thus their existence composes a challenge to our understanding on the fundamental principles of hadron physics. Therefore the studies on their decay patterns which are determined by the non-perturbative QCD will definitely shed light on the concerned physics. Generally the four-quark states might be in a molecular state or tetraquark or their mixture. In this work, we adopt the suggestion that $Y(4630)$ is a charmonium-like tetraquark made of a diquark and an anti-diquark. If it is true, its favorable decay mode should be $Y(4630)$ decaying into an open-charm baryon pair, since such a transition occurs via strong interaction and is super-OZI-allowed. In this work, we calculate the decay width of $Y(4630)\toΛ_c\barΛ_c$ in the framework of the quark pair creation (QPC) model. Our numerical results on the partial width computed in the tetraquark configuration coincide with the Belle data within a certain error tolerance.

preprint2016arXiv

Study of structures and dynamical decay mechanisms for multiquark systems

The inner structures of the multiquark states are an interesting subject in hadron physics, generally they may be in tetraquark states which are composed of colored constituents, or in molecular states which are composed of two color singlets, or their mixtures. Therefore, the mechanisms which bind the constituents in a unique system and induce the multiquark states to decay would be different in those cases. In this work, using the quantum mechanics we analyze the dynamical mechanisms inducing decays of the tetraquarks where $Y(4630)$ stands as an example for the study, we also comment on the molecular states without making numerical computations.

preprint2016arXiv

Type-III two Higgs doublet model plus a pseudoscalar confronted with $h\rightarrowμτ$, muon $g-2$ and dark matter

In this work, we introduce an extra singlet pseudoscalar into the Type-III two Higgs doublet model (2HDM) which is supposed to solve a series of problems in the modern particle-cosmology. With existence of a light pseudoscalar, the $h\rightarrowμτ$ excess measured at CMS and as well as the $(g-2)_μ$ anomaly could be simultaneously explained within certain parameter spaces that can also tolerate the data on the flavor-violating processes $τ\rightarrowμγ$ and Higgs decay gained at LHC. Within the same parameter spaces, the DM relic abundance is well accounted. Moreover, the recently observed Galactic Center gamma ray excess(GCE) is proposed to realize through dark matter(DM) pair annihilations, and in this work, the scenario of the annihilation being mediated by the pseudoscalar is also addressed.

preprint2015arXiv

Study on direct pion emission in decay $D^{*+} \to D^+ π$

The QCD multipole expansion (QCDME) is based on the quantum field theory, so should be more reliable. However, on another aspect, it refers to the non-perturbative QCD , so that has a certain application range. Even though it successfully explains the data of transition among members of the $Υ$ ($ψ$) family, as Eichten indicates, beyond the production threshold of mediate states it fails to meet data by several orders. In this work, by studying a simple decay mode $D^*\to D+π^0$, where a pion may be emitted before $D^*$ transiting into $D$, we analyze the contribution of QCD multipole expansion. Whereas as the $Dπ$ portal is open, the dominant contribution is an OZI allowed process where a light quark-pair is excited out from vacuum and its contribution can be evaluated by the $^3P_0$ model. Since the direct pion emission is a process which is OZI suppressed and violates the isospin conservation, its contribution must be much smaller than the dominant one. By a careful calculation, we may quantitatively estimate how small the QCDME contribution should be and offer a quantitative interpretation for Eichten's statement.

preprint2010arXiv

Harish-Chandra modules over the $\Q$ Heisenberg-Virasoro Algebra

In this paper, it is proved that all irreducible Harish-Chandra modules over the $\Q$ Heisenberg-Virasoro algebra are of intermediate series (all weight spaces are 1-dimensional).

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint