Researcher profile

Xian Gao

Xian Gao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

Beyond Uniform SVD:Dual-Level Optimization across Columns and Modules for LLM Compression

Low-rank decomposition, particularly Singular Value Decomposition (SVD), is a pivotal technique for mitigating the storage and computational demands of Large Language Models (LLMs). However, prevalent SVD-based approaches overlook the critical phenomenon that decomposition errors exhibit significant disparity across different components of the parameter matrix, often leading to suboptimal approximation. Furthermore, existing methods lack a direct metric to evaluate the importance of individual weight matrices. To address these limitations, we propose Duo-SVD (Dual-level Optimization SVD), a novel training-free framework that synergizes optimization at both the column and the module levels. First, Duo-SVD incorporates a Column-Preserving Strategy that explicitly retains columns exhibiting high decomposition errors, while applying low-rank approximation solely to those with lower errors. Second, at the module level, we employ a Module-Adaptive Allocation Strategy that formulates ratio allocation as a global constrained optimization problem based on perturbation-induced model deviation. Extensive experiments demonstrate that Duo-SVD consistently outperforms state-of-the-art SVD-based baselines and structured pruning methods, establishing it as a superior paradigm for efficient LLM compression.

preprint2026arXiv

On the Fragility of Data Attribution When Learning Is Distributed

Data attribution has become an important component of pricing, auditing, and governance in machine learning pipelines, yet most attribution methods implicitly assume that attribution values faithfully reflect participants' contributions. We show that this assumption can fail: a single participant in a standard distributed training workflow can substantially inflate its measured attribution value while preserving global utility. Our attribution-first attack uses latent optimization to inject small synthetic batches that preserve utility while exploiting non-IID label coverage and evaluator sensitivities. Across datasets, models, and multiple marginal-utility evaluators, the attack consistently increases the adversary's attribution value and reshapes the relative attribution structure among benign clients without degrading accuracy or triggering geometry-based defenses. These results show that attribution itself forms a new attack surface and motivate the development of attribution-robust and incentive-compatible scoring mechanisms.

preprint2026arXiv

OnlineMate: An LLM-Based Multi-Agent Companion System for Cognitive Support in Online Learning

In online learning environments, students often lack personalized peer interactions, which are crucial for cognitive development and learning engagement. Although previous studies have employed large language models (LLMs) to simulate interactive learning environments, these interactions are limited to conversational exchanges, failing to adapt to learners' individualized cognitive and psychological states. As a result, students' engagement is low and they struggle to gain inspiration. To address this challenge, we propose OnlineMate, a multi-agent learning companion system driven by LLMs integrated with Theory of Mind (ToM). OnlineMate simulates peer-like roles, infers learners' psychological states such as misunderstandings and confusion during collaborative discussions, and dynamically adjusts interaction strategies to support higher-order thinking. Comprehensive evaluations, including simulation-based experiments, human assessments, and real classroom trials, demonstrate that OnlineMate significantly promotes deep learning and cognitive engagement by elevating students' average cognitive level while substantially improving emotional engagement scores.

preprint2025arXiv

Degrees of freedom of quadratic scalar-nonmetricity theory

We study the number of degrees of freedom (DOFs) in quadratic scalar-nonmetricity (QSN) theory, whose Lagrangian is the linear combination of five quadratic nonmetricity invariants with coefficients depending on a dynamical scalar field. Working in the coincident gauge, we perform the Arnowitt-Deser-Misner decomposition and classify QSN models into distinct cases according to the numbers of their primary constraints. For cases that are physically viable in the sense that both a consistent cosmological background and tensor gravitational waves exist, we count the number of degrees of freedom based on two approaches. First we investigate the linear cosmological perturbations around an FLRW background. Then we perform a Dirac-Bergmann Hamiltonian constraint analysis to count the number of DOFs at the nonperturbative level. We focus on three representative cases. In case II, both the perturbative and nonperturbative approaches yield the same result, which indicates that the theory propagates 10 degrees of freedom. In contrast, in cases V and VI, the Hamiltonian analysis yields 8 degrees of freedom, while only 6 and 5 modes are visible at linear order in perturbations, respectively. This indicates that additional modes are strongly coupled on cosmological backgrounds.

preprint2022arXiv

Cosmological perturbations in the spatially covariant gravity with a dynamical lapse function

We investigate the scalar perturbations in a class of spatially covariant gravity theory with a dynamical lapse function. Generally, there are two scalar degrees of freedom due to the presence of the velocity of the lapse function. We treat the scalar perturbations as analogues of those in a two-field inflationary mode, in which one is light mode and the other is the heavy mode. This is justified by the fact that the scalar mode due to the dynamical lapse function becomes infinitely heavy in the limit when the lapse function reduces to be an auxiliary variable. The standard approaches of multiple filed perturbations can be applied to deal with our model. By integrating out the heavy mode and derive the effective theory for the single light field, we find the solution to the single mode in the form of plane waves. Then we calculate the corrections to the power spectrum of the light mode from the heavy mode, by making use of the standard perturbative method of field theory. At last, when the two fields are not weakly coupled, we find a power law mode for the coupled system in large scales.

preprint2021arXiv

Covariant 3+1 correspondence of the spatially covariant gravity and the degeneracy conditions

A necessary condition for a generally covariant scalar-tensor theory to be ghostfree is that it contains no extra degrees of freedom in the unitary gauge, in which the Lagrangian corresponds to the spatially covariant gravity. Comparing with analysing the scalar-tensor theory directly, it is simpler to map the spatially covariant gravity to the generally covariant scalar-tensor theory using the gauge recovering procedures. In order to ensure the resulting scalar-tensor theory to be ghostfree absolutely, i.e., no matter if the unitary gauge is accessible, a further covariant degeneracy/constraint analysis is required. We develop a method of covariant 3+1 correspondence, which map the spatially covariant gravity to the scalar-tensor theory in 3+1 decomposed form without fixing any coordinates. Then the degeneracy conditions to remove the extra degrees of freedom can be found easily. As an illustration of this approach, we show how the Horndeski theory is recovered from the spatially covariant gravity. This approach can be used to find more general ghostfree scalar-tensor theory.

preprint2020arXiv

Higher derivative scalar-tensor monomials and their classification

We make a full classification of scalar monomials built of the Riemann curvature tensor up to the quadratic order and of the covariant derivatives of the scalar field up to the third order. From the point of view of the effective field theory, the third or even higher order covariant derivatives of the scalar field are of the same order as the higher curvature terms, and thus should be taken into account. Moreover, higher curvature terms and higher order derivatives of the scalar field are complementary to each other, of which novel ghost-free combinations may exist. We make a systematic classification of all the possible monomials, according to the numbers of Riemann tensor and higher derivatives of the scalar field in each monomial. Complete basis of monomials at each order are derived, of which linear combinations may yield novel ghost-free Lagrangians. We also develop a diagrammatic representation of the monomials, which may help to simplify the analysis.

preprint2020arXiv

Minimally modified gravity with an auxiliary constraint: a Hamiltonian construction

Working directly with a general Hamiltonian for the spacetime metric with the $3+1$ decomposition and keeping only the spatial covariance, we investigate the possibility of reducing the number of degrees of freedom by introducing an auxiliary constraint. The auxiliary constraint is considered as part of the definition of the theory. Through a general Hamiltonian analysis, we find the conditions for the Hamiltonian as well as for the auxiliary constraint, under which the theory propagates two tensorial degrees of freedom only. The class of theories satisfying these conditions can be viewed as a new construction for the type-II minimally modified gravity theories, which propagate the same degrees of freedom of but are not equivalent to general relativity in the vacuum. We also illustrate our formalism by a concrete example, and derive the dispersion relation for the gravitational waves, which can be constrained by observations.

preprint2020arXiv

Structural phase transitions and photoluminescence mechanism in a layer of 3D hybrid perovskite nanocrystals

Although the structural phase transitions in single-crystal hybrid methyl-ammonium (MA) lead halide perovskites (MAPbX3, X = Cl, Br, I) are common phenomena, they have never been observed in the corresponding nanocrystals. Here we demonstrate that two-photon-excited photoluminescence (PL) spectroscopy is capable of monitoring the structural phase transitions in MAPbX3 nanocrystals because nonlinear susceptibilities govern the light absorption rates. We provide experimental evidence that the orthorhombic-to-tetragonal structural phase transition in a single layer of 20-nm-sized 3D MAPbBr3 nanocrystals is spread out within the 70 - 140 K range. This structural phase instability range arises because, unlike in single-crystal MAPbX3, free rotations of MA ions in the corresponding nanocrystals are no longer restricted by a long-range MA dipole order. The resulting configurational entropy loss can be even enhanced by the interfacial electric field arising due to charge separation at the MAPbBr3/ZnO heterointerface, extending the orthorhombic-to-tetragonal structural phase instability range from 70 to 230 K. We conclude that the weak sensitivity of conventional one-photon-excited PL spectroscopy to the structural phase transitions in 3D MAPbX3 nanocrystals results from the structural phase instability providing negligible distortions of PbX6 octahedra. In contrast, the intensity of two-photon-excited PL and electric-field-induced one-photon-excited PL still remains sensitive enough to weak structural distortions due to the higher rank tensor nature of nonlinear susceptibilities involved. We also show that room-temperature PL originates from the radiative recombination of the optical-phonon vibrationally excited polaronic quasiparticles with energies might exceed the ground-state Frohlich polaron and Rashba energies due to optical-phonon bottleneck.

preprint2019arXiv

Propagation of the gravitational waves in a cosmological background

We investigate the propagation of the gravitational waves in a cosmological background. Based on the framework of spatially covariant gravity, we derive the general quadratic action for the gravitational waves. The spatial derivatives of the extrinsic curvature and the parity-violating terms are systematically introduced. Special attention is paid to the propagation speed of the gravitational waves. We find that it is possible to make the two polarization modes propagate in the same speed, which may differ from that of the light, in the presence of parity-violating terms in the action. In particular, we identify a large class of spatially covariant gravity theories with parity violation, in which both the polarization modes propagate in the speed of light. Our results imply that there are more possibilities in the framework of spatially covariant gravity in light of the propagation speed of the gravitational waves.

preprint2019arXiv

Spatially covariant gravity theories with two tensorial degrees of freedom: the formalism

Within the general framework of spatially covariant theories of gravity, we study the conditions for having only the two tensorial degrees of freedom. Generally, there are three degrees of freedom propagating in the theory, of which two are tensorial and one is of the scalar type. Through a detailed Hamiltonian analysis, we find two necessary and sufficient conditions to evade the scalar type degree of freedom. The first condition implies that the lapse-extrinsic curvature sector must be degenerate. The second condition ensures that the dimension of the phase space at each spacetime point is even, so that the scalar type degree of freedom is eliminated completely. We also compare our results with the previous studies, and apply our formalism to a simple example, in which the Lagrangian is quadratic in the extrinsic curvature.