Source author record

Jiajun Xu

Jiajun Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

hep-th astro-ph.CO gr-qc hep-ph astro-ph astro-ph.IM Computation and Language Computer Vision math.AG

Catalog footprint

What is connected

10works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

MIND: From Passive Mimicry to Active Reasoning through Capability-Aware Multi-Perspective CoT Distillation

While Large Language Models (LLMs) have emerged with remarkable capabilities in complex tasks through Chain-of-Thought reasoning, practical resource constraints have sparked interest in transferring these abilities to smaller models. However, achieving both domain performance and cross-domain generalization remains challenging. Existing approaches typically restrict students to following a single golden rationale and treat different reasoning paths independently. Due to distinct inductive biases and intrinsic preferences, alongside the student's evolving capacity and reasoning preferences during training, a teacher's "optimal" rationale could act as out-of-distribution noise. This misalignment leads to a degeneration of the student's latent reasoning distribution, causing suboptimal performance. To bridge this gap, we propose MIND, a capability-adaptive framework that transitions distillation from passive mimicry to active cognitive construction. We synthesize diverse teacher perspectives through a novel "Teaching Assistant" network. By employing a Feedback-Driven Inertia Calibration mechanism, this network utilizes inertia-filtered training loss to align supervision with the student's current adaptability, effectively enhancing performance while mitigating catastrophic forgetting. Extensive experiments demonstrate that MIND achieves state-of-the-art performance on both in-distribution and out-of-distribution benchmarks, and our sophisticated latent space analysis further confirms the mechanism of reasoning ability internalization.

preprint2026arXiv

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Visual generative models have achieved remarkable progress in synthesizing photorealistic images and videos, yet aligning their outputs with human preferences across critical dimensions remains a persistent challenge. Though reinforcement learning from human feedback offers promise for preference alignment, existing reward models for visual generation face limitations, including black-box scoring without interpretability and potentially resultant unexpected biases. We present VisionReward, a general framework for learning human visual preferences in both image and video generation. Specifically, we employ a hierarchical visual assessment framework to capture fine-grained human preferences, and leverages linear weighting to enable interpretable preference learning. Furthermore, we propose a multi-dimensional consistent strategy when using VisionReward as a reward model during preference optimization for visual generation. Experiments show that VisionReward can significantly outperform existing image and video reward models on both machine metrics and human evaluation. Notably, VisionReward surpasses VideoScore by 17.2% in preference prediction accuracy, and text-to-video models with VisionReward achieve a 31.6% higher pairwise win rate compared to the same models using VideoScore. All code and datasets are provided at https://github.com/THUDM/VisionReward.

preprint2020arXiv

The defining equations of a class of Richardson and flag varieties on Sp$_{2n}(k)$

This paper aims to focus on Richardson varieties on symplectic groups, especially their combinatorial characterization and defining equations. Schubert varieties and opposite Schubert varieties have profound significance in the study of generalized flag varieties which are not only research objects in algebraic geometry but also ones in representation theory. A more general research object is Richardson variety, which is obtained by the intersection of a Schubert variety and an opposite Schubert variety. The structure of Richardson variety on Grassmannian and its combinatorial characterization are well known, and there are also similar method on quotients of symplectic groups. In the first part of this paper, we calculate the orbit of the symplectic group action, and then rigorously give a method to describe the corresponding quotient by using the nesting subspace sequence of the linear space, i.e. flags. At the same time, the flag is used to describe the Schubert variety and Richardson variety on quotient of symplectic group. The flag varieties of Sp_{2n}(k)/P_d can be viewed as closed subvarieties of Grassmannian. Using the standard monomial theory, we obtain the generators of its ideal, i.e. its defining equations, in homogeneous coordinate ring of Grassmannian. Furthermore, we prove several properties of the type C standard monomial on the symplectic group flag variety. Defining equations of Richardson varieties on Sp_{2n}(k)/P_d are given as well.

preprint2014arXiv

MultiModeCode: An efficient numerical solver for multifield inflation

We present MultiModeCode, a Fortran 95/2000 package for the numerical exploration of multifield inflation models. This program facilitates efficient Monte Carlo sampling of prior probabilities for inflationary model parameters and initial conditions and is the first publicly available code that can efficiently generate large sample-sets for inflation models with $\mathcal O(100)$ fields. The code numerically solves the equations of motion for the background and first-order perturbations of multi-field inflation models with canonical kinetic terms and arbitrary potentials, providing the adiabatic, isocurvature, and tensor power spectra at the end of inflation. For models with sum-separable potentials MultiModeCode also computes the slow-roll prediction via the $δN$ formalism for easy model exploration and validation. We pay particular attention to the isocurvature perturbations as the system approaches the adiabatic limit, showing how to avoid numerical instabilities that affect some other approaches to this problem. We demonstrate the use of MultiModeCode by exploring a few toy models. Finally, we give a concise review of multifield perturbation theory and a user's manual for the program.

preprint2011arXiv

A Brachistochrone Approach to Reconstruct the Inflaton Potential

We propose a new way to implement an inflationary prior to a cosmological dataset that incorporates the inflationary observables at arbitrary order. This approach employs an exponential form for the Hubble parameter $H(ϕ)$ without taking the slow-roll approximation. At lowest non-trivial order, this $H(ϕ)$ has the unique property that it is the solution to the brachistochrone problem for inflation.

preprint2011arXiv

Duality Cascade in Brane Inflation

We show that brane inflation is very sensitive to tiny sharp features in extra dimensions, including those in the potential and in the warp factor. This can show up as observational signatures in the power spectrum and/or non-Gaussianities of the cosmic microwave background radiation (CMBR). One general example of such sharp features is a succession of small steps in a warped throat, caused by Seiberg duality cascade using gauge/gravity duality. We study the cosmological observational consequences of these steps in brane inflation. Since the steps come in a series, the prediction of other steps and their properties can be tested by future data and analysis. It is also possible that the steps are too close to be resolved in the power spectrum, in which case they may show up only in the non-Gaussianity of the CMB temperature fluctuations and/or EE polarization. We study two cases. In the slow-roll scenario where steps appear in the inflaton potential, the sensitivity of brane inflation to the height and width of the steps is increased by several orders of magnitude comparing to that in previously studied large field models. In the IR DBI scenario where steps appear in the warp factor, we find that the glitches in the power spectrum caused by these sharp features are generally small or even unobservable, but associated distinctive non-Gaussianity can be large. Together with its large negative running of the power spectrum index, this scenario clearly illustrates how rich and different a brane inflationary scenario can be when compared to generic slow-roll inflation. Such distinctive stringy features may provide a powerful probe of superstring theory.

preprint2011arXiv

Effective Field Theory and Decoupling in Multi-field Inflation: An Illustrative Case Study

We explore the effects of heavy degrees of freedom on the evolution and perturbations of light modes in multifield inflation. We use a simple two-field model as an example to illustrate the subtleties of integrating out massive fields in a time-dependent background. We show that when adiabaticity is violated due to a sharp turn in field space, the roles of massive and massless field are interchanged, and furthermore the fields are strongly coupled; thus the system cannot be described by an effective single field action. Further analysis shows that the sharp turn imparts a non Bunch-Davis component in each perturbation mode, leading to oscillatory features in the power spectrum, and a large resonantly enhanced bispectrum.

preprint2010arXiv

A Meandering Inflaton

If the cosmological inflationary scenario took place in the cosmic landscape in string theory, the inflaton, the scalar mode responsible for inflation, would have meandered in a complicated multi-dimensional potential. We show that this meandering property naturally leads to many e-folds of inflation, a necessary condition for a successful inflationary scenario. This behavior also leads to fluctuations in the primordial power spectrum of the cosmic microwave background radiation, which may be detected in a near future cosmic variance limited experiment like PLANCK.

preprint2010arXiv

Comment on Asymptotically Safe Inflation

We comment on Weinberg's interesting analysis of asymptotically safe inflation (arXiv:0911.3165). We find that even if the gravity theory exhibits an ultraviolet fixed point, the energy scale during inflation is way too low to drive the theory close to the fixed point value. We choose the specific renormalization groupflow away from the fixed point towards the infrared region that reproduces the Newton's constant and today's cosmological constant. We follow this RG flow path to scales below the Planck scale to study the stability of the inflationary scenario. Again, we find that some fine tuning is necessary to get enough efolds of infflation in the asymptotically safe inflationary scenario.

preprint2007arXiv

Comparing Brane Inflation to WMAP

We compare the simplest realistic brane inflationary model to recent cosmological data, including WMAP 3-year cosmic microwave background (CMB) results, Sloan Digital Sky Survey luminous red galaxies (SDSS LRG) power spectrum data and Supernovae Legacy Survey (SNLS) Type 1a supernovae distance measures. Here, the inflaton is simply the position of a $D3$-brane which is moving towards a $\bar{D}3$-brane sitting at the bottom of a throat (a warped, deformed conifold) in the flux compactified bulk in Type IIB string theory. The analysis includes both the usual slow-roll scenario and the Dirac-Born-Infeld scenario of slow but relativistic rolling. Requiring that the throat is inside the bulk greatly restricts the allowed parameter space. We discuss possible scenarios in which large tensor mode and/or non-Gaussianity may emerge. Here, the properties of a large tensor mode deviate from that in the usual slow-roll scenario, providing a possible stringy signature. Overall, within the brane inflationary scenario, the cosmological data is providing information about the properties of the compactification of the extra dimensions.

Jiajun Xu

What is connected

Connect this record

See the researcher in context

Building this map preview

10 published item(s)

MIND: From Passive Mimicry to Active Reasoning through Capability-Aware Multi-Perspective CoT Distillation

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

The defining equations of a class of Richardson and flag varieties on Sp$_{2n}(k)$

MultiModeCode: An efficient numerical solver for multifield inflation

A Brachistochrone Approach to Reconstruct the Inflaton Potential

Duality Cascade in Brane Inflation

Effective Field Theory and Decoupling in Multi-field Inflation: An Illustrative Case Study

A Meandering Inflaton

Comment on Asymptotically Safe Inflation

Comparing Brane Inflation to WMAP