Source author record

Tiejun Li

Tiejun Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning cond-mat.stat-mech math-ph math.MP Molecular Networks Biological Physics math.PR physics.comp-ph Artificial Intelligence cond-mat.mtrl-sci math.NA math.OC Numerical Analysis physics.chem-ph

Catalog footprint

What is connected

13works

14topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

CrystalREPA: Transferring Physical Priors from Universal MLIPs to Crystal Generative Models

Crystal generative models mainly learn what stable crystals look like, with little explicit supervision for what makes them stable. We reveal a substantial representation gap between state-of-the-art crystal generative models and pretrained universal machine learning interatomic potentials (MLIPs) via energy probing, and show this gap can be closed by a simple training-time alignment. We propose Crystal REPresentation Alignment (CrystalREPA), a plug-and-play framework that aligns the atom-wise hidden states of generative encoders with frozen MLIP representations through an element-aware contrastive objective, transferring stability-aware atomistic priors with marginal training overhead and no additional inference cost. Across three generative frameworks, ten MLIP teachers, and two benchmark datasets, CrystalREPA consistently improves the thermodynamic stability, structural validity, and structural fidelity of generated crystals. Equally important, we find that an MLIP's transfer effectiveness is poorly predicted by its accuracy on standard leaderboards (e.g., Matbench Discovery) but strongly predicted by the distinguishability of its atom-wise representation space, yielding a practical, accuracy-independent criterion for selecting MLIP teachers for generative transfer.

preprint2026arXiv

Free Energy Surface Sampling via Reduced Flow Matching

Sampling the free energy surface, namely, the distribution of collective variables (CVs), is a crucial problem in statistical physics, as it underpins a better understanding of chemical reactions and conformational transitions. Traditional methods for free energy surface sampling involve simulation in high-dimensional configuration space and projecting the resulting configurations onto the CV space. To reduce the computational costs of such sampling, we propose FES-FM, a reduced flow matching (FM) method for free energy sampling (FES). We train a dynamical transport map in the CV space, thereby enabling direct sampling of the free energy surface. For many-particle systems, we construct a prior distribution based on the Hessian at a local minimum of the potential, which ensures both rotation-translation invariance and physically meaningful configurations. We evaluate the proposed method across a variety of potential functions and collective variables. Comparative experiments demonstrate that our approach drastically reduces computational costs while delivering superior accuracy per unit sampling time.

preprint2026arXiv

Improving the Euclidean Diffusion Generation of Manifold Data by Mitigating Score Function Singularity

Euclidean diffusion models have achieved remarkable success in generative modeling across diverse domains, and they have been extended to manifold cases in recent advances. Instead of explicitly utilizing the structure of special manifolds as studied in previous works, in this paper we investigate direct sampling of the Euclidean diffusion models for general manifold-structured data. We reveal the multiscale singularity of the score function in the ambient space, which hinders the accuracy of diffusion-generated samples. We then present an elaborate theoretical analysis of the singularity structure of the score function by decomposing it along the tangential and normal directions of the manifold. To mitigate the singularity and improve the sampling accuracy, we propose two novel methods: (1) Niso-DM, which reduces the scale discrepancies in the score function by utilizing a non-isotropic noise, and (2) Tango-DM, which trains only the tangential component of the score function using a tangential-only loss function. Numerical experiments demonstrate that our methods achieve superior performance on distributions over various manifolds with complex geometries.

preprint2026arXiv

Multi-Task Fine-Tuning Enables Robust Out-of-Distribution Generalization in Atomistic Models

Accurate de novo molecular and materials design requires structure-property models that generalize beyond known regimes. Although pretrained atomistic models achieve strong in-distribution accuracy after fine-tuning, their reliability under out-of-distribution (OOD) conditions remains unclear. We identify a critical failure mode in downstream adaptation: standard fine-tuning induces representation collapse, erasing pretrained chemical and structural priors and severely degrading OOD performance. To address this limitation, we propose multi-task fine-tuning (MFT), which jointly optimizes downstream property prediction with a physically grounded force-field objective inherited from pretraining. This approach preserves essential chemical priors while enabling task-specific adaptation. Across molecular and materials benchmarks, MFT consistently improves OOD generalization, approaching the theoretical limit set by in-distribution accuracy, while outperforming standard fine-tuning, training from scratch, and state-of-the-art task-specific models. These results establish safe adaptation as a central requirement for large atomistic models and position MFT as a practical and data-efficient pathway toward robust molecular and materials discovery.

preprint2022arXiv

Intrinsically Motivated Self-supervised Learning in Reinforcement Learning

In vision-based reinforcement learning (RL) tasks, it is prevalent to assign auxiliary tasks with a surrogate self-supervised loss so as to obtain more semantic representations and improve sample efficiency. However, abundant information in self-supervised auxiliary tasks has been disregarded, since the representation learning part and the decision-making part are separated. To sufficiently utilize information in auxiliary tasks, we present a simple yet effective idea to employ self-supervised loss as an intrinsic reward, called Intrinsically Motivated Self-Supervised learning in Reinforcement learning (IM-SSR). We formally show that the self-supervised loss can be decomposed as exploration for novel states and robustness improvement from nuisance elimination. IM-SSR can be effortlessly plugged into any reinforcement learning with self-supervised auxiliary objectives with nearly no additional cost. Combined with IM-SSR, the previous underlying algorithms achieve salient improvements on both sample efficiency and generalization in various vision-based robotics tasks from the DeepMind Control Suite, especially when the reward signal is sparse.

preprint2022arXiv

Solving eigenvalue PDEs of metastable diffusion processes using artificial neural networks

In this paper, we consider the eigenvalue PDE problem of the infinitesimal generators of metastable diffusion processes. We propose a numerical algorithm based on training artificial neural networks for solving the leading eigenvalues and eigenfunctions of such high-dimensional eigenvalue problem. The algorithm is able to find multiple leading eigenpairs by solving a single training task. It is useful in understanding the dynamical behaviors of metastable processes on large timescales. We demonstrate the capability of our algorithm on a high-dimensional model problem, and on the simple molecular system alanine dipeptide.

preprint2020arXiv

The Graph Limit of The Minimizer of The Onsager-Machlup Functional and Its Computation

The Onsager-Machlup (OM) functional is well-known for characterizing the most probable transition path of a diffusion process with non-vanishing noise. However, it suffers from a notorious issue that the functional is unbounded below when the specified transition time $T$ goes to infinity. This hinders the interpretation of the results obtained by minimizing the OM functional. We provide a new perspective on this issue. Under mild conditions, we show that although the infimum of the OM functional becomes unbounded when $T$ goes to infinity, the sequence of minimizers does contain convergent subsequences on the space of curves. The graph limit of this minimizing subsequence is an extremal of the abbreviated action functional, which is related to the OM functional via the Maupertuis principle with an optimal energy. We further propose an energy-climbing geometric minimization algorithm (EGMA) which identifies the optimal energy and the graph limit of the transition path simultaneously. This algorithm is successfully applied to several typical examples in rare event studies. Some interesting comparisons with the Freidlin-Wentzell action functional are also made.

preprint2016arXiv

Comment on "On Uniqueness of SDE Decomposition in A-type Stochastic Integration" [arXiv:1603.07927v1]

The uniqueness issue of SDE decomposition theory proposed by Ao and his co-workers has recently been discussed. A comprehensive study to investigate connections among different landscape theories [J. Chem. Phys. 144, 094109 (2016)] has pointed out that the decomposition is generally not unique, while Ao et al. (arXiv:1603.07927v1) argues that such conclusions are "incorrect" because of the missing boundary conditions. In this comment, we will combine literatures research and concrete examples to show that the concrete and effective boundary conditions have not been proposed to guarantee the uniqueness, hence the arguments in [arXiv:1603.07927v1] are not sufficient. Moreover, we show that the "uniqueness" of the O-U process decomposition referred by YTA paper is unable to serve as a counterexample to ZL's result since additional assumptions have been made implicitly beyond the original SDE decomposition framework, which cannot be applied to general nonlinear cases. Some other issues such as the failure of gradient expansion method will also be discussed. Our demonstration contributes to better understanding of the relevant papers as well as the SDE decomposition theory.

preprint2016arXiv

Large deviations for two scale chemical kinetic processes

We formulate the large deviations for a class of two scale chemical kinetic processes motivated from biological applications. The result is successfully applied to treat a genetic switching model with positive feedbacks. The corresponding Hamiltonian is convex with respect to the momentum variable as a by-product of the large deviation theory. This property ensures its superiority in the rare event simulations compared with the result obtained by formal WKB asymptotics. The result is of general interest to understand the large deviations for multiscale problems.

preprint2016arXiv

Two-scale large deviations for chemical reaction kinetics through second quantization path integral

Motivated by the study of rare events for a typical genetic switching model in systems biology, in this paper we aim to establish the general two-scale large deviations for chemical reaction systems. We build a formal approach to explicitly obtain the large deviation rate functionals for the considered two-scale processes based upon the second-quantization path integral technique. We get three important types of large deviation results when the underlying two times scales are in three different regimes. This is realized by singular perturbation analysis to the rate functionals obtained by path integral. We find that the three regimes possess the same deterministic mean-field limit but completely different chemical Langevin approximations. The obtained results are natural extensions of the classical large volume limit for chemical reactions. We also discuss its implication on the single-molecule Michaelis-Menten kinetics. Our framework and results can be applied to understand general multi-scale systems including diffusion processes.

preprint2015arXiv

Realization of Waddington's Metaphor: Potential Landscape, Quasi-potential, A-type Integral and Beyond

Motivated by the famous Waddington's epigenetic landscape metaphor in developmental biology, biophysicists and applied mathematicians made different proposals to realize this metaphor in a rationalized way. We adopt comprehensive perspectives to systematically investigate three different but closely related realizations in recent literature: namely the potential landscape theory from the steady state distribution of stochastic differential equations (SDEs), the quasi-potential from the large deviation theory, and the construction through SDE decomposition and A-type integral.The connections among these theories are established in this paper. We demonstrate that the quasi-potential is the zero noise limit of the potential landscape. We also show that the potential function in the third proposal coincides with the quasi-potential. The most probable transition path by minimizing the Onsager-Machlup or Freidlin-Wentzell action functional is discussed as well. Furthermore, we compare the difference between local and global quasi-potential through the exchange of limit order for time and noise amplitude. As a consequence of such explorations, we arrive at the existence result for the SDE decomposition while deny its uniqueness in general cases. It is also clarified that the A-type integral is more appropriate to be applied to the decomposed SDEs rather than the original one. Our results contribute to a better understanding of existing landscape theories for biological systems.

preprint2014arXiv

Finding Transition Pathways on Manifolds

We consider noise-induced transition paths in randomly perturbed dynami- cal systems on a smooth manifold. The classical Freidlin-Wentzell large devia- tion theory in Euclidean spaces is generalized and new forms of action functionals are derived in the spaces of functions and the space of curves to accommodate the intrinsic constraints associated with the manifold. Numerical meth- ods are proposed to compute the minimum action paths for the systems with constraints. The examples of conformational transition paths for a single and double rod molecules arising in polymer science are numerically investigated.

preprint2012arXiv

Transition Path, Quasi-potential Energy Landscape and Stability of Genetic Switches

One of the fundamental cellular processes governed by genetic regulatory networks in cells is the transition among different states under the intrinsic and extrinsic noise. Based on a two-state genetic switching model with positive feedback, we develop a framework to understand the metastability in gene expressions. This framework is comprised of identifying the transition path, reconstructing the global quasi-potential energy landscape, analyzing the uphill and downhill transition paths, etc. It is successfully utilized to investigate the stability of genetic switching models and fluctuation properties in different regimes of gene expression with positive feedback. The quasi-potential energy landscape, which is the rationalized version of Waddington potential, provides a quantitative tool to understand the metastability in more general biological processes with intrinsic noise.

Tiejun Li

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

CrystalREPA: Transferring Physical Priors from Universal MLIPs to Crystal Generative Models

Free Energy Surface Sampling via Reduced Flow Matching

Improving the Euclidean Diffusion Generation of Manifold Data by Mitigating Score Function Singularity

Multi-Task Fine-Tuning Enables Robust Out-of-Distribution Generalization in Atomistic Models

Intrinsically Motivated Self-supervised Learning in Reinforcement Learning

Solving eigenvalue PDEs of metastable diffusion processes using artificial neural networks

The Graph Limit of The Minimizer of The Onsager-Machlup Functional and Its Computation

Comment on "On Uniqueness of SDE Decomposition in A-type Stochastic Integration" [arXiv:1603.07927v1]

Large deviations for two scale chemical kinetic processes

Two-scale large deviations for chemical reaction kinetics through second quantization path integral

Realization of Waddington's Metaphor: Potential Landscape, Quasi-potential, A-type Integral and Beyond

Finding Transition Pathways on Manifolds

Transition Path, Quasi-potential Energy Landscape and Stability of Genetic Switches