Researcher profile

Zhenli Xu

Zhenli Xu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

Random Batch Sum-of-Gaussians Method for Molecular Dynamics of Born-Mayer-Huggins Systems

The Born-Mayer-Huggins (BMH) potential, which combines Coulomb interactions with dispersion and short-range exponential repulsion, is widely used for ionic materials such as molten salts. However, large-scale molecular dynamics simulations of BMH systems are often limited by computation, communication, and memory costs. We recently proposed the random batch sum-of-Gaussians (RBSOG) method, which accelerates Coulomb calculations by using a sum-of-Gaussians (SOG) decomposition to split the potential into short- and long-range parts and by applying importance sampling in Fourier space for the long-range part. In this work, we extend the RBSOG to BMH systems and incorporate a random batch list (RBL) scheme to further accelerate the short-range part, yielding a unified framework for efficient simulations with the BMH potential. The combination of the SOG decomposition and the RBL enables an efficient and scalable treatment of both long- and short-range interactions in BMH system, particularly the RBL well handles the medium-range exponential repulsion and dispersion by the random batch neighbor list. Error estimate is provided to show the theoretical convergence of the RBL force. We evaluate the framework on molten NaCl and mixed alkali halide with up to $5\times10^6$ atoms on $2048$ CPU cores. Compared to the Ewald-based particle-particle particle-mesh method and the RBSOG-only method, our method achieves approximately $4\sim10\times$ and $2\times$ speedups while using $1000$ cores, respectively, under the same level of structural and thermodynamic accuracy and with a reduced memory usage. These results demonstrate the attractive performance of our method in accuracy and scalability for MD simulations with long-range interactions.

preprint2022arXiv

Improved random batch Ewald method in molecular dynamics simulations

The random batch Ewald (RBE) is an efficient and accurate method for molecular dynamics (MD) simulations of physical systems at the nano-/micro- scale. The method shows great potential to solve the computational bottleneck of long-range interactions, motivating a necessity to accelerating short-range components of the non-bonded interactions for a further speedup of MD simulations. In this work, we present an improved RBE method for the non-bonding interactions by introducing the random batch idea to constructing neighbor lists for the treatment of both the short-range part of the Ewald splitting and the Lennard-Jones potential. The efficiency of the novel neighbor list algorithm owes to the stochastic minibatch strategy which can significantly reduce the total number of neighbors. We obtan the error estimate and convergence by theoretical analysis and implement the improved RBE method in the LAMMPS package. Benchmark simulations are performed to demonstrate the accuracy and stability of the algorithm. Numerical tests on computer performance by conducting large-scaled MD simulations for systems including up to 0.1 billion water molecules, run on massive cluster with up to 50 thousand CPU cores, demonstrating the attractive features such as the high parallel scalability and memory-saving of the method in comparison to the existing methods.

preprint2022arXiv

Random batch sum-of-Gaussians method for molecular dynamics simulations of particle systems

We develop an accurate, highly efficient and scalable random batch sum-of-Gaussians (RBSOG) method for molecular dynamics simulations of systems with long-range interactions. The idea of the RBSOG method is based on a sum-of-Gaussians decomposition of the Coulomb kernel, and then a random batch importance sampling on the Fourier space is employed for approximating the summation of the Fourier expansion of the Gaussians with large bandwidths (the long-range components). The importance sampling significantly reduces the computational cost, resulting in a scalable algorithm by avoiding the use of communication-intensive fast Fourier transform. Theoretical analysis is present to demonstrate the unbiasedness of the approximate force, the controllability of variance and the weak convergence of the algorithm. The resulting method has $\mathcal{O}(N)$ complexity with low communication latency. Accurate simulation results on both dynamical and equilibrium properties of benchmark problems are reported to illustrate the attractive performance of the method. Simulations on parallel computing are also performed to show the high parallel efficiency. The RBSOG method can be straightforwardly extended to more general interactions with long ranged kernels, and thus is promising to construct fast algorithms of a series of molecular dynamics methods for various interacting kernels.

preprint2021arXiv

A kernel-independent sum-of-Gaussians method by de la Vallée-Poussin sums

Approximation of interacting kernels by sum of Gaussians (SOG) is frequently required in many applications of scientific and engineering computing in order to construct efficient algorithms for kernel summation or convolution problems. In this paper, we propose a kernel-independent SOG method by introducing the de la Vallée-Poussin sum and Chebyshev polynomials. The SOG works for general interacting kernels and the lower bound of Gaussian bandwidths is tunable and thus the Gaussians can be easily summed by fast Gaussian algorithms. The number of Gaussians can be further reduced via the model reduction based on the balanced truncation based on the square root method. Numerical results on the accuracy and model reduction efficiency show attractive performance of the proposed method.

preprint2021arXiv

HSMA: An O(N) electrostatics package implemented in LAMMPS

We implement two recently developed fast Coulomb solvers, HSMA3D [J. Chem. Phys. 149 (8) (2018) 084111] and HSMA2D [J. Chem. Phys. 152 (13) (2020) 134109], into a new user package HSMA for molecular dynamics simulation engine LAMMPS. The HSMA package is designed for efficient and accurate modeling of electrostatic interactions in 3D and 2D periodic systems with dielectric effects at the O(N) cost. The implementation is hybrid MPI and OpenMP parallelized and compatible with existing LAMMPS functionalities. The vectorization technique following AVX512 instructions is adopted for acceleration. To establish the validity of our implementation, we have presented extensive comparisons to the widely used particle-particle particle-mesh (PPPM) algorithm in LAMMPS and other dielectric solvers. With the proper choice of algorithm parameters and parallelization setup, the package enables calculations of electrostatic interactions that outperform the standard PPPM in speed for a wide range of particle numbers.

preprint2021arXiv

Linear-Scaling Selected Inversion based on Hierarchical Interpolative Factorization for Self Green's Function for Modified Poisson-Boltzmann Equation in Two Dimensions

This paper studies an efficient numerical method for solving modified Poisson-Boltzmann (MPB) equations with the self Green's function as a state equation to describe electrostatic correlations in ionic systems. Previously, the most expensive point of the MPB solver is the evaluation of Green's function. The evaluation of Green's function requires solving high-dimensional partial differential equations, which is the computational bottleneck for solving MPB equations. Numerically, the MPB solver only requires the evaluation of Green's function as the diagonal part of the inverse of the discrete elliptic differential operator of the Debye-Hückel equation. Therefore, we develop a fast algorithm by a coupling of the selected inversion and hierarchical interpolative factorization. By the interpolative factorization, our new selected inverse algorithm achieves linear scaling to compute the diagonal of the inverse of this discrete operator. The accuracy and efficiency of the proposed algorithm will be demonstrated by extensive numerical results for solving MPB equations.

preprint2021arXiv

Superscalability of the random batch Ewald method

Coulomb interaction, following an inverse-square force-law, quantifies the amount of force between two stationary and electrically charged particles. The long-range nature of Coulomb interactions poses a major challenge to molecular dynamics simulations which are major tools for problems at the nano-/micro- scale. Various algorithms are developed to calculate the pairwise Coulomb interactions to a linear scaling but the poor scalability limits the size of simulated systems. Here, we conduct an efficient molecular dynamics algorithm with the random batch Ewald method on all-atom systems where the complete Fourier components in the Coulomb interaction are replaced by randomly selected mini-batches. By simulating the $N$-body systems up to 100 million particles using $10$ thousand CPU cores, we show that this algorithm furnishes $O(N)$ complexity, almost perfect scalability and an order of magnitude faster computational speed when compared to the existing state-of-the-art algorithms. Further examinations of our algorithm on distinct systems, including pure water, micro-phase-separated electrolyte and protein solution demonstrate that the spatiotemporal information on all time and length scales investigated and thermodynamic quantities derived from our algorithm are in perfect agreement with those obtained from the existing algorithms. Therefore, our algorithm provides a breakthrough solution on scalability of computing the Coulomb interaction. It is particularly useful and cost-effective to simulate ultra-large systems, which was either impossible or very costing to conduct using existing algorithms, thus would benefit the broad community of sciences.

preprint2020arXiv

L1-based reduced over collocation and hyper reduction for steady state and time-dependent nonlinear equations

The task of repeatedly solving parametrized partial differential equations (pPDEs) in, e.g. optimization or interactive applications, makes it imperative to design highly efficient and equally accurate surrogate models. The reduced basis method (RBM) presents as such an option. Enabled by a mathematically rigorous error estimator, RBM constructs a low-dimensional subspace of the parameter-induced high fidelity solution manifold from which an approximate solution is computed. It can improve efficiency by several orders of magnitudes leveraging an offline-online decomposition procedure. However, this decomposition, usually through the empirical interpolation method (EIM) when the PDE is nonlinear or its parameter dependence nonaffine, is either challenging to implement, or severely degrades online efficiency. In this paper, we augment and extend the EIM approach as a direct solver, as opposed to an assistant, for solving nonlinear pPDEs on the reduced level. The resulting method, called Reduced Over-Collocation method (ROC), is stable and capable of avoiding the efficiency degradation inherent to a traditional application of EIM. Two critical ingredients of the scheme are collocation at about twice as many locations as the dimension of the reduced solution space, and an efficient L1-norm-based error indicator for the strategic selection of the parameter values to build the reduced solution space. Together, these two ingredients render the proposed L1-ROC scheme both offline- and online-efficient. A distinctive feature is that the efficiency degradation appearing in alternative RBM approaches that utilize EIM for nonlinear and nonaffine problems is circumvented, both in the offline and online stages. Numerical tests on different families of time-dependent and steady-state nonlinear problems demonstrate the high efficiency and accuracy of L1-ROC and its superior stability performance.