Researcher profile

Qiang Du

Qiang Du contributes to research discovery and scholarly infrastructure.

Researcher - Affiliation not imported - Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
20 works
0 followers
14 topics
4 close collaborators

Published work

20 published items

preprint, 2026, arXiv

A Derivative-Free Saddle-search Algorithm With Linear Convergence Rate

We propose a derivative-free saddle-search algorithm designed to locate transition states using only function evaluations. The algorithm employs a nested architecture consisting of an inner eigenvector search and an outer saddle-point search. Through rigorous numerical analysis, we prove the almost sure convergence of the inner step under suitable assumptions. Furthermore, we establish the convergence of the outer search using a decaying step size, while demonstrating linear convergence under constant step size and boundedness conditions. Numerical experiments are provided to validate our theoretical results and demonstrate the algorithm's practical applicability.

preprint, 2025, arXiv

A Particle Algorithm for Mean-Field Variational Inference

Variational inference is a fast and scalable alternative to Markov chain Monte Carlo and has been widely applied to posterior inference tasks in statistics and machine learning. A traditional approach for implementing mean-field variational inference (MFVI) is coordinate ascent variational inference (CAVI), which relies crucially on parametric assumptions on complete conditionals. We introduce a novel particle-based algorithm for MFVI, named PArticle VI (PAVI), for nonparametric mean-field approximation. We obtain non-asymptotic error bounds for our algorithm. To our knowledge, this is the first end-to-end guarantee for particle-based MFVI.

preprint, 2023, arXiv

Asymptotically compatibility of a class of numerical schemes for a nonlocal traffic flow model

This paper considers numerical discretization of a nonlocal conservation law modeling vehicular traffic flows involving nonlocal inter-vehicle interactions. The nonlocal model involves an integral over the range measured by a horizon parameter and it recovers the local Lighthill-Richards-Whitham model as the nonlocal horizon parameter goes to zero. Good numerical schemes for simulating these parameterized nonlocal traffic flow models should be robust with respect to changes in the model parameters, but this has not been systematically investigated in the literature. We fill this gap through a careful study of a class of finite volume numerical schemes with suitable discretizations of the nonlocal integral, which include several schemes proposed in the literature and their variants. Our main contribution is to demonstrate the asymptotic compatibility of the schemes, which includes both the uniform convergence of the numerical solutions to the unique solution of the nonlocal continuum model for a given positive horizon parameter and the convergence to the unique entropy solution of the local model as the mesh size and the nonlocal horizon parameter go to zero simultaneously. It is shown that, with asymptotic compatibility, the schemes can provide robust numerical computation under changes in the nonlocal horizon parameter.

preprint, 2023, arXiv

Nonlocal boundary-value problems with local boundary conditions

We describe and analyze nonlocal integro-differential equations with classical local boundary conditions. The interaction kernel of the nonlocal operator has horizon parameter dependent on position in the domain, and vanishes as the boundary of the domain is approached. This heterogeneous localization allows for boundary values to be captured in the trace sense. We state and prove a nonlocal Green's identity for these nonlocal operators that involve local boundary terms. We use this identity to state and establish the well-posedness of variational formulations of the nonlocal problems with several types of classical boundary conditions. We show the consistency of these nonlocal boundary-value problems with their classical local counterparts in the vanishing horizon limit via the convergence of solutions. The Poisson data for the local boundary-value problem is permitted to be quite irregular, belonging to the dual of the classical Sobolev space. Heterogeneously mollifying this Poisson data for the local problem on the same length scale as the horizon and using the regularity of the interaction kernel, we show that the solutions to the nonlocal boundary-value problem with the mollified Poisson data actually belong to the classical Sobolev space, and converge weakly to the unique variational solution of the classical Poisson problem with original Poisson data.

preprint, 2022, arXiv

Feedback and control systems for future linear colliders: White Paper for Snowmass 2021 Topical Group AF07-RF

Particle accelerators for high energy physics will generate TeV-scale particle beams in large, multi-km size machines colliding high brightness beams at the interaction point [1-4]. The high luminosity in such machines is achieved by producing very small asymmetric beam sizes at the interaction point, with short durations to minimize beam-beam effects. Tuning energy, timing and position of the beam for optimal performance will require high-precision controls of amplitude and phase of high-frequency electromagnetic fields and real-time processing of complex algorithms. The stability of the colliding beams has a large impact on the collider's effective luminosity. Therefore, the technology readiness level of diagnostic and control systems will be a major consideration in the collider design. The technical requirements of such systems depend on the specifics of beam parameters, such as transverse and longitudinal dimensions, charge/pulse and beam pulse format, which are driven by the accelerating technology of choice. While feedback systems with single bunch position monitor resolution below 50 nm and latency <300 ns have been demonstrated in beam test facilities, many advanced collider concepts make use of higher repetition rates, brighter beams and higher accelerating frequencies, and will require better performance, by up to one to two orders of magnitude, demanding aggressive R&D to be able to deliver and maintain the targeted luminosity.

preprint, 2022, arXiv

On the convergence to local limit of nonlocal models with approximated interaction neighborhoods

Many nonlocal models have adopted Euclidean balls as the nonlocal interaction neighborhoods. When solving them numerically, it is sometimes convenient to adopt polygonal approximations of such balls. A crucial question is to what extent such approximations affect the nonlocal operators and the corresponding solutions. While recent works have analyzed this issue for a fixed horizon parameter, the question remains open in the case of a small or vanishing horizon parameter, which arises in many practical applications and has a significant impact on the reliability and robustness of nonlocal modeling and simulations. In this work, we address this issue and establish the convergence of the nonlocal solutions associated with polygonally approximated interaction neighborhoods to the local limit of the original nonlocal solutions. Our findings reveal that the new nonlocal solution does not converge to the correct local limit when the number of sides of the polygons is uniformly bounded. On the other hand, if the number of sides tends to infinity, the desired convergence can be established. These results may be used to guide future computational studies of nonlocal models.

preprint, 2022, arXiv

On the Ternary Ohta-Kawasaki Free Energy and Its One-dimensional Global Minimizers

We study the ternary Ohta-Kawasaki free energy that has been used to model triblock copolymer systems. Its one-dimensional global minimizers are conjectured to have cyclic patterns. However, some physical experiments and computer simulations found triblock copolymers forming noncyclic lamellar patterns. In this work, by comparing the free energies of the cyclic pattern and some noncyclic candidates, we show that the conjecture does not hold for some choices of parameters. Our results suggest that even in one dimension, the global minimizers may take on very different patterns in different parameter regimes. To unify the existing choices of the long range coefficient matrix, we present a reformulation of the long range term using a generalized charge interpretation, and thereby propose conditions on the matrix in order for the global minimizers to reproduce physically relevant nanostructures of block copolymers.

preprint, 2022, arXiv

Robustness test of the spacegroupMining model for determining space groups from atomic pair distribution function data

Machine learning models based on convolutional neural networks have been used for predicting space groups of crystal structures from their atomic pair distribution function (PDF). However, the PDFs used to train the model are calculated using a fixed set of parameters that reflect specific experimental conditions, and the accuracy of the model when given PDFs generated with different choices of these parameters is unknown. In this paper, we report that the results of the top-1 accuracy and top-6 accuracy are robust when applied to PDFs of different choices of experimental parameters $r_\text{max}$, $Q_\text{max}$, $Q_\text{damp}$ and atomic displacement parameters.

preprint, 2022, arXiv

The average distance problem with an Euler elastica penalization

We consider the minimization of an average distance functional defined on a two-dimensional domain $Ω$ with an Euler elastica penalization associated with $\partial Ω$, the boundary of $Ω$. The average distance is given by \begin{equation*} \int_Ω \operatorname{dist}^p(x,\partial Ω)\,\mathrm{d}x \end{equation*} where $p\geq 1$ is a given parameter, and $\operatorname{dist}(x,\partial Ω)$ is the Hausdorff distance between $\{x\}$ and $\partial Ω$. The penalty term is a multiple of the Euler elastica (i.e., the Helfrich bending energy or the Willmore energy) of the boundary curve $\partial Ω$, which is proportional to the integrated squared curvature defined on $\partial Ω$, as given by \begin{equation*} \lambda \int_{\partial Ω} κ_{\partial Ω}^2\,\mathrm{d}\mathcal{H}_{\llcorner \partial Ω}^1, \end{equation*} where $κ_{\partial Ω}$ denotes the (signed) curvature of $\partial Ω$ and $\lambda>0$ denotes a penalty constant. The domain $Ω$ is allowed to vary among compact, convex sets of $\mathbb{R}^2$ with Hausdorff dimension equal to $2$. Under no a priori assumptions on the regularity of the boundary $\partial Ω$, we prove the existence of minimizers of $E_{p,\lambda}$. Moreover, we establish the $C^{1,1}$-regularity of its minimizers. An original construction of a suitable family of competitors plays a decisive role in proving the regularity.

preprint, 2022, arXiv

The average distance problem with perimeter-to-area ratio penalization

In this paper we consider the functional \begin{equation*} E_{p,\lambda}(Ω):=\int_Ω\operatorname{dist}^p(x,\partial Ω)\,\mathrm{d}x+\lambda \frac{\mathcal{H}^1(\partial Ω)}{\mathcal{H}^2(Ω)}. \end{equation*} Here $p\geq 1$, $\lambda>0$ are given parameters, the unknown $Ω$ varies among compact, convex, Hausdorff two-dimensional sets of $\mathbb{R}^2$, $\partial Ω$ denotes the boundary of $Ω$, and $\operatorname{dist}(x,\partial Ω):=\inf_{y\in\partial Ω}|x-y|$. The integral term $\int_Ω\operatorname{dist}^p(x,\partial Ω)\,\mathrm{d}x$ quantifies the "easiness" for points in $Ω$ to reach the boundary, while $\frac{\mathcal{H}^1(\partial Ω)}{\mathcal{H}^2(Ω)}$ is the perimeter-to-area ratio. The main aim is to prove existence and $C^{1,1}$-regularity of minimizers of $E_{p,\lambda}$.
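
As a quick worked illustration (not taken from the paper), the functional above can be evaluated in closed form when the set is a closed disk $B_R$ of radius $R$, an admissible compact convex set. Here $\lambda$ denotes the penalty parameter, and the distance from $x$ to the boundary is $R-|x|$, so polar coordinates give:

```latex
\begin{align*}
\int_{B_R} \operatorname{dist}^p(x,\partial B_R)\,\mathrm{d}x
  &= 2\pi \int_0^R (R-r)^p\, r \,\mathrm{d}r
   = \frac{2\pi R^{p+2}}{(p+1)(p+2)}, \\
\frac{\mathcal{H}^1(\partial B_R)}{\mathcal{H}^2(B_R)}
  &= \frac{2\pi R}{\pi R^2} = \frac{2}{R}, \\
E_{p,\lambda}(B_R)
  &= \frac{2\pi R^{p+2}}{(p+1)(p+2)} + \frac{2\lambda}{R}.
\end{align*}
```

The first term grows with $R$ while the second decays, so among disks the energy is minimized at the finite radius $R^{p+3} = \lambda(p+1)/\pi$. This only compares disks with each other; it says nothing about whether a disk is a minimizer over all admissible sets.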

preprint, 2022, arXiv

The Discovery of Dynamics via Linear Multistep Methods and Deep Learning: Error Estimation

Identifying hidden dynamics from observed data is a significant and challenging task in a wide range of applications. Recently, the combination of linear multistep methods (LMMs) and deep learning has been successfully employed to discover dynamics, whereas a complete convergence analysis of this approach is still under development. In this work, we consider the deep network-based LMMs for the discovery of dynamics. We put forward error estimates for these methods using the approximation property of deep networks. It indicates, for certain families of LMMs, that the $\ell^2$ grid error is bounded by the sum of $O(h^p)$ and the network approximation error, where $h$ is the time step size and $p$ is the local truncation error order. Numerical results of several physically relevant examples are provided to demonstrate our theory.

preprint, 2021, arXiv

A fast two-stage algorithm for non-negative matrix factorization in streaming data

In this article, we study algorithms for nonnegative matrix factorization (NMF) in various applications involving streaming data. Utilizing the continual nature of the data, we develop a fast two-stage algorithm for highly efficient and accurate NMF. In the first stage, an alternating non-negative least squares (ANLS) framework is used, in combination with an active-set method with a warm-start strategy for the solution of subproblems. In the second stage, an interior point method is adopted to accelerate the local convergence. The convergence of the proposed algorithm is proved. The new algorithm is compared with some existing algorithms in benchmark tests using both real-world data and synthetic data. The results demonstrate the advantage of our algorithm in finding high-precision solutions.

preprint, 2021, arXiv

Physics-Informed Deep Learning for Traffic State Estimation

Traffic state estimation (TSE), which reconstructs the traffic variables (e.g., density) on road segments using partially observed data, plays an important role in the efficient traffic control and operation that intelligent transportation systems (ITS) need to provide. Over the decades, TSE approaches have bifurcated into two main categories: model-driven approaches and data-driven approaches. However, each of them has limitations: the former relies heavily on existing physical traffic flow models, such as Lighthill-Whitham-Richards (LWR) models, which may only capture limited dynamics of real-world traffic, resulting in low-quality estimation, while the latter requires massive data in order to perform accurate and generalizable estimation. To mitigate these limitations, this paper introduces a physics-informed deep learning (PIDL) framework to efficiently conduct high-quality TSE with small amounts of observed data. PIDL contains both model-driven and data-driven components, making it possible to integrate the strong points of both approaches while overcoming the shortcomings of either. This paper focuses on highway TSE with observed data from loop detectors, using traffic density as the traffic variable. We demonstrate the use of PIDL to solve (with data from loop detectors) two popular physical traffic flow models, i.e., Greenshields-based LWR and three-parameter-based LWR, and discover the model parameters. We then evaluate the PIDL-based highway TSE using the Next Generation SIMulation (NGSIM) dataset. The experimental results show the advantages of the PIDL-based approach in terms of estimation accuracy and data efficiency over advanced baseline TSE methods.
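
For orientation, the Greenshields-based LWR model mentioned above can be written as a scalar conservation law with a linear speed-density relation. The block below also shows the generic form of a physics-informed loss for a network output $\hat\rho_\theta$. The symbols $v_f$ (free-flow speed), $\rho_{\max}$ (jam density), the weights $\alpha,\beta$ and the point counts $N_o, N_c$ are notation assumed here for illustration, and the loss is the generic physics-informed form, not necessarily the exact objective used in the paper.

```latex
\begin{align*}
\partial_t \rho + \partial_x\bigl(\rho\, v(\rho)\bigr) &= 0,
\qquad
v(\rho) = v_f\Bigl(1 - \frac{\rho}{\rho_{\max}}\Bigr), \\
\mathcal{L}(\theta) &=
\frac{\alpha}{N_o}\sum_{i=1}^{N_o}
  \bigl|\hat\rho_\theta(x_i,t_i)-\rho_i\bigr|^2
+ \frac{\beta}{N_c}\sum_{j=1}^{N_c}
  \Bigl|\partial_t\hat\rho_\theta
  + \partial_x\bigl(\hat\rho_\theta\, v(\hat\rho_\theta)\bigr)\Bigr|^2_{(x_j,t_j)}.
\end{align*}
```

The first sum penalizes mismatch with the observed densities $\rho_i$ (the data-driven component), and the second penalizes the residual of the conservation law at collocation points (the model-driven component). Treating $v_f$ and $\rho_{\max}$ as trainable alongside $\theta$ is one way the model parameters could be discovered.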

preprint, 2020, arXiv

A Reinforced Topic-Aware Convolutional Sequence-to-Sequence Model for Abstractive Text Summarization

In this paper, we propose a deep learning approach to tackle automatic summarization tasks by incorporating topic information into the convolutional sequence-to-sequence (ConvS2S) model and using self-critical sequence training (SCST) for optimization. Through jointly attending to topics and word-level alignment, our approach can improve coherence, diversity, and informativeness of generated summaries via a biased probability generation mechanism. On the other hand, reinforcement training, like SCST, directly optimizes the proposed model with respect to the non-differentiable metric ROUGE, which also avoids the exposure bias during inference. We carry out the experimental evaluation with state-of-the-art methods over the Gigaword, DUC-2004, and LCSTS datasets. The empirical results demonstrate the superiority of our proposed method in abstractive summarization.

preprint, 2020, arXiv

Discovery of Dynamics Using Linear Multistep Methods

Linear multistep methods (LMMs) are popular time discretization techniques for the numerical solution of differential equations. Traditionally they are applied to solve for the state given the dynamics (the forward problem), but here we consider their application for learning the dynamics given the state (the inverse problem). This repurposing of LMMs is largely motivated by growing interest in data-driven modeling of dynamics, but the behavior and analysis of LMMs for discovery turn out to be significantly different from the well-known, existing theory for the forward problem. Assuming a highly idealized setting of being given the exact state with a zero residual of the discrete dynamics, we establish for the first time a rigorous framework based on refined notions of consistency and stability to yield convergence using LMMs for discovery. When applying these concepts to three popular $M$-step LMMs, the Adams-Bashforth, Adams-Moulton, and Backwards Differentiation Formula schemes, the new theory suggests that Adams-Bashforth for $M$ ranging from $1$ to $6$, Adams-Moulton for $M=0$ and $M=1$, and Backwards Differentiation Formula for all positive $M$ are convergent, and, otherwise, the methods are not convergent in general. In addition, we provide numerical experiments to both motivate and substantiate our theoretical analysis.
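
The inverse use of an LMM described above can be sketched for the simplest case, the 1-step Adams-Bashforth scheme (forward Euler). This is a minimal illustration under assumptions, not the paper's experiments: the test problem $x' = -2x$, the step sizes, and the helper names `discover_dynamics_ab1` and `max_error` are all hypothetical choices made here.

```python
import math


def discover_dynamics_ab1(states, h):
    """Recover f(x_n) from sampled states using the 1-step Adams-Bashforth
    (forward Euler) relation x_{n+1} - x_n = h * f(x_n), solved for f(x_n)."""
    return [(states[n + 1] - states[n]) / h for n in range(len(states) - 1)]


def max_error(h, t_end=1.0, rate=-2.0):
    """Largest grid error of the recovered dynamics for x' = rate * x, x(0) = 1,
    when the exact states x(t_n) = exp(rate * t_n) are supplied."""
    n_steps = round(t_end / h)
    states = [math.exp(rate * n * h) for n in range(n_steps + 1)]
    recovered = discover_dynamics_ab1(states, h)
    exact = [rate * x for x in states[:-1]]
    return max(abs(r - e) for r, e in zip(recovered, exact))


# Halving the step size should roughly halve the error (first-order accuracy).
err_coarse = max_error(0.02)
err_fine = max_error(0.01)
```

For multi-step schemes the same relation becomes a linear system coupling several unknown values of $f$, which is where the consistency and stability questions for discovery arise.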

preprint, 2020, arXiv

Maximum bound principles for a class of semilinear parabolic equations and exponential time differencing schemes

The ubiquity of semilinear parabolic equations has been illustrated in their numerous applications, ranging from physics and biology to materials and social sciences. In this paper, we consider a practically desirable property for a class of semilinear parabolic equations of the abstract form $u_t=\mathcal{L}u+f[u]$ with $\mathcal{L}$ being a linear dissipative operator and $f$ being a nonlinear operator in space, namely a time-invariant maximum bound principle, in the sense that the time-dependent solution $u$ preserves for all time a uniform pointwise bound in absolute value imposed by its initial and boundary conditions. We first study an analytical framework for some sufficient conditions on $\mathcal{L}$ and $f$ that lead to such a maximum bound principle for the time-continuous dynamic system of infinite or finite dimensions. Then, we utilize a suitable exponential time differencing approach with a properly chosen generator of contraction semigroup to develop first- and second-order accurate temporal discretization schemes that satisfy the maximum bound principle unconditionally in the time-discrete setting. Error estimates of the proposed schemes are derived along with their energy stability. Extensions to vector- and matrix-valued systems are also discussed. We demonstrate that the abstract framework and analysis techniques developed here offer an effective and unified approach to study the maximum bound principle of the abstract evolution equation that covers a wide variety of well-known models and their numerical discretization schemes. Some numerical experiments are also carried out to verify the theoretical results.
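
A toy scalar analogue may help to see why an exponential time differencing step can preserve a bound for any step size. The sketch below is an assumption-laden illustration, not the paper's scheme: it takes the ODE $u' = u - u^3$ (no spatial operator), adds and subtracts a stabilizing term $\kappa u$ with the hypothetical choice $\kappa = 2$, and applies a first-order ETD step. For $|u| \le 1$ and $\kappa \ge 2$, the function $N(u) = \kappa u + u - u^3$ is monotone with $|N(u)| \le \kappa$, so the update is a convex combination of values bounded by 1.

```python
import math


def etd1_step(u, tau, kappa=2.0):
    """One stabilized first-order ETD step for u' = u - u**3, rewritten as
    u' = -kappa*u + N(u) with N(u) = kappa*u + u - u**3 (kappa is assumed)."""
    nonlinear = kappa * u + u - u**3
    decay = math.exp(-kappa * tau)
    # Exact integration of the linear part, with N frozen at the old value.
    return decay * u + (1.0 - decay) / kappa * nonlinear


def integrate(u0, tau, n_steps, kappa=2.0):
    """Apply etd1_step repeatedly and return the whole trajectory."""
    trajectory = [u0]
    for _ in range(n_steps):
        trajectory.append(etd1_step(trajectory[-1], tau, kappa))
    return trajectory


# Even a very large step keeps the iterates inside [-1, 1].
large_step = integrate(0.9, tau=10.0, n_steps=50)
small_step = integrate(0.5, tau=0.1, n_steps=100)
```

With `kappa` below 2 the monotonicity argument no longer applies on the whole interval, which is why the stabilizing constant matters in this toy setting.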

preprint, 2020, arXiv

Nonlocal-in-time dynamics and crossover of diffusive regimes

We study a simple nonlocal-in-time dynamic system proposed for the effective modeling of complex diffusive regimes in heterogeneous media. We present its solutions and their commonly studied statistics such as the mean square distance. This model employs a nonlocal operator to replace the conventional first-order time derivative. It introduces a finite memory effect of a constant length encoded through a kernel function. The nonlocal-in-time operator is related, on the one hand, to fractional time derivatives that rely on the entire time history and, on the other hand, reduces to the classical time derivative as the length of the memory window diminishes. This allows us to demonstrate the effectiveness of the nonlocal-in-time model in capturing the crossover widely observed in nature between the initial sub-diffusion and the long-time normal diffusion.

preprint, 2020, arXiv

The Graph Limit of The Minimizer of The Onsager-Machlup Functional and Its Computation

The Onsager-Machlup (OM) functional is well-known for characterizing the most probable transition path of a diffusion process with non-vanishing noise. However, it suffers from a notorious issue that the functional is unbounded below when the specified transition time $T$ goes to infinity. This hinders the interpretation of the results obtained by minimizing the OM functional. We provide a new perspective on this issue. Under mild conditions, we show that although the infimum of the OM functional becomes unbounded when $T$ goes to infinity, the sequence of minimizers does contain convergent subsequences on the space of curves. The graph limit of this minimizing subsequence is an extremal of the abbreviated action functional, which is related to the OM functional via the Maupertuis principle with an optimal energy. We further propose an energy-climbing geometric minimization algorithm (EGMA) which identifies the optimal energy and the graph limit of the transition path simultaneously. This algorithm is successfully applied to several typical examples in rare event studies. Some interesting comparisons with the Freidlin-Wentzell action functional are also made.

preprint, 2019, arXiv

A cooperative game for automated learning of elasto-plasticity knowledge graphs and models with AI-guided experimentation

We introduce a multi-agent meta-modeling game to generate data, knowledge, and models that make predictions on constitutive responses of elasto-plastic materials. We introduce a new concept from graph theory where a modeler agent is tasked with evaluating all the modeling options recast as a directed multigraph and finding the optimal path that links the source of the directed graph (e.g. strain history) to the target (e.g. stress) measured by an objective function. Meanwhile, the data agent, which is tasked with generating data from real or virtual experiments (e.g. molecular dynamics, discrete element simulations), interacts with the modeling agent sequentially and uses reinforcement learning to design new experiments to optimize the prediction capacity. Consequently, this treatment enables us to emulate an idealized scientific collaboration as selections of the optimal choices in a decision tree search done automatically via deep reinforcement learning.