Source author record

Fei Lu

Fei Lu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

24works

20topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2025arXiv

From Events to Trending: A Multi-Stage Hotspots Detection Method Based on Generative Query Indexing

LLM-based conversational systems have become a popular gateway for information access, yet most existing chatbots struggle to handle news-related trending queries effectively. To improve user experience, an effective trending query detection method is urgently needed to enable differentiated processing of such target traffic. However, current research on trending detection tailored to the dialogue system scenario remains largely unexplored, and methods designed for traditional search engines often underperform in conversational contexts due to radically distinct query distributions and expression patterns. To fill this gap, we propose a multi-stage framework for trending detection, which achieves systematic optimization from both offline generation and online identification perspectives. Specifically, our framework first exploits selected hot events to generate index queries, establishing a key bridge between static events and dynamic user queries. It then employs a retrieval matching mechanism for real-time online detection of trending queries, where we introduce a cascaded recall and ranking architecture to balance detection efficiency and accuracy. Furthermore, to better adapt to the practical application scenario, our framework adopts a single-recall module as a cold-start strategy to collect online data for fine-tuning the reranker. Extensive experiments demonstrate that our framework significantly outperforms baseline methods in both offline evaluations and online A/B tests, and user satisfaction is relatively improved by 27\% in terms of positive-negative feedback ratio.

preprint2024arXiv

Scalable iterative data-adaptive RKHS regularization

We present iDARR, a scalable iterative Data-Adaptive RKHS Regularization method, for solving ill-posed linear inverse problems. The method searches for solutions in subspaces where the true solution can be identified, with the data-adaptive RKHS penalizing the spaces of small singular values. At the core of the method is a new generalized Golub-Kahan bidiagonalization procedure that recursively constructs orthonormal bases for a sequence of RKHS-restricted Krylov subspaces. The method is scalable with a complexity of $O(kmn)$ for $m$-by-$n$ matrices with $k$ denoting the iteration numbers. Numerical tests on the Fredholm integral equation and 2D image deblurring show that it outperforms the widely used $L^2$ and $l^2$ norms, producing stable accurate solutions consistently converging when the noise level decays.

preprint2022arXiv

Data adaptive RKHS Tikhonov regularization for learning kernels in operators

We present DARTR: a Data Adaptive RKHS Tikhonov Regularization method for the linear inverse problem of nonparametric learning of function parameters in operators. A key ingredient is a system intrinsic data-adaptive (SIDA) RKHS, whose norm restricts the learning to take place in the function space of identifiability. DARTR utilizes this norm and selects the regularization parameter by the L-curve method. We illustrate its performance in examples including integral operators, nonlinear operators and nonlocal operators with discrete synthetic data. Numerical results show that DARTR leads to an accurate estimator robust to both numerical error due to discrete data and noise in data, and the estimator converges at a consistent rate as the data mesh refines under different levels of noises, outperforming two baseline regularizers using $l^2$ and $L^2$ norms.

preprint2022arXiv

Nonparametric learning of kernels in nonlocal operators

Nonlocal operators with integral kernels have become a popular tool for designing solution maps between function spaces, due to their efficiency in representing long-range dependence and the attractive feature of being resolution-invariant. In this work, we provide a rigorous identifiability analysis and convergence study for the learning of kernels in nonlocal operators. It is found that the kernel learning is an ill-posed or even ill-defined inverse problem, leading to divergent estimators in the presence of modeling errors or measurement noises. To resolve this issue, we propose a nonparametric regression algorithm with a novel data adaptive RKHS Tikhonov regularization method based on the function space of identifiability. The method yields a noisy-robust convergent estimator of the kernel as the data resolution refines, on both synthetic and real-world datasets. In particular, the method successfully learns a homogenized model for the stress wave propagation in a heterogeneous solid, revealing the unknown governing laws from real-world data at microscale. Our regularization method outperforms baseline methods in robustness, generalizability and accuracy.

preprint2022arXiv

Stochastic Data-Driven Variational Multiscale Reduced Order Models

Trajectory-wise data-driven reduced order models (ROMs) tend to be sensitive to training data, and thus lack robustness. We propose to construct a robust stochastic ROM closure (S-ROM) from data consisting of multiple trajectories from random initial conditions. The S-ROM is a low-dimensional time series model for the coefficients of the dominating proper orthogonal decomposition (POD) modes inferred from data. Thus, it achieves reduction both space and time, leading to simulations orders of magnitude faster than the full order model. We show that both the estimated POD modes and parameters in the S-ROM converge when the number of trajectories increases. Thus, the S-ROM is robust when the training data size increases. We demonstrate the S-ROM on a 1D Burgers equation with a viscosity $ν= 0.002$ and with random initial conditions. The numerical results verify the convergence. Furthermore, the S-ROM makes accurate trajectory-wise predictions from new initial conditions and with a prediction time far beyond the training range, and it quantifies the spread of uncertainties due to the unresolved scales.

preprint2022arXiv

Unsupervised learning of observation functions in state-space models by nonparametric moment methods

We investigate the unsupervised learning of non-invertible observation functions in nonlinear state-space models. Assuming abundant data of the observation process along with the distribution of the state process, we introduce a nonparametric generalized moment method to estimate the observation function via constrained regression. The major challenge comes from the non-invertibility of the observation function and the lack of data pairs between the state and observation. We address the fundamental issue of identifiability from quadratic loss functionals and show that the function space of identifiability is the closure of a RKHS that is intrinsic to the state process. Numerical results show that the first two moments and temporal correlations, along with upper and lower bounds, can identify functions ranging from piecewise polynomials to smooth functions, leading to convergent estimators. The limitations of this method, such as non-identifiability due to symmetry and stationarity, are also discussed.

preprint2021arXiv

ISALT: Inference-based schemes adaptive to large time-stepping for locally Lipschitz ergodic systems

Efficient simulation of SDEs is essential in many applications, particularly for ergodic systems that demand efficient simulation of both short-time dynamics and large-time statistics. However, locally Lipschitz SDEs often require special treatments such as implicit schemes with small time-steps to accurately simulate the ergodic measure. We introduce a framework to construct inference-based schemes adaptive to large time-steps (ISALT) from data, achieving a reduction in time by several orders of magnitudes. The key is the statistical learning of an approximation to the infinite-dimensional discrete-time flow map. We explore the use of numerical schemes (such as the Euler-Maruyama, a hybrid RK4, and an implicit scheme) to derive informed basis functions, leading to a parameter inference problem. We introduce a scalable algorithm to estimate the parameters by least squares, and we prove the convergence of the estimators as data size increases. We test the ISALT on three non-globally Lipschitz SDEs: the 1D double-well potential, a 2D multi-scale gradient system, and the 3D stochastic Lorenz equation with degenerate noise. Numerical results show that ISALT can tolerate time-step magnitudes larger than plain numerical schemes. It reaches optimal accuracy in reproducing the invariant measure when the time-step is medium-large.

preprint2021arXiv

Shock trace prediction by reduced models for a viscous stochastic Burgers equation

Viscous shocks are a particular type of extreme events in nonlinear multiscale systems, and their representation requires small scales. Model reduction can thus play an important role in reducing the computational cost for an efficient prediction of shocks. Yet, reduced models typically aim to approximate large-scale dominating dynamics, which do not resolve the small scales by design. To resolve this representation barrier, we introduce a new qualitative characterization of the space-time locations of shocks, named as the ``shock trace'', via a space-time indicator function based on an empirical resolution-adaptive threshold. Different from the exact shocks, the shock traces can be captured within the representation capacity of the large scales, which facilitates the forecast of the timing and locations of the shocks utilizing reduced models. Within the context of a viscous stochastic Burgers equation, we show that a data-driven reduced model, in the form of nonlinear autoregression (NAR) time series models, can accurately predict the random shock traces, with relatively low rates of false predictions. The NAR model significantly outperforms the corresponding Galerkin truncated model in the scenario of either noiseless or noisy observations. The results illustrate the importance of the data-driven closure terms in the NAR model, which account for the effects of the unresolved small scale dynamics on the resolved ones due to nonlinear interactions.

preprint2020arXiv

Learning interaction kernels in heterogeneous systems of agents from multiple trajectories

Systems of interacting particles or agents have wide applications in many disciplines such as Physics, Chemistry, Biology and Economics. These systems are governed by interaction laws, which are often unknown: estimating them from observation data is a fundamental task that can provide meaningful insights and accurate predictions of the behaviour of the agents. In this paper, we consider the inverse problem of learning interaction laws given data from multiple trajectories, in a nonparametric fashion, when the interaction kernels depend on pairwise distances. We establish a condition for learnability of interaction kernels, and construct estimators that are guaranteed to converge in a suitable $L^2$ space, at the optimal min-max rate for 1-dimensional nonparametric regression. We propose an efficient learning algorithm based on least squares, which can be implemented in parallel for multiple trajectories and is therefore well-suited for the high dimensional, big data regime. Numerical simulations on a variety examples, including opinion dynamics, predator-swarm dynamics and heterogeneous particle dynamics, suggest that the learnability condition is satisfied in models used in practice, and the rate of convergence of our estimator is consistent with the theory. These simulations also suggest that our estimators are robust to noise in the observations, and produce accurate predictions of dynamics in relative large time intervals, even when they are learned from data collected in short time intervals.

preprint2020arXiv

Learning interaction kernels in stochastic systems of interacting particles from multiple trajectories

We consider stochastic systems of interacting particles or agents, with dynamics determined by an interaction kernel which only depends on pairwise distances. We study the problem of inferring this interaction kernel from observations of the positions of the particles, in either continuous or discrete time, along multiple independent trajectories. We introduce a nonparametric inference approach to this inverse problem, based on a regularized maximum likelihood estimator constrained to suitable hypothesis spaces adaptive to data. We show that a coercivity condition enables us to control the condition number of this problem and prove the consistency of our estimator, and that in fact it converges at a near-optimal learning rate, equal to the min-max rate of $1$-dimensional non-parametric regression. In particular, this rate is independent of the dimension of the state space, which is typically very high. We also analyze the discretization errors in the case of discrete-time observations, showing that it is of order $1/2$ in terms of the time gaps between observations. This term, when large, dominates the sampling error and the approximation error, preventing convergence of the estimator. Finally, we exhibit an efficient parallel algorithm to construct the estimator from data, and we demonstrate the effectiveness of our algorithm with numerical tests on prototype systems including stochastic opinion dynamics and a Lennard-Jones model.

preprint2020arXiv

On the identifiability of interaction functions in systems of interacting particles

We address a fundamental issue in the nonparametric inference for systems of interacting particles: the identifiability of the interaction functions. We prove that the interaction functions are identifiable for a class of first-order stochastic systems, including linear systems with general initial laws and nonlinear systems with stationary distributions. We show that a coercivity condition is sufficient for identifiability and becomes necessary when the number of particles approaches infinity. The coercivity is equivalent to the strict positivity of related integral operators, which we prove by showing that their integral kernels are strictly positive definite by using Müntz type theorems.

preprint2019arXiv

A Low Temperature Functioning CoFeB/MgO Based Perpendicular Magnetic Tunnel Junction for Cryogenic Nonvolatile Random Access Memory

We investigated the low temperature performance of CoFeB/MgO based perpendicular magnetic tunnel junctions (pMTJs) by characterizing their quasi-static switching voltage, high speed pulse write error rate and endurance down to 9 K. pMTJ devices exhibited high magnetoresistance (>120%) and reliable (error rate<10-4) bi-directional switching with 2 to 200 ns voltage pulses. The endurance of the devices at 9 K surpassed that at 300 K by three orders of magnitude under the same write conditions, functioning for more than 10^12 cycles with 10 ns write pulses. The critical switching voltage at 9 K was observed to increase by 33% to 93%, depending on pulse duration, compared to that at 350 K. Ferromagnetic resonance and magnetization measurements on blanket pMTJ film stacks suggest that the increased switching voltage is associated with an increase in effective magnetic anisotropy and magnetization of free layer with decreasing temperature. Our work demonstrates that CoFeB/MgO based pMTJs have great potential to enable cryogenic MRAM and that their low temperature magnetization and effective magnetic anisotropy can be further optimized to lower operating power and improve endurance.

preprint2019arXiv

Nonparametric inference of interaction laws in systems of agents from trajectory data

Inferring the laws of interaction between particles and agents in complex dynamical systems from observational data is a fundamental challenge in a wide variety of disciplines. We propose a non-parametric statistical learning approach to estimate the governing laws of distance-based interactions, with no reference or assumption about their analytical form, from data consisting trajectories of interacting agents. We demonstrate the effectiveness of our learning approach both by providing theoretical guarantees, and by testing the approach on a variety of prototypical systems in various disciplines. These systems include homogeneous and heterogeneous agents systems, ranging from particle systems in fundamental physics to agent-based systems modeling opinion dynamics under the social influence, prey-predator dynamics, flocking and swarming, and phototaxis in cell dynamics.

preprint2016arXiv

Data-based stochastic model reduction for the Kuramoto--Sivashinsky equation

The problem of constructing data-based, predictive, reduced models for the Kuramoto-Sivashinsky equation is considered, under circumstances where one has observation data only for a small subset of the dynamical variables. Accurate prediction is achieved by developing a discrete-time stochastic reduced system, based on a NARMAX (Nonlinear Autoregressive Moving Average with eXogenous input) representation. The practical issue, with the NARMAX representation as with any other, is to identify an efficient structure, i.e., one with a small number of terms and coefficients. This is accomplished here by estimating coefficients for an approximate inertial form. The broader significance of the results is discussed.

preprint2015arXiv

A discrete approach to stochastic parametrization and dimensional reduction in nonlinear dynamics

Many physical systems are described by nonlinear differential equations that are too complicated to solve in full. A natural way to proceed is to divide the variables into those that are of direct interest and those that are not, formulate solvable approximate equations for the variables of greater interest, and use data and statistical methods to account for the impact of the other variables. In the present paper the problem is considered in a fully discrete-time setting, which simplifies both the analysis of the data and the numerical algorithms. The resulting time series are identified by a NARMAX (nonlinear autoregression moving average with exogenous input) representation familiar from engineering practice. The connections with the Mori-Zwanzig formalism of statistical physics are discussed, as well as an application to the Lorenz 96 system.

preprint2015arXiv

Sampling, feasibility, and priors in Bayesian estimation

Importance sampling algorithms are discussed in detail, with an emphasis on implicit sampling, and applied to data assimilation via particle filters. Implicit sampling makes it possible to use the data to find high-probability samples at relatively low cost, making the assimilation more efficient. A new analysis of the feasibility of data assimilation is presented, showing in detail why feasibility depends on the Frobenius norm of the covariance matrix of the noise and not on the number of variables. A discussion of the convergence of particular particle filters follows. A major open problem in numerical data assimilation is the determination of appropriate priors, a progress report on recent work on this problem is given. The analysis highlights the need for a careful attention both to the data and to the physics in data assimilation problems.

preprint2014arXiv

Limitations of polynomial chaos expansions in the Bayesian solution of inverse problems

Polynomial chaos expansions are used to reduce the computational cost in the Bayesian solutions of inverse problems by creating a surrogate posterior that can be evaluated inexpensively. We show, by analysis and example, that when the data contain significant information beyond what is assumed in the prior, the surrogate posterior can be very different from the posterior, and the resulting estimates become inaccurate. One can improve the accuracy by adaptively increasing the order of the polynomial chaos, but the cost may increase too fast for this to be cost effective compared to Monte Carlo sampling without a surrogate posterior.

preprint2013arXiv

Convergence of densities of some functionals of Gaussian processes

The aim of this paper is to establish the uniform convergence of the densities of a sequence of random variables, which are functionals of an underlying Gaussian process, to a normal density. Precise estimates for the uniform distance are derived by using the techniques of Malliavin calculus, combined with Stein's method for normal approximation. We need to assume some non-degeneracy conditions. First, the study is focused on random variables in a fixed Wiener chaos, and later, the results are extended to the uniform convergence of the derivatives of the densities and to the case of random vectors in some fixed chaos, which are uniformly non-degenerate in the sense of Malliavin calculus. Explicit upper bounds for the uniform norm are obtained for random variables in the second Wiener chaos, and an application to the convergence of densities of the least square estimator for the drift parameter in Ornstein-Uhlenbeck processes is discussed.

preprint2013arXiv

Non-degeneracy of some Sobolev Pseudo-norms of fractional Brownian motion

Applying an upper bound estimate for small $L^{2}$ ball probability for fractional Brownian motion (fBm), we prove the non-degeneracy of some Sobolev pseudo-norms of fBm.

preprint2013arXiv

Synthesis and spectroscopic characterization of completely isotactic polyacrylonitrile

Completely isotactic polyacrylonitrile (i-PAN) has been synthesized successfully by an improved urea inclusion polymerization. The tacticity of prepared samples were confirmed by 13C nuclear magnetic resonance (NMR), Fourier transform infrared (FT-IR) and X-ray diffraction (XRD).

preprint2013arXiv

The Formation and Characteristics of Acrylonitrile/Urea Inclusion Compound

The formation process and composition of the acrylonitrile/urea inclusion compounds (AN/UIC) with different aging times and AN/urea molar feed ratios are studied by differential scanning calorimetry (DSC) and X-ray diffraction (XRD). It is suggested that DSC could be one of the helpful methods to determine the guest/host ratio and the heat of decomposition. Meanwhile, the guest/host ratio and heat of deformation are obtained, which are 1.17 and 5361.53 J/mol, respectively. It is suggested AN molecules included in urea canal lattice may be packed flat against each other. It is found that the formation of AN/UIC depends on the aging time. XRD results reveal that once AN molecules enter urea lattice, AN/UIC are formed, which possess the final structure. When AN molecules are sufficient, the length of AN molecular arrays in urea canals increases as aging time prolonging until urea tunnels are saturated by AN.

preprint2012arXiv

Feynman--Kac formula for the heat equation driven by fractional noise with Hurst parameter $H<1/2$

In this paper, a Feynman-Kac formula is established for stochastic partial differential equation driven by Gaussian noise which is, with respect to time, a fractional Brownian motion with Hurst parameter $H<1/2$. To establish such a formula, we introduce and study a nonlinear stochastic integral from the given Gaussian noise. To show the Feynman--Kac integral exists, one still needs to show the exponential integrability of nonlinear stochastic integral. Then, the approach of approximation with techniques from Malliavin calculus is used to show that the Feynman-Kac integral is the weak solution to the stochastic partial differential equation.

preprint2012arXiv

The Influence of in-medium NN cross-sections, symmetry potential and impact parameter on the isospin observables

We explore the influence of in-medium nucleon-nucleon cross section, symmetry potential and impact parameter on isospin sensitive observables in intermediate-energy heavy-ion collisions with the ImQMD05 code, a modified version of Quantum Molecular Dynamics model. At incident velocities above the Fermi velocity, we find that the density dependence of symmetry potential plays a more important role on the double neutron to proton ratio $DR(n/p)$ and the isospin transport ratio $R_i$ than the in-medium nucleon-nucleon cross sections, provided that the latter are constrained to a fixed total NN collision rate. We also explore both $DR(n/p)$ and $R_i$ as a function of the impact parameter. Since the copious production of intermediate mass fragments is a distinguishing feature of intermediate-energy heavy-ion collisions, we examine the isospin transport ratios constructed from different groups of fragments. We find that the values of the isospin transport ratios for projectile rapidity fragments with $Z\ge20$ are greater than those constructed from the entire projectile rapidity source. We believe experimental investigations of this phenomenon can be performed. These may provide significant tests of fragmentation time scales predicted by ImQMD calculations.

preprint2011arXiv

Hölder Continuity of the Solution for a Class of Nonlinear SPDE Arising from One Dimensional Superprocesses

The Hölder continuity of the solution to a nonlinear stochastic partial differential equation arising from one dimensional super process is obtained. It is proved that the Hölder exponent in time variable is as close as to 1/4, improving the result of 1/10 in a recent paper by Li et al [3]. The method is to use the Malliavin calculus. The Hölder continuity in spatial variable x of exponent 1/2 is also obtained by using this new approach. This Hölder continuity result is sharp since the corresponding linear heat equation has the same Hölder continuity.

Fei Lu

What is connected

Connect this record

See the researcher in context

Building this map preview

24 published item(s)

From Events to Trending: A Multi-Stage Hotspots Detection Method Based on Generative Query Indexing

Scalable iterative data-adaptive RKHS regularization

Data adaptive RKHS Tikhonov regularization for learning kernels in operators

Nonparametric learning of kernels in nonlocal operators

Stochastic Data-Driven Variational Multiscale Reduced Order Models

Unsupervised learning of observation functions in state-space models by nonparametric moment methods

ISALT: Inference-based schemes adaptive to large time-stepping for locally Lipschitz ergodic systems

Shock trace prediction by reduced models for a viscous stochastic Burgers equation

Learning interaction kernels in heterogeneous systems of agents from multiple trajectories

Learning interaction kernels in stochastic systems of interacting particles from multiple trajectories

On the identifiability of interaction functions in systems of interacting particles

A Low Temperature Functioning CoFeB/MgO Based Perpendicular Magnetic Tunnel Junction for Cryogenic Nonvolatile Random Access Memory

Nonparametric inference of interaction laws in systems of agents from trajectory data

Data-based stochastic model reduction for the Kuramoto--Sivashinsky equation

A discrete approach to stochastic parametrization and dimensional reduction in nonlinear dynamics

Sampling, feasibility, and priors in Bayesian estimation

Limitations of polynomial chaos expansions in the Bayesian solution of inverse problems

Convergence of densities of some functionals of Gaussian processes

Non-degeneracy of some Sobolev Pseudo-norms of fractional Brownian motion

Synthesis and spectroscopic characterization of completely isotactic polyacrylonitrile

The Formation and Characteristics of Acrylonitrile/Urea Inclusion Compound

Feynman--Kac formula for the heat equation driven by fractional noise with Hurst parameter $H<1/2$

The Influence of in-medium NN cross-sections, symmetry potential and impact parameter on the isospin observables

Hölder Continuity of the Solution for a Class of Nonlinear SPDE Arising from One Dimensional Superprocesses