Source author record

Song Wei

Song Wei appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Topics: Machine Learning, math.ST, Methodology, Statistics Theory, math-ph, math.MP, physics.flu-dyn

Catalog footprint

What is connected

8 works

7 topics

4 close collaborators

Preprint, 2026, arXiv

Online Kernel CUSUM for Change-Point Detection

We present a computationally efficient online kernel Cumulative Sum (CUSUM) method for change-point detection that takes the maximum over a set of kernel statistics to account for the unknown change-point location. Our approach is more sensitive to small changes than existing kernel-based change-point detection methods, including the Scan-B statistic, which corresponds to a non-parametric Shewhart chart-type procedure. We provide accurate analytic approximations for two key performance metrics, the Average Run Length (ARL) and the Expected Detection Delay (EDD). These approximations let us establish that the optimal window length is on the order of the logarithm of the ARL, which ensures minimal power loss relative to an oracle procedure with infinite memory. Moreover, we introduce a recursive calculation procedure for the detection statistics that keeps computational and memory complexity constant, which is essential for online implementation. Through extensive experiments on both simulated and real data, we demonstrate the competitive performance of our method and validate our theoretical results.
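The recursive, constant-memory update mentioned in the abstract above can be illustrated with the classical scalar CUSUM recursion. This is a minimal sketch of the textbook parametric procedure, not the kernel method of the paper; the function name `cusum_first_alarm`, the `drift` and `threshold` values, and the synthetic stream are all assumptions chosen for illustration.

```python
def cusum_first_alarm(samples, drift, threshold):
    """Return the 0-based index of the first alarm, or None if none fires.

    Classical one-sided CUSUM: S_t = max(0, S_{t-1} + x_t - drift).
    """
    stat = 0.0
    for index, value in enumerate(samples):
        # Recursive update: only the previous statistic is kept in memory.
        stat = max(0.0, stat + value - drift)
        if stat > threshold:
            return index
    return None


# Hypothetical stream: 50 samples at level 0, then 50 samples at level 1.
stream = [0.0] * 50 + [1.0] * 50
alarm_index = cusum_first_alarm(stream, drift=0.5, threshold=5.0)
```

Each step touches a single running value, so the cost per sample does not grow with the length of the stream. A larger `threshold` lengthens the run between false alarms at the price of a longer detection delay.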

Preprint, 2024, arXiv

Transfer Learning for Causal Effect Estimation

We present a Transfer Causal Learning (TCL) framework for the setting in which target and source domains share the same covariate/feature space, aiming to improve the accuracy of causal effect estimation when data are limited. Limited data is very common in medical applications, where rare conditions such as sepsis are of interest. Our proposed method, named $\ell_1$-TCL, incorporates $\ell_1$-regularized transfer learning (TL) for nuisance models (e.g., the propensity score model); the TL estimator of the nuisance parameters is plugged into downstream average causal/treatment effect estimators (e.g., the inverse probability weighted estimator). We establish non-asymptotic recovery guarantees for $\ell_1$-TCL with a generalized linear model (GLM) under the sparsity assumption in the high-dimensional setting, and demonstrate the empirical benefits of $\ell_1$-TCL through extensive numerical simulations for GLM and recent neural network nuisance models. Our method is subsequently applied to real data and generates meaningful insights consistent with the medical literature, in a case where all baseline methods fail.
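The plug-in step described above, where estimated propensity scores feed an inverse probability weighted (IPW) estimator, can be sketched as follows. This is a minimal illustration of the standard IPW formula only: the propensity scores are taken as given, so the $\ell_1$-regularized transfer step is not shown, and the function name `ipw_ate` and the toy data are assumptions.

```python
def ipw_ate(treatments, outcomes, propensities):
    """Inverse probability weighted estimate of the average treatment effect.

    treatments:   0/1 indicators T_i
    outcomes:     observed outcomes Y_i
    propensities: estimated P(T_i = 1 | X_i), each strictly between 0 and 1
    """
    n = len(treatments)
    if n == 0 or not (n == len(outcomes) == len(propensities)):
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for t, y, e in zip(treatments, outcomes, propensities):
        if not 0.0 < e < 1.0:
            raise ValueError("propensity scores must lie strictly in (0, 1)")
        # Treated units are weighted by 1/e, control units by 1/(1 - e).
        total += t * y / e - (1 - t) * y / (1.0 - e)
    return total / n


# Hypothetical toy data with a constant propensity of 0.5.
ate = ipw_ate([1, 0, 1, 0], [3.0, 1.0, 5.0, 2.0], [0.5, 0.5, 0.5, 0.5])
```

Because the weights are reciprocals of the propensity scores, errors in the nuisance model propagate directly into the effect estimate, which is why a better nuisance estimator matters when data are scarce.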

Preprint, 2023, arXiv

Optimal Sub-sampling to Boost Power of Kernel Sequential Change-point Detection

We present a novel scheme to boost the detection power of sequential change-point detection procedures based on the kernel maximum mean discrepancy (MMD). Our proposed scheme performs an optimal sub-sampling of the history data before the detection procedure, to counter the power loss incurred by randomly sub-sampling a very large history. We apply the scheme to both the Scan $B$ and Kernel Cumulative Sum (CUSUM) procedures, and extensive numerical experiments show improved performance.
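The quantity underlying these procedures, the kernel MMD between a reference sample and a test sample, can be sketched with the standard unbiased estimator below. This is an illustrative one-dimensional version with a Gaussian kernel; the function names and the bandwidth default are assumptions, and neither the sub-sampling scheme nor the Scan $B$ and CUSUM procedures from the paper are implemented here.

```python
import math


def gaussian_kernel(a, b, bandwidth):
    """Gaussian (RBF) kernel between two scalars."""
    return math.exp(-((a - b) ** 2) / (2.0 * bandwidth ** 2))


def mmd2_unbiased(xs, ys, bandwidth=1.0):
    """Unbiased estimate of the squared MMD between samples xs and ys."""
    n, m = len(xs), len(ys)
    if n < 2 or m < 2:
        raise ValueError("each sample needs at least two points")
    # Within-sample terms exclude the diagonal (i == j) to stay unbiased.
    kxx = sum(gaussian_kernel(xs[i], xs[j], bandwidth)
              for i in range(n) for j in range(n) if i != j)
    kyy = sum(gaussian_kernel(ys[i], ys[j], bandwidth)
              for i in range(m) for j in range(m) if i != j)
    kxy = sum(gaussian_kernel(x, y, bandwidth) for x in xs for y in ys)
    return kxx / (n * (n - 1)) + kyy / (m * (m - 1)) - 2.0 * kxy / (n * m)
```

The estimate is close to zero when both samples come from the same distribution and grows as the distributions separate. Its cost is quadratic in the sample sizes, which is one reason a large history is usually sub-sampled before detection.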

Preprint, 2021, arXiv

Goodness-of-Fit Test for Mismatched Self-Exciting Processes

Recently there have been many research efforts to develop generative models for self-exciting point processes, partly due to their broad applicability in real-world settings. However, we can rarely quantify how well a generative model captures the ground truth, since the ground truth is usually unknown. The challenge lies in the fact that generative models typically provide, at most, good approximations to the ground truth (e.g., through the rich representative power of neural networks), but they cannot match it exactly. We thus cannot use the classic goodness-of-fit (GOF) test framework to evaluate their performance. In this paper, we develop a GOF test for generative models of self-exciting processes by connecting this problem to the classical statistical theory of the quasi-maximum-likelihood estimator (QMLE). We present a non-parametric self-normalizing statistic for the GOF test, the Generalized Score (GS) statistic, and explicitly capture the model misspecification when establishing the asymptotic distribution of the GS statistic. Numerical simulations and real-data experiments validate our theory and demonstrate the good performance of the proposed GS test.

Preprint, 2021, arXiv

Inferring serial correlation with dynamic backgrounds

Sequential data with serial correlation and an unknown, unstructured, and dynamic background is ubiquitous in neuroscience, psychology, and econometrics. Inferring serial correlation for such data is a fundamental challenge in statistics. We propose a total-variation-constrained least-squares estimator coupled with hypothesis tests to infer the serial correlation in the presence of an unknown and unstructured dynamic background. The total variation constraint on the dynamic background encourages a piecewise-constant structure, which can approximate a wide range of dynamic backgrounds. The tuning parameter is selected via the Ljung-Box test to control the bias-variance trade-off. We establish a non-asymptotic upper bound for the estimation error through variational inequalities. We also derive a lower error bound via Fano's method and show that the proposed method is near-optimal. Numerical simulations and a real study in psychology demonstrate the excellent performance of our proposed method compared with the state of the art.
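The Ljung-Box test used above for tuning is built on the statistic $Q = n(n+2)\sum_{k=1}^{h} \hat\rho_k^2/(n-k)$, where $\hat\rho_k$ is the lag-$k$ sample autocorrelation. The sketch below computes only that statistic; the function name `ljung_box_q` is an assumption, and how the paper couples the test with its estimator is not reproduced here.

```python
def ljung_box_q(series, max_lag):
    """Ljung-Box Q statistic over lags 1..max_lag for a numeric sequence."""
    n = len(series)
    if not 0 < max_lag < n:
        raise ValueError("max_lag must satisfy 0 < max_lag < len(series)")
    mean = sum(series) / n
    centered = [value - mean for value in series]
    denom = sum(c * c for c in centered)
    if denom == 0.0:
        raise ValueError("series is constant; autocorrelation is undefined")
    weighted_sum = 0.0
    for lag in range(1, max_lag + 1):
        # Sample autocorrelation at this lag.
        autocov = sum(centered[t] * centered[t + lag] for t in range(n - lag))
        rho = autocov / denom
        weighted_sum += rho * rho / (n - lag)
    return n * (n + 2) * weighted_sum
```

Under the null hypothesis of no serial correlation in a raw series, Q is compared against a chi-square distribution with `max_lag` degrees of freedom, so large values point to remaining correlation.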

Preprint, 2021, arXiv

Noisy Gradient Descent Converges to Flat Minima for Nonconvex Matrix Factorization

Extensive empirical evidence has corroborated the importance of noise in nonconvex optimization problems. The theory behind these empirical observations, however, is still largely unknown. This paper studies this fundamental problem by investigating the nonconvex rectangular matrix factorization problem, which has infinitely many global minima due to rotation and scaling invariance. Hence, gradient descent (GD) can converge to any optimum, depending on the initialization. In contrast, we show that a perturbed form of GD with an arbitrary initialization converges to a global optimum that is uniquely determined by the injected noise. Our result implies that the noise imposes an implicit bias towards certain optima. Numerical experiments are provided to support our theory.
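The dependence of plain GD on its initialization can be seen in a scalar toy version of the problem, minimizing $\tfrac{1}{2}(xy - 1)^2$, whose global minima form the curve $xy = 1$. This is a hedged sketch, not the paper's setting: the scalar case has only scaling invariance, the step size and starting points are assumptions, and the noise-perturbed variant analyzed in the paper is not implemented.

```python
def gradient_descent(x, y, step=0.05, iterations=2000):
    """Plain gradient descent on f(x, y) = 0.5 * (x * y - 1) ** 2."""
    for _ in range(iterations):
        residual = x * y - 1.0
        # Gradient of f is (residual * y, residual * x); update both at once.
        x, y = x - step * residual * y, y - step * residual * x
    return x, y


# Two hypothetical initializations that end at different global minima.
balanced = gradient_descent(0.5, 0.5)
unbalanced = gradient_descent(2.0, 0.1)
```

Both runs drive the product $xy$ to 1, but the first ends with equal factors while the second keeps the imbalance of its starting point, since plain GD nearly preserves $x^2 - y^2$ at small step sizes.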

Preprint, 2016, arXiv

Fractional constitutive equation (FACE) for non-Newtonian fluid flow: Theoretical description

Non-Newtonian fluid flow might be driven by spatially nonlocal velocity, the dynamics of which can be described by fractional derivative models. This short communication proposes a space FrActional-order Constitutive Equation (FACE) that links viscous shear stress with the velocity gradient, and then interprets the physical properties of non-Newtonian fluids for steady pipe flow. Results show that the generalized FACE model contains previous non-Newtonian fluid flow models as end-members, recovered by simply adjusting the order of the fractional index, and a preliminary test shows that the FACE model captures the observed growth of shear stress for various velocity gradients. Further analysis of the velocity profile, frictional head loss, and Reynolds number using the FACE model also leads to analytical tools and criteria that can significantly extend standard models in quantifying the complex dynamics of non-Newtonian fluid flow with a wide range of spatially nonlocal velocities.

Preprint, 2013, arXiv

A Matlab toolbox for fractional relaxation-oscillation equations

Stress relaxation and oscillation damping of complex viscoelastic media often manifest history- and path-dependent physical behaviors and cannot be described accurately by classical models. Recent research has found that fractional derivative models can characterize such complex relaxation and damping. However, to the best of our knowledge, easy-to-use numerical software is not available for fractional relaxation-oscillation (FRO) equations. This paper introduces a free, open-source Matlab toolbox that we developed in recent years for the numerical solution of FRO equations. The FRO toolbox uses the predictor-corrector approach to discretize the time-fractional derivative, and non-expert users can accurately solve fractional relaxation-oscillation equations via a friendly graphical user interface. Comparisons with experimental data show that the FRO toolbox is highly efficient and accurate in simulating viscoelastic stress relaxation and damped vibration. This free toolbox will help promote the research and practical use of fractional relaxation-oscillation equations.
