Researcher profile

Hau-Tieng Wu

Hau-Tieng Wu contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging · Verification L1 · Unclaimed author
21 works
0 followers
17 topics
4 close collaborators

Published work

21 published items

preprint · 2026 · arXiv

On spectral interference of the short-time Fourier transform and its nonlinear variations

Spectral interference, the frequency counterpart of the beating phenomenon in the time domain, can severely distort time-frequency representations (TFRs) in physical applications. We study this phenomenon for the short-time Fourier transform (STFT) with a Gaussian window and for nonlinear refinements based on the reassignment method, with an emphasis on the synchrosqueezing transform (SST). Working with a two-component harmonic model, we quantify when STFT can (and cannot) resolve two nearby frequencies: a sharp transition occurs at a critical gap that scales inversely with the kernel bandwidth and depends explicitly on the amplitude ratio. Below this threshold, the spectrogram ridges undergo bifurcation and form repeating time-frequency bubbles, which we describe asymptotically and, in the balanced-amplitude case, approximate closely by ellipses. We then analyze the STFT phase, showing a canonical winding behavior, and relate the complex-valued SST reassignment map to a holomorphic structure via the Bargmann transform. In the two-component setting the reassignment rule admits an explicit Möbius-geometry description, sending frequency lines to circular arcs in the complex plane. Finally, viewing SST and reassignment through a measure-mapping perspective, we derive small-kernel asymptotics that explain when reassignment sharpens energy and when it produces distorted or misleading TFRs; we also introduce a generalized synchrosqueezing framework that isolates the role of STFT weighting and clarifies how alternative choices can mitigate interference in certain regimes.
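
For orientation, the interference effect can be seen in a short calculation. The conventions here are assumptions made for illustration and need not match the paper's: a signal $f(t) = e^{2πi ξ_1 t} + a\, e^{2πi ξ_2 t}$ with $a > 0$, a window $g(t) = e^{-t^2/(2σ^2)}$, and the STFT $V_f(t, ξ) = \int f(s)\, g(s-t)\, e^{-2πi ξ (s-t)}\, ds$.

```latex
\hat g(\eta) = \int g(u)\, e^{-2\pi i \eta u}\, du
             = \sigma \sqrt{2\pi}\; e^{-2\pi^2 \sigma^2 \eta^2},
\\[4pt]
V_f(t,\xi) = e^{2\pi i \xi_1 t}\, \hat g(\xi - \xi_1)
           + a\, e^{2\pi i \xi_2 t}\, \hat g(\xi - \xi_2),
\\[4pt]
|V_f(t,\xi)|^2 = \hat g(\xi - \xi_1)^2 + a^2\, \hat g(\xi - \xi_2)^2
               + 2a\, \hat g(\xi - \xi_1)\, \hat g(\xi - \xi_2)\,
                 \cos\!\big(2\pi (\xi_1 - \xi_2)\, t\big).
```

Under these assumptions the cross term oscillates in time with period $1/|ξ_1 - ξ_2|$ and matters only where the two Gaussian bumps overlap appreciably, which is consistent with the time-periodic pattern the abstract describes.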

preprint · 2026 · arXiv

Probabilistic Analysis of Scalogram Ridges in Signal Processing

While ridges in the scalogram, determined by the squared modulus of the analytic wavelet transform (AWT), are a widely accepted concept and are widely used in nonstationary time series analysis, their behavior in noisy environments remains underexplored. Our objective is to provide a theoretical foundation for scalogram ridges by defining ridges as a potentially set-valued random process connecting local maxima of the scalogram along the scale axis and analyzing their properties when the signal fulfills the adaptive harmonic model and is contaminated by stationary Gaussian noise. In addition to establishing several key properties of the AWT for random processes, we investigate the probabilistic characteristics of the resulting random ridge points in the scalogram. Specifically, we establish the uniqueness property of the ridge point at individual time instances and prove the upper hemicontinuity of the ridge random process. Furthermore, we derive bounds on the probability that the deviation between the ridges of noisy and clean signals exceeds a specified threshold, and these bounds depend on the signal-to-noise ratio. To achieve these ridge deviation results, we derive maximal inequalities for the complex modulus of nonstationary Gaussian processes, leveraging classical tools such as the Borell-TIS inequality and Dudley's theorem, which might be of independent interest.

preprint · 2025 · arXiv

Efficient Artifacts Removal for Adaptive Deep Brain Stimulation and a Temporal Event Localization Analysis

Adaptive deep brain stimulation (aDBS) leverages symptom-related biomarkers to deliver personalized neuromodulation therapy, with the potential to improve treatment efficacy and reduce power consumption compared to conventional DBS. However, stimulation-induced signal contamination remains a major technical barrier to advancing its clinical application. Existing artifact removal strategies, both front-end and back-end, face trade-offs between artifact suppression and algorithmic flexibility. Among back-end algorithms, Shrinkage and Manifold-based Artifact Removal using Template Adaptation (SMARTA) has shown promising performance in mitigating stimulus artifacts with minimal distortion to local field potentials (LFPs), but its high computational demand and inability to handle transient direct current (DC) artifacts limit its use in real-time applications. To address this, we developed SMARTA+, a computationally efficient extension of SMARTA capable of suppressing both stimulus and transient DC artifacts while supporting flexible algorithmic design. We evaluated SMARTA+ using semi-real aDBS data and real data from Parkinson's disease patients. Compared to SMARTA and other established methods, SMARTA+ achieved comparable or superior artifact removal while significantly reducing computation time. It preserved spectral and temporal structures, ranging from beta band to high-frequency oscillations, and demonstrated robustness across diverse stimulation protocols. Temporal event localization analysis further showed improved accuracy in detecting beta bursts. These findings support SMARTA+ as a promising tool for advancing real-time, closed-loop aDBS systems.

preprint · 2024 · arXiv

Design a Metric Robust to Complicated High Dimensional Noise for Efficient Manifold Denoising

In this manuscript, we propose an efficient manifold denoiser based on landmark diffusion and optimal shrinkage under the complicated high dimensional noise and compact manifold setup. It is flexible to handle several setups, including the high ambient space dimension with a manifold embedding that occupies a subspace of high or low dimensions, and the noise could be colored and dependent. A systematic comparison with other existing algorithms on both simulated and real datasets is provided. This manuscript is mainly algorithmic and we report several existing tools and numerical results. Theoretical guarantees and more comparisons will be reported in the official paper of this manuscript.

preprint · 2022 · arXiv

Disentangling modes with crossover instantaneous frequencies by synchrosqueezed chirplet transforms, from theory to application

Analysis of signals with oscillatory modes with crossover instantaneous frequencies is a challenging problem in time series analysis. One way to handle this problem is lifting the 2-dimensional time-frequency representation to a 3-dimensional representation, called time-frequency-chirp rate (TFC) representation, by adding one extra chirp rate parameter so that crossover frequencies are disentangled in a higher dimension. The chirplet transform is an algorithm for this lifting idea, which leads to a TFC representation. However, in practice, we found that it has a strong "blurring" effect in the chirp rate axis, which limits its application in real-world data. Moreover, to our knowledge, we have limited mathematical understanding of the chirplet transform in the literature. Motivated by the need for real-world data analysis, in this paper, we propose the synchrosqueezed chirplet transform (SCT) that enhances the TFC representation given by the chirplet transform. The resulting concentrated TFC representation has high contrast so that one can better distinguish different modes with crossover instantaneous frequencies. The basic idea is to use the phase information in the chirplet transform to determine a reassignment rule that sharpens the TFC representation determined by the chirplet transform. We also analyze the chirplet transform and provide theoretical guarantees of SCT.

preprint · 2022 · arXiv

Eigenvector Phase Retrieval: Recovering eigenvectors from the absolute value of their entries

We consider the eigenvalue problem $Ax = λx$ where $A \in \mathbb{R}^{n \times n}$ and the eigenvalue is also real $λ\in \mathbb{R}$. If we are given $A$, $λ$ and, additionally, the absolute value of the entries of $x$ (the vector $(|x_i|)_{i=1}^n$), is there a fast way to recover $x$? In particular, can this be done quicker than computing $x$ from scratch? This may be understood as a special case of the phase retrieval problem. We present a randomized algorithm which provably converges in expectation whenever $λ$ is a simple eigenvalue. The problem should become easier when $|λ|$ is large and we discuss another algorithm for that case as well.

preprint · 2022 · arXiv

Predicting Trust Using Automated Assessment of Multivariate Interactional Synchrony

Diverse disciplines are interested in how the coordination of interacting agents' movements, emotions, and physiology over time impacts social behavior. Here, we describe a new multivariate procedure for automating the investigation of this kind of behaviorally-relevant "interactional synchrony", and introduce a novel interactional synchrony measure based on features of dynamic time warping (DTW) paths. We demonstrate that our DTW path-based measure of interactional synchrony between facial action units of two people interacting freely in a natural social interaction can be used to predict how much trust they will display in a subsequent Trust Game. We also show that our approach outperforms univariate head movement models, models that consider participants' facial action units independently, and models that use previously proposed synchrony or similarity measures. The insights of this work can be applied to any research question that aims to quantify the temporal coordination of multiple signals over time, but has immediate applications in psychology, medicine, and robotics.
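
As background for the DTW-based measure mentioned above, here is a minimal sketch of classic dynamic time warping for two one-dimensional sequences, returning both the alignment cost and the warping path. It is not the paper's multivariate procedure: the absolute-difference local cost and the `diagonal_fraction` feature are hypothetical choices made only to show what a "feature of a DTW path" could look like.

```python
def dtw_path(a, b):
    """Classic dynamic time warping between two 1-D sequences.

    Returns (total_cost, path), where path is a list of (i, j) index pairs
    aligning a[i] with b[j]. The local cost |a[i] - b[j]| is an assumption.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1], cost[i - 1][j], cost[i][j - 1])

    # Backtrack from the end of both sequences to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [
            (cost[i - 1][j - 1], i - 1, j - 1),  # advance both sequences
            (cost[i - 1][j], i - 1, j),          # advance a only
            (cost[i][j - 1], i, j - 1),          # advance b only
        ]
        _, i, j = min(steps)
    path.reverse()
    return cost[n][m], path


def diagonal_fraction(path):
    """Hypothetical path feature: share of steps that advance both sequences."""
    diag = sum(
        1
        for (i0, j0), (i1, j1) in zip(path, path[1:])
        if i1 == i0 + 1 and j1 == j0 + 1
    )
    return diag / max(len(path) - 1, 1)
```

A path that stays close to the diagonal means the two signals evolve in step, while long horizontal or vertical runs mean one signal has to wait for the other. Any synchrony measure built on the path is summarizing that kind of behavior.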

preprint · 2022 · arXiv

Scalability and robustness of spectral embedding: landmark diffusion is all you need

While spectral embedding is a widely applied dimension reduction technique in various fields, so far it is still challenging to make it scalable to handle "big data". On the other hand, the robustness property is less explored and there exist only limited theoretical results. Motivated by the need of handling such data, recently we proposed a novel spectral embedding algorithm, which we coined Robust and Scalable Embedding via Landmark Diffusion (ROSELAND). In short, we measure the affinity between two points via a set of landmarks, which is composed of a small number of points, and "diffuse" on the dataset via the landmark set to achieve a spectral embedding. Roseland can be viewed as a generalization of the commonly applied spectral embedding algorithm, the diffusion map (DM), in the sense that it shares various properties of DM. In this paper, we show that Roseland is not only numerically scalable, but also preserves the geometric properties via its diffusion nature under the manifold setup; that is, we theoretically explore the asymptotic behavior of Roseland under the manifold setup, including handling the U-statistics-like quantities, and provide an $L^\infty$ spectral convergence with a rate. Moreover, we offer a high dimensional noise analysis and show that Roseland is robust to noise. We also compare Roseland with other existing algorithms with numerical simulations.
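
One way to read "affinity between two points via a set of landmarks" is sketched below, under stated assumptions: a Gaussian affinity between each data point and each landmark, composed through the landmarks and then row-normalized into a diffusion (Markov) matrix. The function name, the Gaussian kernel, and the parameter `eps` are illustrative choices, not taken from the paper. The sketch also forms the full n-by-n matrix for readability, which a scalable implementation would avoid.

```python
import math


def landmark_markov_matrix(data, landmarks, eps):
    """Diffusion matrix on `data` whose affinities pass through `landmarks`.

    data, landmarks: lists of equal-length coordinate tuples.
    eps: kernel scale (hypothetical parameter name).
    """
    # K[i][k]: Gaussian affinity between data point i and landmark k.
    K = [[math.exp(-math.dist(x, y) ** 2 / eps) for y in landmarks] for x in data]
    n, m = len(data), len(landmarks)
    # W = K K^T: two data points are similar when both are close to the
    # same landmarks.
    W = [
        [sum(K[i][k] * K[j][k] for k in range(m)) for j in range(n)]
        for i in range(n)
    ]
    # Row-normalize so that each row is a probability distribution.
    return [[w / sum(row) for w in row] for row in W]
```

In this reading, the cost of building affinities is driven by the number of landmarks rather than by all pairs of data points, which is where the scalability would come from.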

preprint · 2022 · arXiv

Spatiotemporal Analysis Using Riemannian Composition of Diffusion Operators

Multivariate time-series have become abundant in recent years, as many data-acquisition systems record information through multiple sensors simultaneously. In this paper, we assume the variables pertain to some geometry and present an operator-based approach for spatiotemporal analysis. Our approach combines three components that are often considered separately: (i) manifold learning for building operators representing the geometry of the variables, (ii) Riemannian geometry of symmetric positive-definite matrices for multiscale composition of operators corresponding to different time samples, and (iii) spectral analysis of the composite operators for extracting different dynamic modes. We propose a method that is analogous to the classical wavelet analysis, which we term Riemannian multi-resolution analysis (RMRA). We provide some theoretical results on the spectral analysis of the composite operators, and we demonstrate the proposed method on simulations and on real data.

preprint · 2021 · arXiv

An Efficient Forecasting Approach to Reduce Boundary Effects in Real-Time Time-Frequency Analysis

Time-frequency (TF) representations of time series are intrinsically subject to boundary effects. As a result, the structures of signals that are highlighted by the representations are garbled when approaching the boundaries of the TF domain. In this paper, for the purpose of real-time TF information acquisition of nonstationary oscillatory time series, we propose a numerically efficient approach for the reduction of such boundary effects. The solution relies on an extension of the analyzed signal obtained by a forecasting technique. In the case of the study of a class of locally oscillating signals, we provide a theoretical guarantee of the performance of our approach. Following a numerical verification of the algorithmic performance of our approach, we validate it by implementing it on biomedical signals.

preprint · 2021 · arXiv

Convergence of Graph Laplacian with kNN Self-tuned Kernels

Kernelized Gram matrix $W$ constructed from data points $\{x_i\}_{i=1}^N$ as $W_{ij} = k_0( \frac{ \| x_i - x_j \|^2}{σ^2} )$ is widely used in graph-based geometric data analysis and unsupervised learning. An important question is how to choose the kernel bandwidth $σ$, and a common practice called self-tuned kernel adaptively sets a $σ_i$ at each point $x_i$ by the $k$-nearest neighbor (kNN) distance. When $x_i$'s are sampled from a $d$-dimensional manifold embedded in a possibly high-dimensional space, unlike with fixed-bandwidth kernels, theoretical results of graph Laplacian convergence with self-tuned kernels have been incomplete. This paper proves the convergence of graph Laplacian operator $L_N$ to manifold (weighted-)Laplacian for a new family of kNN self-tuned kernels $W^{(α)}_{ij} = k_0( \frac{ \| x_i - x_j \|^2}{ ε \hatρ(x_i) \hatρ(x_j)})/\hatρ(x_i)^α \hatρ(x_j)^α$, where $\hatρ$ is the estimated bandwidth function by kNN, and the limiting operator is also parametrized by $α$. When $α = 1$, the limiting operator is the weighted manifold Laplacian $Δ_p$. Specifically, we prove the point-wise convergence of $L_N f$ and convergence of the graph Dirichlet form with rates. Our analysis is based on first establishing a $C^0$ consistency for $\hatρ$ which bounds the relative estimation error $|\hatρ - \barρ|/\barρ$ uniformly with high probability, where $\barρ = p^{-1/d}$, and $p$ is the data density function. Our theoretical results reveal the advantage of self-tuned kernel over fixed-bandwidth kernel via smaller variance error in low-density regions. In the algorithm, no prior knowledge of $d$ or data density is needed. The theoretical results are supported by numerical experiments on simulated data and hand-written digit image data.
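
The kernel family $W^{(α)}_{ij}$ above can be written out directly. The sketch below is a literal transcription of that formula under two assumptions that the abstract does not fix: $\hatρ(x_i)$ is taken to be the plain distance from $x_i$ to its $k$-th nearest neighbor (the paper may normalize it differently), and $k_0(r) = e^{-r}$ is used as the default profile. The function and parameter names are illustrative.

```python
import math


def knn_bandwidth(points, k):
    """rho_hat(x_i): distance from x_i to its k-th nearest neighbor.

    The point itself is excluded. Using the raw kNN distance without any
    further normalization is an assumption of this sketch.
    """
    rho = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        rho.append(dists[k - 1])
    return rho


def self_tuned_kernel(points, k, eps, alpha, k0=lambda r: math.exp(-r)):
    """W[i][j] = k0(|x_i - x_j|^2 / (eps * rho_i * rho_j)) / (rho_i^alpha * rho_j^alpha)."""
    rho = knn_bandwidth(points, k)
    n = len(points)
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            sq_dist = math.dist(points[i], points[j]) ** 2
            scaled = sq_dist / (eps * rho[i] * rho[j])
            W[i][j] = k0(scaled) / (rho[i] ** alpha * rho[j] ** alpha)
    return W
```

Points in sparse regions get a larger $\hatρ$, so their kernel is wider. This matches the abstract's remark that no prior knowledge of the density is needed, because the bandwidth adapts from the data.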

preprint · 2021 · arXiv

Prenatal stress perturbs fetal iron homeostasis in a sex-specific manner

What is the influence of chronic maternal prenatal stress (PS) on fetal iron homeostasis? In a prospective case-control study in 164 pregnant women, we show that cord blood transferrin saturation is lower in male stressed neonates. The total effect of PS exposure on fetal ferritin revealed a decrease of 15.4% compared with controls. Electrocardiogram-based Fetal Stress Index (FSI) identified affected fetuses non-invasively during the third trimester of gestation. FSI-based timely detection of fetuses affected by PS can support early individualized iron supplementation and neurodevelopmental follow-up to prevent long-term sequelae due to PS-exacerbated impairment of the iron homeostasis.

preprint · 2020 · arXiv

A persistent homology approach to heart rate variability analysis with an application to sleep-wake classification

Persistent homology (PH) is a recently developed theory in the field of algebraic topology to study shapes of datasets. It is an effective data analysis tool that is robust to noise and has been widely applied. We demonstrate a general pipeline to apply PH to study time series; in particular, the instantaneous heart rate time series for heart rate variability (HRV) analysis. The first step is capturing the shapes of time series from two different aspects: the PH, and hence the persistence diagrams, of its sub-level set and of its Takens' lag map. Second, we propose a systematic and computationally efficient approach to summarize persistence diagrams, which we coined persistence statistics. To demonstrate our proposed method, we apply these tools to the HRV analysis and the sleep-wake, REM-NREM (rapid eye movement and non-rapid eye movement) and sleep-REM-NREM classification problems. The proposed algorithm is evaluated on three different datasets via the cross-database validation scheme. The performance of our approach is better than the state-of-the-art algorithms, and the result is consistent throughout different datasets.
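
The lag map mentioned above turns a scalar time series into a point cloud whose shape can then be studied. A minimal sketch of the standard delay embedding follows. The function name and the parameters `dim` and `lag` are illustrative, and the abstract does not say which values the paper uses.

```python
def delay_embedding(series, dim, lag):
    """Map a scalar series to points (x[t], x[t + lag], ..., x[t + (dim - 1) * lag]).

    Returns a list of `dim`-tuples; the list is empty if the series is too
    short for the requested dimension and lag.
    """
    last_start = len(series) - (dim - 1) * lag
    return [
        tuple(series[t + k * lag] for k in range(dim))
        for t in range(last_start)
    ]
```

For example, `delay_embedding([1, 2, 3, 4, 5], dim=2, lag=2)` gives the three points `(1, 3)`, `(2, 4)` and `(3, 5)`. A persistent homology library would then be applied to such a point cloud, and that step is outside this sketch.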

preprint · 2020 · arXiv

Airflow recovery from thoracic and abdominal movements using Synchrosqueezing Transform and Locally Stationary Gaussian Process Regression

The airflow signal encodes rich information about the respiratory system. While the gold standard for measuring airflow is to use a spirometer with an occlusive seal, this is not practical for ambulatory monitoring of patients. Advances in sensor technology have made measurement of motion of the thorax and abdomen feasible with small inexpensive devices, but estimation of airflow from these time series is challenging. We propose to use the nonlinear-type time-frequency analysis tool, synchrosqueezing transform, to properly represent the thoracic and abdominal movement signals as the features, which are used to recover the airflow by the locally stationary Gaussian process. We show that, using a dataset that contains respiratory signals under normal sleep conditions, an accurate prediction can be achieved by fitting the proposed model in the feature space both in the intra- and inter-subject setups. We also apply our method to a more challenging case, where subjects under general anesthesia underwent transitions from pressure support to unassisted ventilation to further demonstrate the utility of the proposed method.

preprint · 2020 · arXiv

An Adaptive QRS Detection Algorithm for Ultra-Long-Term ECG Recordings

Background: Accurate detection of QRS complexes during mobile, ultra-long-term ECG monitoring is challenged by instances of high heart rate, dramatic and persistent changes in signal amplitude, and intermittent deformations in signal quality that arise due to subject motion, background noise, and misplacement of the ECG electrodes. Purpose: We propose a revised QRS detection algorithm which addresses the above-mentioned challenges. Methods and Results: Our proposed algorithm is based on a state-of-the-art algorithm after applying two key modifications. The first modification is implementing local estimates for the amplitude of the signal. The second modification is a mechanism by which the algorithm becomes adaptive to changes in heart rate. We validated our proposed algorithm against the state-of-the-art algorithm using short-term ECG recordings from eleven annotated databases available at Physionet, as well as four ultra-long-term (14-day) ECG recordings which were visually annotated at a central ECG core laboratory. On the database of ultra-long-term ECG recordings, our proposed algorithm showed a sensitivity of 99.90% and a positive predictive value of 99.73%. Meanwhile, the state-of-the-art QRS detection algorithm achieved a sensitivity of 99.30% and a positive predictive value of 99.68% on the same database. The numerical efficiency of our new algorithm was evident, as a 14-day recording sampled at 200 Hz was analyzed in approximately 157 seconds. Conclusions: We developed a new QRS detection algorithm. The efficiency and accuracy of our algorithm makes it a good fit for mobile health applications, ultra-long-term and pathological ECG recordings, and the batch processing of large ECG databases.
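
The two reported metrics and the run-time figure can be put in context with a little arithmetic. The sketch below uses the usual definitions of sensitivity and positive predictive value in terms of true positives, false negatives and false positives. The function names are illustrative, and the beat counts in the usage note are made up, not the study's data.

```python
def sensitivity(true_pos, false_neg):
    """Fraction of real beats that the detector found: TP / (TP + FN)."""
    return true_pos / (true_pos + false_neg)


def positive_predictive_value(true_pos, false_pos):
    """Fraction of detections that are real beats: TP / (TP + FP)."""
    return true_pos / (true_pos + false_pos)


def samples_in_recording(days, sampling_rate_hz):
    """Number of samples in a continuous recording of the given length."""
    return days * 24 * 3600 * sampling_rate_hz
```

With made-up counts of 950 detected beats, 50 missed beats and 25 false detections, `sensitivity(950, 50)` is 0.95 and `positive_predictive_value(950, 25)` is about 0.974. For the run-time claim, `samples_in_recording(14, 200)` is 241,920,000 samples, so finishing in roughly 157 seconds corresponds to about 1.5 million samples per second.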

preprint · 2020 · arXiv

Convergence analysis of Adaptive Locally Iterative Filtering and SIFT method

Adaptive Local Iterative Filtering (ALIF) is a recently proposed time-frequency analysis tool. It has been empirically shown that ALIF is able to separate components and overcome the mode-mixing problem. However, so far its convergence is still an open problem, particularly for highly nonstationary signals, due to the fact that the kernel associated with ALIF is non-translational invariant, non-convolutional and non-symmetric. Our first contribution in this work is providing a convergence analysis of ALIF. From the practical perspective, ALIF depends on a robust frequency estimator, based on which the decomposition can be achieved. Our second contribution is proposing a robust and adaptive decomposition method for noisy and nonstationary signals, which we coined the Synchrosqueezing Iterative Filtering Technique (SIFT). In SIFT, we apply the synchrosqueezing transform to estimate the instantaneous frequency, and then apply the ALIF to decompose a signal. We show numerically the ability of this new approach in handling highly nonstationary signals.

preprint · 2020 · arXiv

Numerical computation of triangular complex spherical designs with small mesh ratio

This paper provides triangular spherical designs for the complex unit sphere $Ω^d$ by exploiting the natural correspondence between the complex unit sphere in $d$ dimensions and the real unit sphere in $2d-1$. The existence of triangular and square complex spherical $t$-designs with the optimal order number of points is established. A variational characterization of triangular complex designs is provided, with particular emphasis on numerical computation of efficient triangular complex designs with good geometric properties as measured by their mesh ratio. We give numerical examples of triangular spherical $t$-designs on complex unit spheres of dimension $d=2$ to $6$.

preprint · 2020 · arXiv

On the behavior of $1$-Laplacian Ratio Cuts on nearly rectangular domains

Given a connected set $Ω_0 \subset \mathbb{R}^2$, define a sequence of sets $(Ω_n)_{n=0}^{\infty}$ where $Ω_{n+1}$ is the subset of $Ω_n$ where the first eigenfunction of the (properly normalized) Neumann $p$-Laplacian $-Δ^{(p)} ϕ = λ_1 |ϕ|^{p-2} ϕ$ is positive (or negative). For $p=1$, this is also referred to as the Ratio Cut of the domain. We conjecture that, unless $Ω_0$ is an isosceles right triangle, these sets converge to the set of rectangles with eccentricity bounded by 2 in the Gromov-Hausdorff distance as long as they have a certain distance to the boundary $\partial Ω_0$. We establish some aspects of this conjecture for $p=1$ where we prove that (1) the 1-Laplacian spectral cut of domains sufficiently close to rectangles of a given aspect ratio is a circular arc that is closer to flat than the original domain (leading eventually to quadrilaterals) and (2) quadrilaterals close to a rectangle of aspect ratio $2$ stay close to quadrilaterals and move closer to rectangles in a suitable metric. We also discuss some numerical aspects and pose many open questions.

preprint · 2020 · arXiv

On the spectral property of kernel-based sensor fusion algorithms of high dimensional data

We apply local laws of random matrices and free probability theory to study the spectral properties of two kernel-based sensor fusion algorithms, nonparametric canonical correlation analysis (NCCA) and alternating diffusion (AD), for two simultaneously recorded high dimensional datasets under the null hypothesis. The matrix of interest is the product of the kernel matrices associated with the datasets, which may not be diagonalizable in general. We prove that in the regime where dimensions of both random vectors are comparable to the sample size, if NCCA and AD are conducted using a smooth kernel function, then the first few nontrivial eigenvalues will converge to real deterministic values provided the datasets are independent Gaussian random vectors. Toward the claimed result, we also provide a convergence rate of eigenvalues of a kernel affinity matrix.

preprint · 2019 · arXiv

Can a composite heart rate variability biomarker shed new insights about autism spectrum disorder in school-aged children?

High-frequency heart rate variability (HRV) has identified parasympathetic nervous system alterations in autism spectrum disorder (ASD). In a cohort of school-aged children with and without ASD, we test a set of alternative linear and nonlinear HRV measures, including phase rectified signal averaging, applied to a segment of resting ECG, for associations with ASD vs. other psychiatric conditions. Using machine learning, we identify HRV measures derived from time, frequency, and geometric signal-analytical domains that (1) identify children with ASD relative to peers with receiver operating curve area of .89, and (2) differentiate such children from those with conduct problems or depression. Despite the small cohort and lack of prospective external validation, these preliminary results warrant larger prospective validation studies.