Source author record

Jack Xin

Jack Xin appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

32works

25topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Stochastic Genetic Interacting Particle Method for Reaction-Diffusion-Advection Equations

We develop and analyze a stochastic genetic interacting particle method (SGIP) for reaction-diffusion-advection (RDA) equations. The SGIP method employs operator splitting to approximate the advection-diffusion and reaction processes, treating the former using particle drift-diffusion and the latter via exact or implicit integration of reaction dynamics over bins, where particle density is estimated using a histogram. A key innovation is the incorporation of adaptive resampling to close the loop of particle and density field description of solutions, mimicking the selection mechanism in genetics. Resampling is also crucial for maintaining long-term stability by redistributing particles in accordance with the evolving density field. We provide a comprehensive error analysis and establish convergence bounds under appropriate regularity assumptions. Numerical experiments in one to three space dimensions demonstrate the method's effectiveness across various reaction types (Fisher-Kolmogorov-Petrovsky-Piskunov (FKPP), cubic, Arrhenius) and flow configurations (shear, cellular, cat's eye, Arnold-Beltrami-Childress (ABC) flows), showing excellent agreement with the finite difference method (FDM) while offering computational advantages for complex flow geometries and higher-dimensional problems.

preprint2026arXiv

USEMA: a Scalable Efficient Mamba Like Attention for Medical Image Segmentation

Accurate medical image segmentation is an integral part of the medical image analysis pipeline that requires the ability to merge local and global information. While vision transformers are able to capture global interactions using vanilla self-attention, their quadratic computational complexity in the input size remains a struggle for medical image segmentation tasks. Motivated by the dispersion property of vanilla self-attention and recent development of Mamba form of attention, Scalable and Efficient Mamba like Attention (SEMA) utilizes token localization via local window attention to avoid dispersion and maintain focusing, complemented by theoretically consistent arithmetic averaging to capture global aspect of attention. In this work, we present USEMA, a hybrid UNet architecture that merges the local feature extraction ability of convolutional neural networks (CNNs) with SEMA attention. We conduct experiments with USEMA across a variety of modalities and image sizes, demonstrating improved computational efficiency compared to transformer based models using full self-attention, and superior segmentation performance relative to purely convolution and Mamba-based models.

preprint2022arXiv

An integrated recurrent neural network and regression model with spatial and climatic couplings for vector-borne disease dynamics

We developed an integrated recurrent neural network and nonlinear regression spatio-temporal model for vector-borne disease evolution. We take into account climate data and seasonality as external factors that correlate with disease transmitting insects (e.g. flies), also spill-over infections from neighboring regions surrounding a region of interest. The climate data is encoded to the model through a quadratic embedding scheme motivated by recommendation systems. The neighboring regions' influence is modeled by a long short-term memory neural network. The integrated model is trained by stochastic gradient descent and tested on leish-maniasis data in Sri Lanka from 2013-2018 where infection outbreaks occurred. Our model outperformed ARIMA models across a number of regions with high infections, and an associated ablation study renders support to our modeling hypothesis and ideas.

preprint2022arXiv

Channel Pruning In Quantization-aware Training: An Adaptive Projection-gradient Descent-shrinkage-splitting Method

We propose an adaptive projection-gradient descent-shrinkage-splitting method (APGDSSM) to integrate penalty based channel pruning into quantization-aware training (QAT). APGDSSM concurrently searches weights in both the quantized subspace and the sparse subspace. APGDSSM uses shrinkage operator and a splitting technique to create sparse weights, as well as the Group Lasso penalty to push the weight sparsity into channel sparsity. In addition, we propose a novel complementary transformed l1 penalty to stabilize the training for extreme compression.

preprint2022arXiv

DeepParticle: learning invariant measure by a deep neural network minimizing Wasserstein distance on data generated from an interacting particle method

We introduce the so called DeepParticle method to learn and generate invariant measures of stochastic dynamical systems with physical parameters based on data computed from an interacting particle method (IPM). We utilize the expressiveness of deep neural networks (DNNs) to represent the transform of samples from a given input (source) distribution to an arbitrary target distribution, neither assuming distribution functions in closed form nor a finite state space for the samples. In training, we update the network weights to minimize a discrete Wasserstein distance between the input and target samples. To reduce computational cost, we propose an iterative divide-and-conquer (a mini-batch interior point) algorithm, to find the optimal transition matrix in the Wasserstein distance. We present numerical results to demonstrate the performance of our method for accelerating IPM computation of invariant measures of stochastic dynamical systems arising in computing reaction-diffusion front speeds through chaotic flows. The physical parameter is a large Peclét number reflecting the advection dominated regime of our interest.

preprint2022arXiv

Enhancing Zero-Shot Many to Many Voice Conversion with Self-Attention VAE

Variational auto-encoder (VAE) is an effective neural network architecture to disentangle a speech utterance into speaker identity and linguistic content latent embeddings, then generate an utterance for a target speaker from that of a source speaker. This is possible by concatenating the identity embedding of the target speaker and the content embedding of the source speaker uttering a desired sentence. In this work, we propose to improve VAE models with self-attention and structural regularization (RGSM). Specifically, we found a suitable location of VAE's decoder to add a self-attention layer for incorporating non-local information in generating a converted utterance and hiding the source speaker's identity. We applied relaxed group-wise splitting method (RGSM) to regularize network weights and remarkably enhance generalization performance. In experiments of zero-shot many-to-many voice conversion task on VCTK data set, with the self-attention layer and relaxed group-wise splitting method, our model achieves a gain of speaker classification accuracy on unseen speakers by 28.3\% while slightly improved conversion voice quality in terms of MOSNet scores. Our encouraging findings point to future research on integrating more variety of attention structures in VAE framework while controlling model size and overfitting for advancing zero-shot many-to-many voice conversions.

preprint2022arXiv

glassoformer: a query-sparse transformer for post-fault power grid voltage prediction

We propose GLassoformer, a novel and efficient transformer architecture leveraging group Lasso regularization to reduce the number of queries of the standard self-attention mechanism. Due to the sparsified queries, GLassoformer is more computationally efficient than the standard transformers. On the power grid post-fault voltage prediction task, GLassoformer shows remarkably better prediction than many existing benchmark algorithms in terms of accuracy and stability.

preprint2022arXiv

Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs

Learning neural ODEs often requires solving very stiff ODE systems, primarily using explicit adaptive step size ODE solvers. These solvers are computationally expensive, requiring the use of tiny step sizes for numerical stability and accuracy guarantees. This paper considers learning neural ODEs using implicit ODE solvers of different orders leveraging proximal operators. The proximal implicit solver consists of inner-outer iterations: the inner iterations approximate each implicit update step using a fast optimization algorithm, and the outer iterations solve the ODE system over time. The proximal implicit ODE solver guarantees superiority over explicit solvers in numerical stability and computational efficiency. We validate the advantages of proximal implicit solvers over existing popular neural ODE solvers on various challenging benchmark tasks, including learning continuous-depth graph neural networks and continuous normalizing flows.

preprint2022arXiv

RARTS: An Efficient First-Order Relaxed Architecture Search Method

Differentiable architecture search (DARTS) is an effective method for data-driven neural network design based on solving a bilevel optimization problem. Despite its success in many architecture search tasks, there are still some concerns about the accuracy of first-order DARTS and the efficiency of the second-order DARTS. In this paper, we formulate a single level alternative and a relaxed architecture search (RARTS) method that utilizes the whole dataset in architecture learning via both data and network splitting, without involving mixed second derivatives of the corresponding loss functions like DARTS. In our formulation of network splitting, two networks with different but related weights cooperate in search of a shared architecture. The advantage of RARTS over DARTS is justified by a convergence theorem and an analytically solvable model. Moreover, RARTS outperforms DARTS and its variants in accuracy and search efficiency, as shown in adequate experimental results. For the task of searching topological architecture, i.e., the edges and the operations, RARTS obtains a higher accuracy and 60\% reduction of computational cost than second-order DARTS on CIFAR-10. RARTS continues to out-perform DARTS upon transfer to ImageNet and is on par with recent variants of DARTS even though our innovation is purely on the training algorithm without modifying search space. For the task of searching width, i.e., the number of channels in convolutional layers, RARTS also outperforms the traditional network pruning benchmarks. Further experiments on the public architecture search benchmark like NATS-Bench also support the preeminence of RARTS.

preprint2022arXiv

Searching Intrinsic Dimensions of Vision Transformers

It has been shown by many researchers that transformers perform as well as convolutional neural networks in many computer vision tasks. Meanwhile, the large computational costs of its attention module hinder further studies and applications on edge devices. Some pruning methods have been developed to construct efficient vision transformers, but most of them have considered image classification tasks only. Inspired by these results, we propose SiDT, a method for pruning vision transformer backbones on more complicated vision tasks like object detection, based on the search of transformer dimensions. Experiments on CIFAR-100 and COCO datasets show that the backbones with 20\% or 40\% dimensions/parameters pruned can have similar or even better performance than the unpruned models. Moreover, we have also provided the complexity analysis and comparisons with the previous pruning methods.

preprint2021arXiv

Structure Assisted NMF Methods for Separation of Degenerate Mixture Data with Application to NMR Spectroscopy

In this paper, we develop structure assisted nonnegative matrix factorization (NMF) methods for blind source separation of degenerate data. The motivation originates from nuclear magnetic resonance (NMR) spectroscopy, where a multiple mixture NMR spectra are recorded to identify chemical compounds with similar structures. Consider the linear mixing model (LMM), we aim to identify the chemical compounds involved when the mixing process is known to be nearly singular. We first consider a class of data with dominant interval(s) (DI) where each of source signals has dominant peaks over others. Besides, a nearly singular mixing process produces degenerate mixtures. The DI condition implies clustering structures in the data points. Hence, the estimation of the mixing matrix could be achieved by data clustering. Due to the presence of the noise and the degeneracy of the data, a small deviation in the estimation may introduce errors in the output. To resolve this problem and improve robustness of the separation, methods are developed in two aspects. One is to find better estimation of the mixing matrix by allowing a constrained perturbation to the clustering output, and it can be achieved by a quadratic programming. The other is to seek sparse source signals by exploiting the DI condition, and it solves an $\ell_1$ optimization. If no source information is available, we propose to adopt the nonnegative matrix factorization approach by incorporating the matrix structure (parallel columns of the mixing matrix) into the cost function and develop multiplicative iteration rules for the numerical solutions. We present experimental results of NMR data to show the performance and reliability of the method in the applications arising in NMR spectroscopy.

preprint2020arXiv

A Recurrent Neural Network and Differential Equation Based Spatiotemporal Infectious Disease Model with Application to COVID-19

The outbreaks of Coronavirus Disease 2019 (COVID-19) have impacted the world significantly. Modeling the trend of infection and real-time forecasting of cases can help decision making and control of the disease spread. However, data-driven methods such as recurrent neural networks (RNN) can perform poorly due to limited daily samples in time. In this work, we develop an integrated spatiotemporal model based on the epidemic differential equations (SIR) and RNN. The former after simplification and discretization is a compact model of temporal infection trend of a region while the latter models the effect of nearest neighboring regions. The latter captures latent spatial information. %that is not publicly reported. We trained and tested our model on COVID-19 data in Italy, and show that it out-performs existing temporal models (fully connected NN, SIR, ARIMA) in 1-day, 3-day, and 1-week ahead forecasting especially in the regime of limited training data.

preprint2020arXiv

Convergence of a Relaxed Variable Splitting Method for Learning Sparse Neural Networks via $\ell_1, \ell_0$, and transformed-$\ell_1$ Penalties

Sparsification of neural networks is one of the effective complexity reduction methods to improve efficiency and generalizability. We consider the problem of learning a one hidden layer convolutional neural network with ReLU activation function via gradient descent under sparsity promoting penalties. It is known that when the input data is Gaussian distributed, no-overlap networks (without penalties) in regression problems with ground truth can be learned in polynomial time at high probability. We propose a relaxed variable splitting method integrating thresholding and gradient descent to overcome the lack of non-smoothness in the loss function. The sparsity in network weight is realized during the optimization (training) process. We prove that under $\ell_1, \ell_0$; and transformed-$\ell_1$ penalties, no-overlap networks can be learned with high probability, and the iterative weights converge to a global limit which is a transformation of the true weight under a novel thresholding operation. Numerical experiments confirm theoretical findings, and compare the accuracy and sparsity trade-off among the penalties.

preprint2020arXiv

Convergence of stochastic structure-preserving schemes for computing effective diffusivity in random flows

In this paper, we propose stochastic structure-preserving schemes to compute the effective diffusivity for particles moving in random flows. We first introduce the motion of particles using the Lagrangian formulation, which is modeled by stochastic differential equations (SDEs). We also discuss the definition of the corrector problem and effective diffusivity. Then we propose stochastic structure-preserving schemes to solve the SDEs and provide a sharp convergence analysis for the numerical schemes in computing effective diffusivity. The convergence analysis follows a probabilistic approach, which interprets the solution process generated by our numerical schemes as a Markov process. By using the central limit theorem for the solution process, we obtain the convergence analysis of our method in computing long time solutions. Most importantly our convergence analysis reveals the connection of discrete-type and continuous-type corrector problems, which is fundamental and interesting. We present numerical results to demonstrate the accuracy and efficiency of the proposed method and investigate the convection-enhanced diffusion phenomenon in two- and three-dimensional incompressible random flows.

preprint2020arXiv

Enhanced Diffusivity in Perturbed Senile Reinforced Random Walk Models

We consider diffusivity of random walks with transition probabilities depending on the number of consecutive traversals of the last traversed edge, the so called senile reinforced random walk (SeRW). In one dimension, the walk is known to be sub-diffusive with identity reinforcement function. We perturb the model by introducing a small probability $δ$ of escaping the last traversed edge at each step. The perturbed SeRW model is diffusive for any $δ>0 $, with enhanced diffusivity ($\gg O(δ^2)$) in the small $δ$ regime. We further study stochastically perturbed SeRW models by having the last edge escape probability of the form $δ\, ξ_n$ with $ξ_n$'s being independent random variables. Enhanced diffusivity in such models are logarithmically close to the so called residual diffusivity (positive in the zero $δ$ limit), with diffusivity between $O\left(\frac{1}{|\logδ|}\right)$ and $O\left(\frac{1}{\log|\logδ|}\right)$. Finally, we generalize our results to higher dimensions where the unperturbed model is already diffusive. The enhanced diffusivity can be as much as $O(\log^{-2}δ)$.

preprint2020arXiv

Lorentzian Peak Sharpening and Sparse Blind Source Separation for NMR Spectroscopy

In this paper, we introduce a preprocessing technique for blind source separation (BSS) of nonnegative and overlapped data. For Nuclear Magnetic Resonance spectroscopy (NMR), the classical method of Naanaa and Nuzillard (NN) requires the condition that source signals to be non-overlapping at certain locations while they are allowed to overlap with each other elsewhere. NN's method works well with data signals that possess stand alone peaks (SAP). The SAP does not hold completely for realistic NMR spectra however. Violation of SAP often introduces errors or artifacts in the NN's separation results. To address this issue, a preprocessing technique is developed here based on Lorentzian peak shapes and weighted peak sharpening. The idea is to superimpose the original peak signal with its weighted negative second order derivative. The resulting sharpened (narrower and taller) peaks enable NN's method to work with a more relaxed SAP condition, the so called dominant peaks condition (DPS), and deliver improved results. To achieve an optimal sharpening while preserving the data nonnegativity, we prove the existence of an upper bound of the weight parameter and propose a selection criterion. Numerical experiments on NMR spectroscopy data show satisfactory performance of our proposed method.

preprint2020arXiv

Two-Grid based Adaptive Proper Orthogonal Decomposition Algorithm for Time Dependent Partial Differential Equations

In this article, we propose a two-grid based adaptive proper orthogonal decomposition (POD) method to solve the time dependent partial differential equations. Based on the error obtained in the coarse grid, we propose an error indicator for the numerical solution obtained in the fine grid. Our new algorithm is cheap and easy to be implement. We apply our new method to the solution of time-dependent advection-diffusion equations with the Kolmogorov flow and the ABC flow. The numerical results show that our method is more efficient than the existing POD methods.

preprint2016arXiv

Ballistic Orbits and Front Speed Enhancement for ABC Flows

We study the two main types of trajectories of the ABC flow in the near-integrable regime: spiral orbits and edge orbits. The former are helical orbits which are perturbations of similar orbits that exist in the integrable regime, while the latter exist only in the non-integrable regime. We prove existence of ballistic (i.e., linearly growing) spiral orbits by using the contraction mapping principle in the Hamiltonian formulation, and we also find and analyze ballistic edge orbits. We discuss the relationship of existence of these orbits with questions concerning front propagation in the presence of flows, in particular, the question of linear (i.e., maximal possible) front speed enhancement rate for ABC flows.

preprint2016arXiv

Iterative $\ell_1$ minimization for non-convex compressed sensing

An algorithmic framework, based on the difference of convex functions algorithm (DCA), is proposed for minimizing a class of concave sparse metrics for compressed sensing problems. The resulting algorithm iterates a sequence of $\ell_1$ minimization problems. An exact sparse recovery theory is established to show that the proposed framework always improves on the basis pursuit ($\ell_1$ minimization) and inherits robustness from it. Numerical examples on success rates of sparse solution recovery illustrate further that, unlike most existing non-convex compressed sensing solvers in the literature, our method always out-performs basis pursuit, no matter how ill-conditioned the measurement matrix is. Moreover, the iterative $\ell_1$ (IL$_1$) algorithm lead by a wide margin the state-of-the-art algorithms on $\ell_{1/2}$ and logarithimic minimizations in the strongly coherent (highly ill-conditioned) regime, despite the same objective functions. Last but not least, in the application of magnetic resonance imaging (MRI), IL$_1$ algorithm easily recovers the phantom image with just 7 line projections.

preprint2016arXiv

Minimization of Transformed $L_1$ Penalty: Closed Form Representation and Iterative Thresholding Algorithms

The transformed $l_1$ penalty (TL1) functions are a one parameter family of bilinear transformations composed with the absolute value function. When acting on vectors, the TL1 penalty interpolates $l_0$ and $l_1$ similar to $l_p$ norm ($p \in (0,1)$). In our companion paper, we showed that TL1 is a robust sparsity promoting penalty in compressed sensing (CS) problems for a broad range of incoherent and coherent sensing matrices. Here we develop an explicit fixed point representation for the TL1 regularized minimization problem. The TL1 thresholding functions are in closed form for all parameter values. In contrast, the $l_p$ thresholding functions ($p \in [0,1]$) are in closed form only for $p=0,1,1/2,2/3$, known as hard, soft, half, and 2/3 thresholding respectively. The TL1 threshold values differ in subcritical (supercritical) parameter regime where the TL1 threshold functions are continuous (discontinuous) similar to soft-thresholding (half-thresholding) functions. We propose TL1 iterative thresholding algorithms and compare them with hard and half thresholding algorithms in CS test problems. For both incoherent and coherent sensing matrices, a proposed TL1 iterative thresholding algorithm with adaptive subcritical and supercritical thresholds consistently performs the best in sparse signal recovery with and without measurement noise.

preprint2016arXiv

Periodic orbits of the ABC flow with $A=B=C=1$

In this paper, we prove that the ODE system $$ \begin{align*} \dot x &=\sin z+\cos y\\ \dot y &= \sin x+\cos z\\ \dot z &=\sin y + \cos x, \end{align*} $$ whose right-hand side is the Arnold-Beltrami-Childress (ABC) flow with parameters $A=B=C=1$, has periodic orbits on $(2π\mathbb T)^3$ with rotation vectors parallel to $(1,0,0)$, $(0,1,0)$, and $(0,0,1)$. An application of this result is that the well-known G-equation model for turbulent combustion with this ABC flow on $\mathbb R^3$ has a linear (i.e., maximal possible) flame speed enhancement rate as the amplitude of the flow grows.

preprint2016arXiv

Transformed Schatten-1 Iterative Thresholding Algorithms for Low Rank Matrix Completion

We study a non-convex low-rank promoting penalty function, the transformed Schatten-1 (TS1), and its applications in matrix completion. The TS1 penalty, as a matrix quasi-norm defined on its singular values, interpolates the rank and the nuclear norm through a nonnegative parameter a. We consider the unconstrained TS1 regularized low-rank matrix recovery problem and develop a fixed point representation for its global minimizer. The TS1 thresholding functions are in closed analytical form for all parameter values. The TS1 threshold values differ in subcritical (supercritical) parameter regime where the TS1 threshold functions are continuous (discontinuous). We propose TS1 iterative thresholding algorithms and compare them with some state-of-the-art algorithms on matrix completion test problems. For problems with known rank, a fully adaptive TS1 iterative thresholding algorithm consistently performs the best under different conditions with ground truth matrix being multivariate Gaussian at varying covariance. For problems with unknown rank, TS1 algorithms with an additional rank estimation procedure approach the level of IRucL-q which is an iterative reweighted algorithm, non-convex in nature and best in performance.

preprint2015arXiv

Computational Modeling of Spectral Data Fitting with Nonlinear Distortions

Substances such as chemical compounds are invisible to human eyes, they are usually captured by sensing equipments with their spectral fingerprints. Though spectra of pure chemicals can be identified by visual inspection, the spectra of their mixtures take a variety of complicated forms. Given the knowledge of spectral references of the constituent chemicals, the task of data fitting is to retrieve their weights, and this usually can be obtained by solving a least squares problem. Complications occur if the basis functions (reference spectra) may not be used directly to best fit the data. In fact, random distortions (spectral variability) such as shifting, compression, and expansion have been observed in some source spectra when the underlying substances are mixed. In this paper, we formulate mathematical model for such nonlinear effects and build them into data fitting algorithms. If minimal knowledge of the distortions is available, a deterministic approach termed {\it augmented least squares} is developed and it fits the spectral references along with their derivatives to the mixtures. If the distribution of the distortions is known a prior, we consider to solve the problem with maximum likelihood estimators which incorporate the shifts into the variance matrix. The proposed methods are substantiated with numerical examples including data from Raman spectroscopy (RS), nuclear magnetic resonance (NMR), and differential optical absorption spectroscopy (DOAS) and show satisfactory results.

preprint2015arXiv

Weighted Elastic Net Penalized Mean-Variance Portfolio Design and Computation

It is well known that the out-of-sample performance of Markowitz's mean-variance portfolio criterion can be negatively affected by estimation errors in the mean and covariance. In this paper we address the problem by regularizing the mean-variance objective function with a weighted elastic net penalty. We show that the use of this penalty can be motivated by a robust reformulation of the mean-variance criterion that directly accounts for parameter uncertainty. With this interpretation of the weighted elastic net penalty we derive data driven techniques for calibrating the weighting parameters based on the level of uncertainty in the parameter estimates. We test our proposed technique on US stock return data and our results show that the calibrated weighted elastic net penalized portfolio outperforms both the unpenalized portfolio and uniformly weighted elastic net penalized portfolio. This paper also introduces a novel Adaptive Support Split-Bregman approach which leverages the sparse nature of $\ell_{1}$ penalized portfolios to efficiently compute a solution of our proposed portfolio criterion. Numerical results show that this modification to the Split-Bregman algorithm results in significant improvements in computational speed compared with other techniques.

preprint2014arXiv

A Computational Study of Residual KPP Front Speeds in Time-Periodic Cellular Flows in the Small Diffusion Limit

The minimal speeds ($c^*$) of the Kolmogorov-Petrovsky-Piskunov (KPP) fronts at small diffusion ($ε\ll 1$) in a class of time-periodic cellular flows with chaotic streamlines is investigated in this paper. The variational principle of $c^*$ reduces the computation to that of a principal eigenvalue problem on a periodic domain of a linear advection-diffusion operator with space-time periodic coefficients and small diffusion. To solve the advection dominated time-dependent eigenvalue problem efficiently over large time, a combination of finite element and spectral methods, as well as the associated fast solvers, are utilized to accelerate computation. In contrast to the scaling $c^*=\mathcal{O}(ε^{1/4})$ in steady cellular flows, a new relation $c^* = \mathcal{O}(1)$ as $ε\ll 1$ is revealed in the time-periodic cellular flows due to the presence of chaotic streamlines. Residual propagation speed emerges from the Lagrangian chaos which is quantified as a sub-diffusion process.

preprint2014arXiv

PhaseLiftOff: an Accurate and Stable Phase Retrieval Method Based on Difference of Trace and Frobenius Norms

Phase retrieval aims to recover a signal $x \in \mathbb{C}^{n}$ from its amplitude measurements $|<x, a_i > |^2$, $i=1,2,...,m$, where $a_i$'s are over-complete basis vectors, with $m$ at least $3n -2$ to ensure a unique solution up to a constant phase factor. The quadratic measurement becomes linear in terms of the rank-one matrix $X = x x^*$. Phase retrieval is then a rank-one minimization problem subject to linear constraint for which a convex relaxation based on trace-norm minimization (PhaseLift) has been extensively studied recently. At $m=O(n)$, PhaseLift recovers with high probability the rank-one solution. In this paper, we present a precise proxy of rank-one condition via the difference of trace and Frobenius norms which we call PhaseLiftOff. The associated least squares minimization with this penalty as regularization is equivalent to the rank-one least squares problem under a mild condition on the measurement noise. Stable recovery error estimates are valid at $m=O(n)$ with high probability. Computation of PhaseLiftOff minimization is carried out by a convergent difference of convex functions algorithm. In our numerical example, $a_i$'s are Gaussian distributed. Numerical results show that PhaseLiftOff outperforms PhaseLift and its nonconvex variant (log-determinant regularization), and successfully recovers signals near the theoretical lower limit on the number of measurements without the noise.

preprint2013arXiv

A Method for Finding Structured Sparse Solutions to Non-negative Least Squares Problems with Applications

Demixing problems in many areas such as hyperspectral imaging and differential optical absorption spectroscopy (DOAS) often require finding sparse nonnegative linear combinations of dictionary elements that match observed data. We show how aspects of these problems, such as misalignment of DOAS references and uncertainty in hyperspectral endmembers, can be modeled by expanding the dictionary with grouped elements and imposing a structured sparsity assumption that the combinations within each group should be sparse or even 1-sparse. If the dictionary is highly coherent, it is difficult to obtain good solutions using convex or greedy methods, such as non-negative least squares (NNLS) or orthogonal matching pursuit. We use penalties related to the Hoyer measure, which is the ratio of the $l_1$ and $l_2$ norms, as sparsity penalties to be added to the objective in NNLS-type models. For solving the resulting nonconvex models, we propose a scaled gradient projection algorithm that requires solving a sequence of strongly convex quadratic programs. We discuss its close connections to convex splitting methods and difference of convex programming. We also present promising numerical results for example DOAS analysis and hyperspectral demixing problems.

preprint2012arXiv

A Numerical Study of Turbulent Flame Speeds of Curvature and Strain G-equations in Cellular Flows

We study front speeds of curvature and strain G-equations arising in turbulent combustion. These G-equations are Hamilton-Jacobi type level set partial differential equations (PDEs) with non-coercive Hamiltonians and degenerate nonlinear second order diffusion. The Hamiltonian of strain G-equation is also non-convex. Numerical computation is performed based on monotone discretization and weighted essentially nonoscillatory (WENO) approximation of transformed G-equations on a fixed periodic domain. The advection field in the computation is a two dimensional Hamiltonian flow consisting of a periodic array of counter-rotating vortices, or cellular flows. Depending on whether the evolution is predominantly in the hyperbolic or parabolic regimes, suitable explicit and semi-implicit time stepping methods are chosen. The turbulent flame speeds are computed as the linear growth rates of large time solutions. A new nonlinear parabolic PDE is proposed for the reinitialization of level set functions to prevent piling up of multiple bundles of level sets on the periodic domain. We found that the turbulent flame speed $s_T$ of the curvature G-equation is enhanced as the intensity $A$ of cellular flows increases, at a rate between those of the inviscid and viscous G-equations. The $s_T$ of the strain G-equation increases in small $A$, decreases in larger $A$, then drops down to zero at a large enough but finite value $A_{*}$. The flame front ceases to propagate at this critical intensity $A_*$, and is quenched by the cellular flow.

preprint2012arXiv

Turbulent Flame Speeds of G-equation Models in Unsteady Cellular Flows

We perform a computationl study of front speeds of G-equation models in time dependent cellular flows. The G-equations arise in premixed turbulent combustion, and are Hamilton-Jacobi type level set partial differential equations (PDEs). The curvature-strain G equations are also non-convex with degenerate diffusion. The computation is based on monotone finite difference discretization and weighted essentially nonoscillatory (WENO) methods. We found that the large time front speeds lock into the frequency of time periodic cellular flows in curvature-strain G-equations similar to what occurs in the basic inviscid G-equation. However, such frequency locking phenomenon disappears in viscous G-equation, and in the inviscid G-equation if time periodic oscillation of the cellular flow is replaced by time stochastic oscillation.

preprint2011arXiv

A convex model for non-negative matrix factorization and dimensionality reduction on physical space

A collaborative convex framework for factoring a data matrix $X$ into a non-negative product $AS$, with a sparse coefficient matrix $S$, is proposed. We restrict the columns of the dictionary matrix $A$ to coincide with certain columns of the data matrix $X$, thereby guaranteeing a physically meaningful dictionary and dimensionality reduction. We use $l_{1,\infty}$ regularization to select the dictionary from the data and show this leads to an exact convex relaxation of $l_0$ in the case of distinct noise free data. We also show how to relax the restriction-to-$X$ constraint by initializing an alternating minimization approach with the solution of the convex model, obtaining a dictionary close to but not necessarily in $X$. We focus on applications of the proposed framework to hyperspectral endmember and abundances identification and also show an application to blind source separation of NMR data.

preprint2011arXiv

Analysis and Comparison of Large Time Front Speeds in Turbulent Combustion Models

Predicting turbulent flame speed (the large time front speed) is a fundamental problem in turbulent combustion theory. Several models have been proposed to study the turbulent flame speed, such as the G-equations, the F-equations (Majda-Souganidis model) and reaction-diffusion-advection (RDA) equations. In the first part of this paper, we show that flow induced strain reduces front speeds of G-equations in periodic compressible and shear flows. The F-equations arise in asymptotic analysis of reaction-diffusion-advection equations and are quadratically nonlinear analogues of the G-equations. In the second part of the paper, we compare asymptotic growth rates of the turbulent flame speeds from the G-equations, the F-equations and the RDA equations in the large amplitude ($A$) regime of spatially periodic flows. The F and G equations share the same asymptotic front speed growth rate; in particular, the same sublinear growth law $A\over \log(A)$ holds in cellular flows. Moreover, in two space dimensions, if one of these three models (G-equation, F-equation and the RDA equation) predicts the bending effect (sublinear growth in the large flow), so will the other two. The nonoccurrence of speed bending is characterized by the existence of periodic orbits on the torus and the property of their rotation vectors in the advective flow fields. The cat's eye flow is discussed as a typical example of directional dependence of the front speed bending. The large time front speeds of the viscous F-equation have the same growth rate as those of the inviscid F and G-equations in two dimensional periodic incompressible flows.

preprint2005arXiv

Variational Principle of KPP Front Speeds in Temporally Random Shear Flows

We establish the variational principle of Kolmogorov-Petrovsky-Piskunov (KPP) front speeds in temporally random shear flows inside an infinite cylinder, under suitable assumptions of the shear field. A key quantity in the variational principle is the almost sure Lyapunov exponent of a heat operator with random potential. The variational principle then allows us to bound and compute the front speeds. We show the linear and quadratic laws of speed enhancement as well as a resonance-like dependence of front speed on the temporal shear correlation length. To prove the variational principle, we use the comparison principle of solutions, the path integral representation of solutions, and large deviation estimates of the associated stochastic flows.

Jack Xin

What is connected

Connect this record

See the researcher in context

Building this map preview

32 published item(s)

A Stochastic Genetic Interacting Particle Method for Reaction-Diffusion-Advection Equations

USEMA: a Scalable Efficient Mamba Like Attention for Medical Image Segmentation

An integrated recurrent neural network and regression model with spatial and climatic couplings for vector-borne disease dynamics

Channel Pruning In Quantization-aware Training: An Adaptive Projection-gradient Descent-shrinkage-splitting Method

DeepParticle: learning invariant measure by a deep neural network minimizing Wasserstein distance on data generated from an interacting particle method

Enhancing Zero-Shot Many to Many Voice Conversion with Self-Attention VAE

glassoformer: a query-sparse transformer for post-fault power grid voltage prediction

Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs

RARTS: An Efficient First-Order Relaxed Architecture Search Method

Searching Intrinsic Dimensions of Vision Transformers

Structure Assisted NMF Methods for Separation of Degenerate Mixture Data with Application to NMR Spectroscopy

A Recurrent Neural Network and Differential Equation Based Spatiotemporal Infectious Disease Model with Application to COVID-19

Convergence of a Relaxed Variable Splitting Method for Learning Sparse Neural Networks via $\ell_1, \ell_0$, and transformed-$\ell_1$ Penalties

Convergence of stochastic structure-preserving schemes for computing effective diffusivity in random flows

Enhanced Diffusivity in Perturbed Senile Reinforced Random Walk Models

Lorentzian Peak Sharpening and Sparse Blind Source Separation for NMR Spectroscopy

Two-Grid based Adaptive Proper Orthogonal Decomposition Algorithm for Time Dependent Partial Differential Equations

Ballistic Orbits and Front Speed Enhancement for ABC Flows

Iterative $\ell_1$ minimization for non-convex compressed sensing

Minimization of Transformed $L_1$ Penalty: Closed Form Representation and Iterative Thresholding Algorithms

Periodic orbits of the ABC flow with $A=B=C=1$

Transformed Schatten-1 Iterative Thresholding Algorithms for Low Rank Matrix Completion

Computational Modeling of Spectral Data Fitting with Nonlinear Distortions

Weighted Elastic Net Penalized Mean-Variance Portfolio Design and Computation

A Computational Study of Residual KPP Front Speeds in Time-Periodic Cellular Flows in the Small Diffusion Limit

PhaseLiftOff: an Accurate and Stable Phase Retrieval Method Based on Difference of Trace and Frobenius Norms

A Method for Finding Structured Sparse Solutions to Non-negative Least Squares Problems with Applications

A Numerical Study of Turbulent Flame Speeds of Curvature and Strain G-equations in Cellular Flows

Turbulent Flame Speeds of G-equation Models in Unsteady Cellular Flows

A convex model for non-negative matrix factorization and dimensionality reduction on physical space

Analysis and Comparison of Large Time Front Speeds in Turbulent Combustion Models

Variational Principle of KPP Front Speeds in Temporally Random Shear Flows