Source author record

Xiaoqun Zhang

Xiaoqun Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC math.NA eess.IV Machine Learning

Catalog footprint

What is connected

11works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A stochastic three-block splitting algorithm and its application to quantized deep neural networks

Deep neural networks (DNNs) have made great progress in various fields. In particular, the quantized neural network is a promising technique making DNNs compatible on resource-limited devices for memory and computation saving. In this paper, we mainly consider a non-convex minimization model with three blocks to train quantized DNNs and propose a new stochastic three-block alternating minimization (STAM) algorithm to solve it. We develop a convergence theory for the STAM algorithm and obtain an $ε$-stationary point with optimal convergence rate $\mathcal{O}(ε^{-4})$. Furthermore, we apply our STAM algorithm to train DNNs with relaxed binary weights. The experiments are carried out on three different network structures, namely VGG-11, VGG-16 and ResNet-18. These DNNs are trained using two different data sets, CIFAR-10 and CIFAR-100, respectively. We compare our STAM algorithm with some classical efficient algorithms for training quantized neural networks. The test accuracy indicates the effectiveness of STAM algorithm for training relaxed binary quantization DNNs.

preprint2020arXiv

A parameterized Douglas-Rachford Splitting algorithm for nonconvex optimization

In this paper, we study a parameterized Douglas-Rachford splitting method for a class of nonconvex optimization problem. A new merit function is constructed to establish the convergence of the whole sequence generated by the parameterized Douglas-Rachford splitting method. We then apply the parameterized Douglas-Rachford splitting method to three important classes of nonconvex optimization problems arising in data science: sparsity constrained least squares problem, feasibility problem and low rank matrix completion. Numerical results validate the effectiveness of the parameterized Douglas-Rachford splitting method compared with some other classical methods.

preprint2020arXiv

A Stochastic Variance Reduced Primal Dual Fixed Point Method For Linearly Constrained Separable Optimization

In this paper we combine the stochastic variance reduced gradient (SVRG) method [17] with the primal dual fixed point method (PDFP) proposed in [7] to solve a sum of two convex functions and one of which is linearly composite. This type of problems are typically arisen in sparse signal and image reconstruction. The proposed SVRG-PDFP can be seen as a generalization of Prox-SVRG [37] originally designed for the minimization of a sum of two convex functions. Based on some standard assumptions, we propose two variants, one is for strongly convex objective function and the other is for general convex cases. Convergence analysis shows that the convergence rate of SVRG-PDFP is O(1/k) (here k is the iteration number) for general convex objective function and linear for k strongly convex case. Numerical examples on machine learning and CT image reconstruction are provided to show the effectiveness of the algorithms.

preprint2020arXiv

A three-operator splitting algorithm for nonconvex sparsity regularization

Sparsity regularization has been largely applied in many fields, such as signal and image processing and machine learning. In this paper, we mainly consider nonconvex minimization problems involving three terms, for the applications such as: sparse signal recovery and low rank matrix recovery. We employ a three-operator splitting proposed by Davis and Yin (called DYS) to solve the resulting possibly nonconvex problems and develop the convergence theory for this three-operator splitting algorithm in the nonconvex case. We show that if the step size is chosen less than a computable threshold, then the whole sequence converges to a stationary point. By defining a new decreasing energy function associated with the DYS method, we establish the global convergence of the whole sequence and a local convergence rate under an additional assumption that this energy function is a Kurdyka-$Ł$ojasiewicz function. We also provide sufficient conditions for the boundedness of the generated sequence. Finally, some numerical experiments are conducted to compare the DYS algorithm with some classical efficient algorithms for sparse signal recovery and low rank matrix completion. The numerical results indicate that DYS method outperforms the exsiting methods for these specific applications.

preprint2020arXiv

Semi-Implicit Back Propagation

Neural network has attracted great attention for a long time and many researchers are devoted to improve the effectiveness of neural network training algorithms. Though stochastic gradient descent (SGD) and other explicit gradient-based methods are widely adopted, there are still many challenges such as gradient vanishing and small step sizes, which leads to slow convergence and instability of SGD algorithms. Motivated by error back propagation (BP) and proximal methods, we propose a semi-implicit back propagation method for neural network training. Similar to BP, the difference on the neurons are propagated in a backward fashion and the parameters are updated with proximal mapping. The implicit update for both hidden neurons and parameters allows to choose large step size in the training algorithm. Finally, we also show that any fixed point of convergent sequences produced by this algorithm is a stationary point of the objective loss function. The experiments on both MNIST and CIFAR-10 demonstrate that the proposed semi-implicit BP algorithm leads to better performance in terms of both loss decreasing and training/validation accuracy, compared to SGD and a similar algorithm ProxBP.

preprint2019arXiv

Low-Dose CT with Deep Learning Regularization via Proximal Forward Backward Splitting

Low dose X-ray computed tomography (LDCT) is desirable for reduced patient dose. This work develops image reconstruction methods with deep learning (DL) regularization for LDCT. Our methods are based on unrolling of proximal forward-backward splitting (PFBS) framework with data-driven image regularization via deep neural networks. In contrast with PFBS-IR that utilizes standard data fidelity updates via iterative reconstruction (IR) method, PFBS-AIR involves preconditioned data fidelity updates that fuse analytical reconstruction (AR) method and IR in a synergistic way, I.e. fused analytical and iterative reconstruction (AIR). The results suggest that DL-regularized methods (PFBS-IR and PFBS-AIR) provided better reconstruction quality from conventional wisdoms (AR or IR), and DL-based postprocessing method (FBPConvNet). In addition, owing to AIR, PFBS-AIR noticeably outperformed PFBS-IR.

preprint2016arXiv

A primal-dual fixed point algorithm for multi-block convex minimization

We extend a primal-dual fixed point algorithm (PDFP) proposed in [5] to solve two kinds of separable multi-block minimization problems, arising in signal processing and imaging science. This work shows the flexibility of applying PDFP algorithm to multi-block problems and illustrate how practical and fully decoupled schemes can be derived, especially for parallel implementation of large scale problems. The connections and comparisons to the alternating direction method of multiplier (ADMM) are also present. We demonstrate how different algorithms can be obtained by splitting the problems in different ways through the classic example of sparsity regularized least square model with constraint. In particular, for a class of linearly constrained problems, which are of great interest in the context of multi-block ADMM, can be solved by PDFP with a guarantee of convergence. Finally, some experiments are provided to illustrate the performance of several schemes derived by the PDFP algorithm.

preprint2016arXiv

Limited Tomography Reconstruction via Tight Frame and Sinogram Extrapolation

X-ray computed tomography (CT) is one of widely used diagnostic tools for medical and dental tomographic imaging of the human body. However, the standard filtered backprojection reconstruction method requires the complete knowledge of the projection data. In the case of limited data, the inverse problem of CT becomes more ill-posed, which makes the reconstructed image deteriorated by the artifacts. In this paper, we consider two dimensional CT reconstruction using the horizontally truncated projections. Over the decades, the numerous results including the sparsity model based approach has enabled the reconstruction of the image inside the region of interest (ROI) from the limited knowledge of the data. However, unlike these existing methods, we try to reconstruct the entire CT image from the limited knowledge of the sinogram via the tight frame regularization and the simultaneous sinogram extrapolation. Our proposed model shows more promising numerical simulation results compared with the existing sparsity model based approach.

preprint2016arXiv

Simultaneous Reconstruction and Segmentation for Dynamic SPECT Imaging

This work deals with the reconstruction of dynamic images that incorporate characteristic dynamics in certain subregions, as arising for the kinetics of many tracers in emission tomography (SPECT, PET). We make use of a basis function approach for the unknown tracer concentration by assuming that the region of interest can be divided into subregions with spatially constant concentration curves. Applying a regularized variational framework reminiscent of the Chan-Vese model for image segmentation we simultaneously reconstruct both the labelling functions of the subregions as well as the subconcentrations within each region. Our particular focus is on applications in SPECT with Poisson noise model, resulting in a Kullback-Leibler data fidelity in the variational approach. We present a detailed analysis of the proposed variational model and prove existence of minimizers as well as error estimates. The latter apply to a more general class of problems and generalize existing results in literature since we deal with a nonlinear forward operator and a nonquadratic data fidelity. A computational algorithm based on alternating minimization and splitting techniques is developed for the solution of the problem and tested on appropriately designed synthetic data sets. For those we compare the results to those of standard EM reconstructions and investigate the effects of Poisson noise in the data.

preprint2015arXiv

A Hybrid Segmentation and D-bar Method for Electrical Impedance Tomography

The Regularized D-bar method for Electrical Impedance Tomography provides a rigorous mathematical approach for solving the full nonlinear inverse problem directly, i.e. without iterations. It is based on a low-pass filtering in the (nonlinear) frequency domain. However, the resulting D-bar reconstructions are inherently smoothed leading to a loss of edge distinction. In this paper, a novel approach that combines the rigor of the D-bar approach with the edge-preserving nature of Total Variation regularization is presented. The method also includes a data-driven contrast adjustment technique guided by the key functions (CGO solutions) of the D-bar method. The new TV-Enhanced D-bar Method produces reconstructions with sharper edges and improved contrast while still solving the full nonlinear problem. This is achieved by using the TV-induced edges to increase the truncation radius of the scattering data in the nonlinear frequency domain thereby increasing the radius of the low pass filter. The algorithm is tested on numerically simulated noisy EIT data and demonstrates significant improvements in edge preservation and contrast which can be highly valuable for absolute EIT imaging.

preprint2015arXiv

A primal-dual fixed-point algorithm for minimization of the sum of three convex separable functions

Many problems arising in image processing and signal recovery with multi-regularization can be formulated as minimization of a sum of three convex separable functions. Typically, the objective function involves a smooth function with Lipschitz continuous gradient, a linear composite nonsmooth function and a nonsmooth function. In this paper, we propose a primal-dual fixed-point (PDFP) scheme to solve the above class of problems. The proposed algorithm for three block problems is a fully splitting symmetric scheme, only involving explicit gradient and linear operators without inner iteration, when the nonsmooth functions can be easily solved via their proximity operators, such as $\ell_1$ type regularization. We study the convergence of the proposed algorithm and illustrate its efficiency through examples on fused LASSO and image restoration with non-negative constraint and sparse regularization.

Xiaoqun Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

A stochastic three-block splitting algorithm and its application to quantized deep neural networks

A parameterized Douglas-Rachford Splitting algorithm for nonconvex optimization

A Stochastic Variance Reduced Primal Dual Fixed Point Method For Linearly Constrained Separable Optimization

A three-operator splitting algorithm for nonconvex sparsity regularization

Semi-Implicit Back Propagation

Low-Dose CT with Deep Learning Regularization via Proximal Forward Backward Splitting

A primal-dual fixed point algorithm for multi-block convex minimization

Limited Tomography Reconstruction via Tight Frame and Sinogram Extrapolation

Simultaneous Reconstruction and Segmentation for Dynamic SPECT Imaging

A Hybrid Segmentation and D-bar Method for Electrical Impedance Tomography

A primal-dual fixed-point algorithm for minimization of the sum of three convex separable functions