Source author record

Ziyan Luo

Ziyan Luo appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC math.CO math.SP physics.app-ph cond-mat.mtrl-sci Machine Learning math.NA physics.optics Computer Vision cond-mat.mes-hall Information Theory math.IT math.RA math.ST Numerical Analysis Statistics Theory

Catalog footprint

What is connected

18works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Preconditioned Inexact Stochastic ADMM for Deep Model

The recent advancement of foundation models (FMs) has brought about a paradigm shift, revolutionizing various sectors worldwide. The popular optimizers used to train these models are stochastic gradient descent-based algorithms, which face inherent limitations, such as slow convergence and stringent assumptions for convergence. In particular, data heterogeneity arising from distributed settings poses significant challenges to their theoretical and numerical performance. This paper develops an algorithm, PISA (Preconditioned Inexact Stochastic Alternating Direction Method of Multipliers). Grounded in rigorous theoretical guarantees, the algorithm converges under the sole assumption of Lipschitz continuity of the gradient on a bounded region, thereby removing the need for other conditions commonly imposed by stochastic methods. This capability enables the proposed algorithm to tackle the challenge of data heterogeneity effectively. Moreover, the algorithmic architecture enables scalable parallel computing and supports various preconditions, such as second-order information, second moment, and orthogonalized momentum by Newton-Schulz iterations. Incorporating the latter two preconditions in PISA yields two computationally efficient variants: SISA and NSISA. Comprehensive experimental evaluations for training or fine-tuning diverse deep models, including vision models, large language models, reinforcement learning models, generative adversarial networks, and recurrent neural networks, demonstrate superior numerical performance of SISA and NSISA compared to various state-of-the-art optimizers.

preprint2022arXiv

2D+3D facial expression recognition via embedded tensor manifold regularization

In this paper, a novel approach via embedded tensor manifold regularization for 2D+3D facial expression recognition (FERETMR) is proposed. Firstly, 3D tensors are constructed from 2D face images and 3D face shape models to keep the structural information and correlations. To maintain the local structure (geometric information) of 3D tensor samples in the low-dimensional tensors space during the dimensionality reduction, the $\ell_0$-norm of the core tensors and a tensor manifold regularization scheme embedded on core tensors are adopted via a low-rank truncated Tucker decomposition on the generated tensors. As a result, the obtained factor matrices will be used for facial expression classification prediction. To make the resulting tensor optimization more tractable, $\ell_1$-norm surrogate is employed to relax $\ell_0$-norm and hence the resulting tensor optimization problem has a nonsmooth objective function due to the $\ell_1$-norm and orthogonal constraints from the orthogonal Tucker decomposition. To efficiently tackle this tensor optimization problem, we establish the first-order optimality condition in terms of stationary points, and then design a block coordinate descent (BCD) algorithm with convergence analysis and the computational complexity. Numerical results on BU-3DFE database and Bosphorus databases demonstrate the effectiveness of our proposed approach.

preprint2022arXiv

Low Rank Approximation of Dual Complex Matrices

Dual complex numbers can represent rigid body motion in 2D spaces. Dual complex matrices are linked with screw theory, and have potential applications in various areas. In this paper, we study low rank approximation of dual complex matrices. We define $2$-norm for dual complex vectors, and Frobenius norm for dual complex matrices. These norms are nonnegative dual numbers. We establish the unitary invariance property of dual complex matrices. We study eigenvalues of square dual complex matrices, and show that an $n \times n$ dual complex Hermitian matrix has exactly $n$ eigenvalues, which are dual numbers. We present a singular value decomposition (SVD) theorem for dual complex matrices, define ranks and appreciable ranks for dual complex matrices, and study their properties. We establish an Eckart-Young like theorem for dual complex matrices, and present an algorithm framework for low rank approximation of dual complex matrices via truncated SVD. The SVD of dual complex matrices also provides a basic tool for Principal Component Analysis (PCA) via these matrices. Numerical experiments are reported.

preprint2022arXiv

Normal Cones Intersection Rule and Optimality Analysis for Low-Rank Matrix Optimization with Affine Manifolds

The low-rank matrix optimization with affine manifold (rank-MOA) aims to minimize a continuously differentiable function over a low-rank set intersecting with an affine manifold. This paper is devoted to the optimality analysis for rank-MOA. As a cornerstone, the intersection rule of the Fréchet normal cone to the feasible set of the rank-MOA is established under some mild linear independence assumptions. Aided with the resulting explicit formulae of the underlying normal cone, the so-called F-stationary point and the α-stationary point of rank-MOA are investigated and the relationship with local/global minimizers are then revealed in terms of first-order optimality conditions. Furthermore, the second-order optimality analysis, including the necessary and the sufficient conditions, is proposed based on the second-order differentiation information of the model. All these results will enrich the theory of low-rank matrix optimization and give potential clues to designing efficient numerical algorithms for seeking low rank solutions. Meanwhile, two specific applications of the rank-MOA are discussed to illustrate our proposed optimality analysis.

preprint2021arXiv

Computing One-bit Compressive Sensing via Double-Sparsity Constrained Optimization

One-bit compressive sensing gains its popularity in signal processing and communications due to its low storage costs and low hardware complexity. However, it has been a challenging task to recover the signal only by exploiting the one-bit (the sign) information. In this paper, we appropriately formulate the one-bit compressive sensing into a double-sparsity constrained optimization problem. The first-order optimality conditions for this nonconvex and discontinuous problem are established via the newly introduced $τ$-stationarity, based on which, a gradient projection subspace pursuit (\texttt{GPSP}) algorithm is developed. It is proven that \texttt{GPSP} can converge globally and terminate within finite steps. Numerical experiments have demonstrated its excellent performance in terms of a high order of accuracy with a fast computational speed.

preprint2020arXiv

Spin torque gate magnetic field sensor

Spin-orbit torque provides an efficient pathway to manipulate the magnetic state and magnetization dynamics of magnetic materials, which is crucial for energy-efficient operation of a variety of spintronic devices such as magnetic memory, logic, oscillator, and neuromorphic computing. Here, we describe and experimentally demonstrate a strategy for the realization of a spin torque gate magnetic field sensor with extremely simple structure by exploiting the longitudinal field dependence of the spin torque driven magnetization switching. Unlike most magnetoresistance sensors which require a delicate magnetic bias to achieve a linear response to the external field, the spin torque gate sensor can achieve the same without any magnetic bias, which greatly simplifies the sensor structure. Furthermore, by driving the sensor using an ac current, the dc offset is automatically suppressed, which eliminates the need for a bridge or compensation circuit. We verify the concept using the newly developed WTe2/Ti/CoFeB trilayer and demonstrate that the sensor can work linearly in the range of 3-10 Oe with negligible dc offset.

preprint2020arXiv

Terahertz Emission From an Exchange-Coupled Synthetic Antiferromagnet

We report on terahertz emission from FeMnPt/Ru/FeMnPt and Pt/CoFeB/Ru/CoFeB/Pt synthetic antiferromagnet (SAF) structures upon irradiation by a femtosecond laser; the former is via the anomalous Hall effect, whereas the latter is through the inverse spin Hall effect. The antiparallel alignment of the two ferromagnetic layers leads to a terahertz emission peak amplitude that is almost double that for a corresponding single-layer or bilayer emitter with the same equivalent thickness. In addition, we demonstrate by both simulation and experiment that terahertz emission provides a powerful tool to probe the magnetization reversal processes of individual ferromagnetic layers in a SAF structure, as the terahertz signal is proportional to the vector difference of the magnetizations of the two ferromagnetic layers.

preprint2019arXiv

Terahertz emission from anomalous Hall effect in a single-layer ferromagnet

We report on terahertz emission from a single layer ferromagnet which involves the generation of backflow nonthermal charge current from the ferromagnet/dielectric interface by femtosecond laser excitation and subsequent conversion of the charge current to a transverse transient charge current via the anomalous Hall effect, thereby generating the THz radiation. The THz emission can be either enhanced or suppressed, or even the polarity can be reversed, by introducing a magnetization gradient in the thickness direction of the ferromagnet. Unlike spintronic THz emitters reported previously, it does not require additional non-magnetic layer or Rashba interface.

preprint2016arXiv

Computing The Analytic Connectivity of A Uniform Hypergraph

The analytic connectivity, proposed as a substitute of the algebraic connectivity in the setting of hypergraphs, is an important quantity in spectral hypergraph theory. The definition of the analytic connectivity for a uniform hypergraph involves a series of optimization problems (POPs) associated with the Laplacian tensor of the hypergraph with nonnegativity constraints and a sphere constraint, which poses difficulties in computation. To reduce the involved computation, properties on the algebraic connectivity are further exploited, and several important structured uniform hypergraphs are shown to attain their analytic connectivities at vertices of the minimum degrees, hence admit a relatively less computation by solving a small number of POPs. To efficiently solve each involved POP, we propose a feasible trust region algorithm ({\tt FTR}) by exploiting their special structures. The global convergence of {\tt FTR} to the second-order necessary conditions points is established, and numerical results for both small and large size examples with comparison to other existing algorithms for POPs are reported to demonstrate the efficiency of our proposed algorithm.

preprint2016arXiv

Z-tensors and complementarity problems

Tensors are multidimensional analogs of matrices. In this paper, based on degree-theoretic ideas, we study homogeneous nonlinear complementarity problems induced by tensors. By specializing this to $Z$-tensors (which are tensors with non-positive off-diagonal entries), we describe various equivalent conditions for a $Z$-tensor to have the global solvability property. We show by an example that the global solvability need not imply unique solvability and provide a sufficient and easily checkable condition for unique solvability.

preprint2015arXiv

Characterization Tensors of Balanced Incomplete Block Designs

Balanced incomplete block designs (BIBDs) have wide applications in engineering, business and sciences. In this paper, for each (v, k, λ)-BIBD, we construct a strongly symmetric k-th order v-dimensional tensor. We call such a strongly symmetric tensor the characterization tensor of that BIBD, and the absolute value tensor of the characterization tensor the signless characterization tensor of that BIBD. We study some spectral properties of such characterization tensors and signless characterization tensors. In this way, we provide a new tool to study BIBDs.

preprint2015arXiv

Completely Positive Tensors and Multi-Hypergraphs

Completely positive graphs have been employed to associate with completely positive matrices for characterizing the intrinsic zero patterns. As tensors have been widely recognized as a higher-order extension of matrices, the multi-hypergraph, regarded as a generalization of graphs, is then introduced to associate with tensors for the study of complete positivity. To describe the dependence of the corresponding zero pattern for a special type of completely positive tensors--the $\{0,1\}$ completely positive tensors, the completely positive multi-hypergraph is defined. By characterizing properties of the associated multi-hypergraph, we provide necessary and sufficient conditions for any $(0,1)$ associated tensor to be $\{0,1\}$ completely positive. Furthermore, a necessary and sufficient condition for a uniform multi-hypergraph to be completely positive multi-hypergraph is proposed as well.

preprint2015arXiv

Doubly Nonnegative Tensors, Completely Positive Tensors and Applications

The concept of double nonnegativity of matrices is generalized to doubly nonnegative tensors by means of the nonnegativity of all entries and $H$-eigenvalues. This generalization is defined for tensors of any order (even or odd), while it reduces to the class of nonnegative positive semidefinite tensors in the even order case. We show that many nonnegative structured tensors, which are positive semidefinite in the even order case, are indeed doubly nonnegative as well in the odd order case. As an important subclass of doubly nonnegative tensors, the completely positive tensors are further studied. By using dominance properties for completely positive tensors, we can easily exclude some doubly nonnegative tensors, such as the signless Laplacian tensor of a nonempty $m$-uniform hypergraph with $m\geq 3$, from the class of completely positive tensors. Properties of the doubly nonnegative tensor cone and the completely positive tensor cone are established. Their relation and difference are discussed. These show us a different phenomenon comparing to the matrix case. By employing the proposed properties, more subclasses of these two types of tensors are identified. Particularly, all positive Cauchy tensors with any order are shown to be completely positive. This gives an easily constructible subclass of completely positive tensors, which is significant for the study of completely positive tensor decomposition. A preprocessed Fan-Zhou algorithm is proposed which can efficiently verify the complete positivity of nonnegative symmetric tensors. We also give the solution analysis of tensor complementarity problems with the strongly doubly nonnegative tensor structure.

preprint2015arXiv

P-Tensors, P$_0$-Tensors, and Tensor Complementarity Problem

The concepts of P- and P$_0$-matrices are generalized to P- and P$_0$-tensors of even and odd orders via homogeneous formulae. Analog to the matrix case, our P-tensor definition encompasses many important classes of tensors such as the positive definite tensors, the nonsingular M-tensors, the nonsingular H-tensors with positive diagonal entries, the strictly diagonally dominant tensors with positive diagonal entries, etc. As even-order symmetric PSD tensors are exactly even-order symmetric P$_0$-tensors, our definition of P$_0$-tensors, to some extent, can be regarded as an extension of PSD tensors for the odd-order case. Along with the basic properties of P- and P$_0$-tensors, the relationship among P$_0$-tensors and other extensions of PSD tensors are then discussed for comparison. Many structured tensors are also shown to be P- and P$_0$-tensors. As a theoretical application, the P-tensor complementarity problem is discussed and shown to possess a nonempty and compact solution set.

preprint2015arXiv

The Sparsest Solutions to $Z$-Tensor Complementarity Problems

Finding the sparsest solutions to a tensor complementarity problem is generally NP-hard due to the nonconvexity and noncontinuity of the involved $\ell_0$ norm. In this paper, a special type of tensor complementarity problems with $Z$-tensors has been considered. Under some mild conditions, we show that to pursuit the sparsest solutions is equivalent to solving polynomial programming with a linear objective function. The involved conditions guarantee the desired exact relaxation and also allow to achieve a global optimal solution to the relaxed nonconvex polynomial programming problem. Particularly, in comparison to existing exact relaxation conditions, such as RIP-type ones, our proposed conditions are easy to verify.

preprint2014arXiv

Sparse and Low-Rank Covariance Matrices Estimation

This paper aims at achieving a simultaneously sparse and low-rank estimator from the semidefinite population covariance matrices. We first benefit from a convex optimization which develops $l_1$-norm penalty to encourage the sparsity and nuclear norm to favor the low-rank property. For the proposed estimator, we then prove that with large probability, the Frobenious norm of the estimation rate can be of order $O(\sqrt{s(\log{r})/n})$ under a mild case, where $s$ and $r$ denote the number of sparse entries and the rank of the population covariance respectively, $n$ notes the sample capacity. Finally an efficient alternating direction method of multipliers with global convergence is proposed to tackle this problem, and meantime merits of the approach are also illustrated by practicing numerical simulations.

preprint2013arXiv

New RIC Bounds via l_q-minimization with 0<q<=1 in Compressed Sensing

The restricted isometry constants (RICs) play an important role in exact recovery theory of sparse signals via l_q(0<q<=1) relaxations in compressed sensing. Recently, Cai and Zhang[6] have achieved a sharp bound δ_tk<\sqrt{1-1/t} for t>=4/3 to guarantee the exact recovery of k sparse signals through the l_1 minimization. This paper aims to establish new RICs bounds via l_q(0<q<=1) relaxation. Based on a key inequality on l_q norm, we show that (i) the exact recovery can be succeeded via l_{1/2} and l_1 minimizations if δ_tk<\sqrt{1-1/t} for any t>1, (ii)several sufficient conditions can be derived, such as for any 0<q<1/2, δ_2k<0.5547 when k>=2, for any 1/2<q<1, δ_2k<0.6782 when k>=1, (iii) the bound on δ_k is given as well for any 0<q<=1, especially for q=1/2,1, we obtain δ_k<1/3 when k(>=2) is even or δ_k<0.3203 when k(>=3) is odd.

preprint2011arXiv

The Dominant Eigenvalue of an Essentially Nonnegative Tensor

It is well known that the dominant eigenvalue of a real essentially nonnegative matrix is a convex function of its diagonal entries. This convexity is of practical importance in population biology, graph theory, demography, analytic hierarchy process and so on. In this paper, the concept of essentially nonnegativity is extended from matrices to higher order tensors, and the convexity and log convexity of dominant eigenvalues for such a class of tensors are established. Particularly, for any nonnegative tensor, the spectral radius turns out to be the dominant eigenvalue and hence possesses these convexities. Finally, an algorithm is given to calculate the dominant eigenvalue, and numerical results are reported to show the effectiveness of the proposed algorithm.

Ziyan Luo

What is connected

Connect this record

See the researcher in context

Building this map preview

18 published item(s)

Preconditioned Inexact Stochastic ADMM for Deep Model

2D+3D facial expression recognition via embedded tensor manifold regularization

Low Rank Approximation of Dual Complex Matrices

Normal Cones Intersection Rule and Optimality Analysis for Low-Rank Matrix Optimization with Affine Manifolds

Computing One-bit Compressive Sensing via Double-Sparsity Constrained Optimization

Spin torque gate magnetic field sensor

Terahertz Emission From an Exchange-Coupled Synthetic Antiferromagnet

Terahertz emission from anomalous Hall effect in a single-layer ferromagnet

Computing The Analytic Connectivity of A Uniform Hypergraph

Z-tensors and complementarity problems

Characterization Tensors of Balanced Incomplete Block Designs

Completely Positive Tensors and Multi-Hypergraphs

Doubly Nonnegative Tensors, Completely Positive Tensors and Applications

P-Tensors, P$_0$-Tensors, and Tensor Complementarity Problem

The Sparsest Solutions to $Z$-Tensor Complementarity Problems

Sparse and Low-Rank Covariance Matrices Estimation

New RIC Bounds via l_q-minimization with 0<q<=1 in Compressed Sensing

The Dominant Eigenvalue of an Essentially Nonnegative Tensor