Source author record

Jiang Hu

Jiang Hu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.PR math.ST Statistics Theory math.OC quant-ph Machine Learning

Catalog footprint

What is connected

18works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2025arXiv

Extended parameter shift rules with minimal derivative variance for parameterized quantum circuits

Parameter shift rules (PSRs) are useful methods for computing arbitrary-order derivatives of the cost function in parameterized quantum circuits. The basic idea of PSRs is to evaluate the cost function at different parameter shifts, then use specific coefficients to combine them linearly to obtain the exact derivatives. In this work, we propose an extended parameter shift rule (EPSR) which generalizes a broad range of existing PSRs and has the following two advantages. First, EPSR offers an infinite number of possible parameter shifts, allowing the selection of the optimal parameter shifts to minimize the final derivative variance and thereby obtaining the more accurate derivative estimates with limited quantum resources. Second, EPSR extends the scope of the PSRs in the sense that EPSR can handle arbitrary Hermitian operator $H$ in gate $U(x) = \exp (iHx)$ in the parameterized quantum circuits, while existing PSRs are valid only for simple Hermitian generators $H$ such as simple Pauli words. Additionally, we show that the widely used ``general PSR'', introduced by Wierichs et al. (2022), is a special case of our EPSR, and we prove that it yields globally optimal shifts for minimizing the derivative variance under the weighted-shot scheme. Finally, through numerical simulations, we demonstrate the effectiveness of EPSR and show that the usage of the optimal parameter shifts indeed leads to more accurate derivative estimates.

preprint2025arXiv

Interpolation-based coordinate descent method for parameterized quantum circuits

Parameterized quantum circuits (PQCs) are ubiquitous in the design of hybrid quantum-classical algorithms. In this work, we propose an interpolation-based coordinate descent (ICD) method to address the parameter optimization problem in PQCs. The ICD method provides a unified framework for existing structure optimization techniques such as Rotosolve, sequential minimal optimization, ExcitationSolve, and others. ICD employs interpolation to approximate the PQC cost function, effectively recovering its underlying trigonometric structure, and then performs an argmin update on a single parameter in each iteration. In contrast to previous studies on structure optimization, we determine the optimal interpolation nodes to mitigate statistical errors arising from quantum measurements. Moreover, in the common case of $r$ equidistant frequencies, we show that the optimal interpolation nodes are equidistant nodes with spacing $2π/(2r+1)$ (under constant variance assumption), and that our ICD method simultaneously minimizes the mean squared error, the condition number of the interpolation matrix, and the average variance of the approximated cost function. We perform numerical simulations and test on the MaxCut problem, the transverse field Ising model, and the XXZ model. Numerical results imply that our ICD method is more efficient than the commonly used gradient descent and random coordinate descent method.

preprint2022arXiv

A CLT for the LSS of large dimensional sample covariance matrices with unbounded dispersions

In this paper, we establish the central limit theorem (CLT) for linear spectral statistics (LSS) of large-dimensional sample covariance matrix when the population covariance matrices are not uniformly bounded, which is a nontrivial extension of the Bai-Silverstein theorem (BST) (2004). The latter has strongly stimulated the development of high-dimensional statistics, especially the application of random matrix theory to statistics. However, the assumption of uniform boundedness of the population covariance matrices is found strongly limited to the applications of BST. The aim of this paper is to remove the blockages to the applications of BST. The new CLT, allows the spiked eigenvalues to exist and tend to infinity. It is interesting to note that the roles of either spiked eigenvalues or the bulk eigenvalues or both of the two are dominating in the CLT. Moreover, the results are checked by simulation studies with various population settings. The CLT for LSS is then applied for testing the hypothesis that a covariance matrix $ \bSi $ is equal to an identity matrix. For this, the asymptotic distributions for the corrected likelihood ratio test (LRT) and Nagao's trace test (NT) under alternative are derived, and we also propose the asymptotic power of LRT and NT under certain alternatives.

preprint2022arXiv

Riemannian Natural Gradient Methods

This paper studies large-scale optimization problems on Riemannian manifolds whose objective function is a finite sum of negative log-probability losses. Such problems arise in various machine learning and signal processing applications. By introducing the notion of Fisher information matrix in the manifold setting, we propose a novel Riemannian natural gradient method, which can be viewed as a natural extension of the natural gradient method from the Euclidean setting to the manifold setting. We establish the almost-sure global convergence of our proposed method under standard assumptions. Moreover, we show that if the loss function satisfies certain convexity and smoothness conditions and the input-output map satisfies a Riemannian Jacobian stability condition, then our proposed method enjoys a local linear -- or, under the Lipschitz continuity of the Riemannian Jacobian of the input-output map, even quadratic -- rate of convergence. We then prove that the Riemannian Jacobian stability condition will be satisfied by a two-layer fully connected neural network with batch normalization with high probability, provided that the width of the network is sufficiently large. This demonstrates the practical relevance of our convergence rate result. Numerical experiments on applications arising from machine learning demonstrate the advantages of the proposed method over state-of-the-art ones.

preprint2022arXiv

Spectral Statistics of Sample Block Correlation Matrices

A fundamental concept in multivariate statistics, sample correlation matrix, is often used to infer the correlation/dependence structure among random variables, when the population mean and covariance are unknown. A natural block extension of it, {\it sample block correlation matrix}, is proposed to take on the same role, when random variables are generalized to random sub-vectors. In this paper, we establish a spectral theory of the sample block correlation matrices and apply it to group independent test and related problem, under the high-dimensional setting. More specifically, we consider a random vector of dimension $p$, consisting of $k$ sub-vectors of dimension $p_t$'s, where $p_t$'s can vary from $1$ to order $p$. Our primary goal is to investigate the dependence of the $k$ sub-vectors. We construct a random matrix model called sample block correlation matrix based on $n$ samples for this purpose. The spectral statistics of the sample block correlation matrix include the classical Wilks' statistic and Schott's statistic as special cases. It turns out that the spectral statistics do not depend on the unknown population mean and covariance. Further, under the null hypothesis that the sub-vectors are independent, the limiting behavior of the spectral statistics can be described with the aid of the Free Probability Theory. Specifically, under three different settings of possibly $n$-dependent $k$ and $p_t$'s, we show that the empirical spectral distribution of the sample block correlation matrix converges to the free Poisson binomial distribution, free Poisson distribution (Marchenko-Pastur law) and free Gaussian distribution (semicircle law), respectively. We then further derive the CLTs for the linear spectral statistics of the block correlation matrix under general setting.

preprint2022arXiv

The limiting spectral distribution of large dimensional general information-plus-noise type matrices

Let $ X_{n} $ be $ n\times N $ random complex matrices, $R_{n}$ and $T_{n}$ be non-random complex matrices with dimensions $n\times N$ and $n\times n$, respectively. We assume that the entries of $ X_{n} $ are independent and identically distributed, $ T_{n} $ are nonnegative definite Hermitian matrices and $T_{n}R_{n}R_{n}^{*}= R_{n}R_{n}^{*}T_{n} $. The general information-plus-noise type matrices are defined by $C_{n}=\frac{1}{N}T_{n}^{\frac{1}{2}} \left( R_{n} +X_{n}\right) \left(R_{n}+X_{n}\right)^{*}T_{n}^{\frac{1}{2}} $. In this paper, we establish the limiting spectral distribution of the large dimensional general information-plus-noise type matrices $C_{n}$. Specifically, we show that as $n$ and $N$ tend to infinity proportionally, the empirical distribution of the eigenvalues of $C_{n}$ converges weakly to a non-random probability distribution, which is characterized in terms of a system of equations of its Stieltjes transform.

preprint2020arXiv

Modified Pillai's trace statistics for two high-dimensional sample covariance matrices

The goal of this study was to test the equality of two covariance matrices by using modified Pillai's trace statistics under a high-dimensional framework, i.e., the dimension and sample sizes go to infinity proportionally. In this paper, we introduce two modified Pillai's trace statistics and obtain their asymptotic distributions under the null hypothesis. The benefits of the proposed statistics include the following: (1) the sample size can be smaller than the dimensions; (2) the limiting distributions of the proposed statistics are universal; and (3) we do not restrict the structure of the population covariance matrices. The theoretical results are established under mild and practical assumptions, and their properties are demonstrated numerically by simulations and a real data analysis.

preprint2020arXiv

Strong consistency of the AIC, BIC, $C_p$ and KOO methods in high-dimensional multivariate linear regression

Variable selection is essential for improving inference and interpretation in multivariate linear regression. Although a number of alternative regressor selection criteria have been suggested, the most prominent and widely used are the Akaike information criterion (AIC), Bayesian information criterion (BIC), Mallow's $C_p$, and their modifications. However, for high-dimensional data, experience has shown that the performance of these classical criteria is not always satisfactory. In the present article, we begin by presenting the necessary and sufficient conditions (NSC) for the strong consistency of the high-dimensional AIC, BIC, and $C_p$, based on which we can identify some reasons for their poor performance. Specifically, we show that under certain mild high-dimensional conditions, if the BIC is strongly consistent, then the AIC is strongly consistent, but not vice versa. This result contradicts the classical understanding. In addition, we consider some NSC for the strong consistency of the high-dimensional kick-one-out (KOO) methods introduced by Zhao et al. (1986) and Nishii et al. (1988). Furthermore, we propose two general methods based on the KOO methods and prove their strong consistency. The proposed general methods remove the penalties while simultaneously reducing the conditions for the dimensions and sizes of the regressors. A simulation study supports our consistency conclusions and shows that the convergence rates of the two proposed general KOO methods are much faster than those of the original methods.

preprint2016arXiv

A review of 20 years of naive tests of significance for high-dimensional mean vectors and covariance matrices

In this paper, we will introduce the so called naive tests and give a brief review on the newly development. Naive testing methods are easy to understand and performs robust especially when the dimension is large. In this paper, we mainly focus on reviewing some naive testing methods for the mean vectors and covariance matrices of high dimensional populations and believe this naive test idea can be wildly used in many other testing problems.

preprint2015arXiv

Convergence of the empirical spectral distribution function of Beta matrices

Let $\mathbf{B}_n=\mathbf {S}_n(\mathbf {S}_n+α_n\mathbf {T}_N)^{-1}$, where $\mathbf {S}_n$ and $\mathbf {T}_N$ are two independent sample covariance matrices with dimension $p$ and sample sizes $n$ and $N$, respectively. This is the so-called Beta matrix. In this paper, we focus on the limiting spectral distribution function and the central limit theorem of linear spectral statistics of $\mathbf {B}_n$. Especially, we do not require $\mathbf {S}_n$ or $\mathbf {T}_N$ to be invertible. Namely, we can deal with the case where $p>\max\{n,N\}$ and $p<n+N$. Therefore, our results cover many important applications which cannot be simply deduced from the corresponding results for multivariate $F$ matrices.

preprint2015arXiv

On testing the equality of high dimensional mean vectors with unequal covariance matrices

In this article, we focus on the problem of testing the equality of several high dimensional mean vectors with unequal covariance matrices. This is one of the most important problem in multivariate statistical analysis and there have been various tests proposed in the literature. Motivated by \citet{BaiS96E} and \cite{ChenQ10T}, a test statistic is introduced and the asymptomatic distributions under the null hypothesis as well as the alternative hypothesis are given. In addition, it is compared with a test statistic recently proposed by \cite{SrivastavaK13Ta}. It is shown that our test statistic performs much better especially in the large dimensional case.

preprint2015arXiv

On the semicircular law of large dimensional random quaternion matrices

It is well known that Gaussian symplectic ensemble (GSE) is defined on the space of $n\times n$ quaternion self-dual Hermitian matrices with Gaussian random elements. There is a huge body of literature regarding this kind of matrices. As a natural idea we want to get more universal results by removing the Gaussian condition. For the first step, in this paper we prove that the empirical spectral distribution of the common quaternion self-dual Hermitian matrices tends to semicircular law. The main tool to establish the universal result is given as a lemma in this paper as well.

preprint2014arXiv

Canonical correlation coefficients of high-dimensional normal vectors: finite rank case

Consider a normal vector $\mathbf{z}=(\mathbf{x}',\mathbf{y}')'$, consisting of two sub-vectors $\mathbf{x}$ and $\mathbf{y}$ with dimensions $p$ and $q$ respectively. With $n$ independent observations of $\mathbf{z}$ at hand, we study the correlation between $\mathbf{x}$ and $\mathbf{y}$, from the perspective of the Canonical Correlation Analysis, under the high-dimensional setting: both $p$ and $q$ are proportional to the sample size $n$. In this paper, we focus on the case that $Σ_{\mathbf{x}\mathbf{y}}$ is of finite rank $k$, i.e. there are $k$ nonzero canonical correlation coefficients, whose squares are denoted by $r_1\geq\cdots\geq r_k>0$. Under the additional assumptions $(p+q)/n\to y\in (0,1)$ and $p/q\not\to 1$, we study the sample counterparts of $r_i,i=1,\ldots,k$, i.e. the largest k eigenvalues of the sample canonical correlation matrix $S_{\mathbf{x}\mathbf{x}}^{-1}S_{\mathbf{x}\mathbf{y}}S_{\mathbf{y}\mathbf{y}}^{-1}S_{\mathbf{y}\mathbf{x}}$, namely $λ_1\geq\cdots\geq λ_k$. We show that there exists a threshold $r_c\in(0,1)$, such that for each $i\in\{1,\ldots,k\}$, when $r_i\leq r_c$, $λ_i$ converges almost surely to the right edge of the limiting spectral distribution of the sample canonical correlation matrix, denoted by $d_r$. When $r_i>r_c$, $λ_i$ possesses an almost sure limit in $(d_r,1]$, from which we can recover $r_i$ in turn, thus provide an estimate of the latter in the high-dimensional scenario.

preprint2014arXiv

On the limit of extreme eigenvalues of large dimensional random quaternion matrices

Since E.P.Wigner (1958) established his famous semicircle law, lots of attention has been paid by physicists, probabilists and statisticians to study the asymptotic properties of the largest eigenvalues for random matrices. Bai and Yin (1988) obtained the necessary and sufficient conditions for the strong convergence of the extreme eigenvalues of a Wigner matrix. In this paper, we consider the case of quaternion self-dual Hermitian matrices. We prove the necessary and sufficient conditions for the strong convergence of extreme eigenvalues of quaternion self-dual Hermitian matrices corresponding to the Wigner case.

preprint2014arXiv

Test of Independence for High-dimensional Random Vectors Based on Block Correlation Matrices

In this paper, we are concerned with the independence test for $k$ high-dimensional sub-vectors of a normal vector, with fixed positive integer $k$. A natural high-dimensional extension of the classical sample correlation matrix, namely block correlation matrix, is raised for this purpose. We then construct the so-called Schott type statistic as our test statistic, which turns out to be a particular linear spectral statistic of the block correlation matrix. Interestingly, the limiting behavior of the Schott type statistic can be figured out with the aid of the Free Probability Theory and the Random Matrix Theory. Specifically, we will bring the so-called real second order freeness for Haar distributed orthogonal matrices, derived in \cite{MP2013}, into the framework of this high-dimensional testing problem. Our test does not require the sample size to be larger than the total or any partial sum of the dimensions of the $k$ sub-vectors. Simulated results show the effect of the Schott type statistic, in contrast to those of the statistics proposed in \cite{JY2013} and \cite{JBZ2013}, is satisfactory. Real data analysis is also used to illustrate our method.

preprint2013arXiv

Convergence of Empirical Spectral Distributions of Large Dimensional Quaternion Sample Covariance Matrices

In this paper we establish the limit of the empirical spectral distribution of quaternion sample covariance matrices. Suppose $\mathbf X_n = ({x_{jk}^{(n)}})_{p\times n}$ is a quaternion random matrix. For each $n$, the entries $\{x_{ij}^{(n)}\}$ are independent random quaternion variables with a common mean $μ$ and variance $σ^2>0$. It is shown that the empirical spectral distribution of the quaternion sample covariance matrix $\mathbf S_n=n^{-1}\mathbf X_n\mathbf X_n^*$ converges to the M-P law as $p\to\infty$, $n\to\infty$ and $p/n\to y\in(0,+\infty)$.

preprint2013arXiv

Strong representation of weak convergence

Skorokhod's representation theorem states that if on a Polish space, there is defined a weakly convergent sequence of probability measures $μ_n\stackrel{w}\toμ_0,$ as $n\to \infty$, then there exist a probability space $(Ω, \mathscr F, P)$ and a sequence of random elements $X_n$ such that $X_n\to X$ almost surely and $X_n$ has the distribution function $μ_n$, $n=0,1,2,\cdots$. In this paper, we shall extend the Skorokhod representation theorem to the case where if there are a sequence of separable metric spaces $S_n$, a sequence of probability measures $μ_n$ and a sequence of measurable mappings $φ_n$ such that $μ_nφ_n^{-1}\stackrel {w}\toμ_0$, then there exist a probability space $(Ω,\mathscr F,P)$ and $S_n$-valued random elements $X_n$ defined on $Ω$, with distribution $μ_n$ and such that $φ_n(X_n)\to X_0$ almost surely. In addition, we present several applications of our result including some results in random matrix theory, while the original Skorokhod representation theorem is not applicable.

preprint2011arXiv

A Note on Rate of Convergence in Probability to Semicircular Law

In the present paper, we prove that under the assumption of the finite sixth moment for elements of a Wigner matrix, the convergence rate of its empirical spectral distribution to the Wigner semicircular law in probability is $O(n^{-1/2})$ when the dimension $n$ tends to infinity.

Jiang Hu

What is connected

Connect this record

See the researcher in context

Building this map preview

18 published item(s)

Extended parameter shift rules with minimal derivative variance for parameterized quantum circuits

Interpolation-based coordinate descent method for parameterized quantum circuits

A CLT for the LSS of large dimensional sample covariance matrices with unbounded dispersions

Riemannian Natural Gradient Methods

Spectral Statistics of Sample Block Correlation Matrices

The limiting spectral distribution of large dimensional general information-plus-noise type matrices

Modified Pillai's trace statistics for two high-dimensional sample covariance matrices

Strong consistency of the AIC, BIC, $C_p$ and KOO methods in high-dimensional multivariate linear regression

A review of 20 years of naive tests of significance for high-dimensional mean vectors and covariance matrices

Convergence of the empirical spectral distribution function of Beta matrices

On testing the equality of high dimensional mean vectors with unequal covariance matrices

On the semicircular law of large dimensional random quaternion matrices

Canonical correlation coefficients of high-dimensional normal vectors: finite rank case

On the limit of extreme eigenvalues of large dimensional random quaternion matrices

Test of Independence for High-dimensional Random Vectors Based on Block Correlation Matrices

Convergence of Empirical Spectral Distributions of Large Dimensional Quaternion Sample Covariance Matrices

Strong representation of weak convergence

A Note on Rate of Convergence in Probability to Semicircular Law