Source author record

Z. D. Bai

Z. D. Bai appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

3works
4topics
4close collaborators

Actions

Connect this record

Log in to claim

Research graph

See the researcher in context

Open full explorer

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2015arXiv

Strong limit of the extreme eigenvalues of a symmetrized auto-cross covariance matrix

The auto-cross covariance matrix is defined as \[\mathbf{M}_n=\frac{1} {2T}\sum_{j=1}^T\bigl(\mathbf{e}_j\mathbf{e}_{j+τ}^*+\mathbf{e}_{j+ τ}\mathbf{e}_j^*\bigr),\] where $\mathbf{e}_j$'s are $n$-dimensional vectors of independent standard complex components with a common mean 0, variance $σ^2$, and uniformly bounded $2+η$th moments and $τ$ is the lag. Jin et al. [Ann. Appl. Probab. 24 (2014) 1199-1225] has proved that the LSD of $\mathbf{M}_n$ exists uniquely and nonrandomly, and independent of $τ$ for all $τ\ge 1$. And in addition they gave an analytic expression of the LSD. As a continuation of Jin et al. [Ann. Appl. Probab. 24 (2014) 1199-1225], this paper proved that under the condition of uniformly bounded fourth moments, in any closed interval outside the support of the LSD, with probability 1 there will be no eigenvalues of $\mathbf{M}_n$ for all large $n$. As a consequence of the main theorem, the limits of the largest and smallest eigenvalue of $\mathbf{M}_n$ are also obtained.

preprint2014arXiv

Substitution principle for CLT of linear spectral statistics of high-dimensional sample covariance matrices with applications to hypothesis testing

Sample covariance matrices are widely used in multivariate statistical analysis. The central limit theorems (CLT's) for linear spectral statistics of high-dimensional non-centered sample covariance matrices have received considerable attention in random matrix theory and have been applied to many high-dimensional statistical problems. However, known population mean vectors are assumed for non-centered sample covariance matrices, some of which even assume Gaussian-like moment conditions. In fact, there are still another two most frequently used sample covariance matrices: the MLE (by subtracting the sample mean vector from each sample vector) and the unbiased sample covariance matrix (by changing the denominator $n$ as $N=n-1$ in the MLE) without depending on unknown population mean vectors. In this paper, we not only establish new CLT's for non-centered sample covariance matrices without Gaussian-like moment conditions but also characterize the non-negligible differences among the CLT's for the three classes of high-dimensional sample covariance matrices by establishing a {\em substitution principle}: substitute the {\em adjusted} sample size $N=n-1$ for the actual sample size $n$ in the major centering term of the new CLT's so as to obtain the CLT of the unbiased sample covariance matrices. Moreover, it is found that the difference between the CLT's for the MLE and unbiased sample covariance matrix is non-negligible in the major centering term although the two sample covariance matrices only have differences $n$ and $n-1$ on the dominator. The new results are applied to two testing problems for high-dimensional data.

preprint2011arXiv

Asymptotic properties of eigenmatrices of a large sample covariance matrix

Let $S_n=\frac{1}{n}X_nX_n^*$ where $X_n=\{X_{ij}\}$ is a $p\times n$ matrix with i.i.d. complex standardized entries having finite fourth moments. Let $Y_n(\mathbf {t}_1,\mathbf {t}_2,σ)=\sqrt{p}({\mathbf {x}}_n(\mathbf {t}_1)^*(S_n+σI)^{-1}{\mathbf {x}}_n(\mathbf {t}_2)-{\mathbf {x}}_n(\mathbf {t}_1)^*{\mathbf {x}}_n(\mathbf {t}_2)m_n(σ))$ in which $σ>0$ and $m_n(σ)=\int\frac{dF_{y_n}(x)}{x+σ}$ where $F_{y_n}(x)$ is the Marčenko--Pastur law with parameter $y_n=p/n$; which converges to a positive constant as $n\to\infty$, and ${\mathbf {x}}_n(\mathbf {t}_1)$ and ${\mathbf {x}}_n(\mathbf {t}_2)$ are unit vectors in ${\Bbb{C}}^p$, having indices $\mathbf {t}_1$ and $\mathbf {t}_2$, ranging in a compact subset of a finite-dimensional Euclidean space. In this paper, we prove that the sequence $Y_n(\mathbf {t}_1,\mathbf {t}_2,σ)$ converges weakly to a $(2m+1)$-dimensional Gaussian process. This result provides further evidence in support of the conjecture that the distribution of the eigenmatrix of $S_n$ is asymptotically close to that of a Haar-distributed unitary matrix.