Researcher profile

Zhidong Bai

Zhidong Bai contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2022arXiv

A CLT for the LSS of large dimensional sample covariance matrices with unbounded dispersions

In this paper, we establish the central limit theorem (CLT) for linear spectral statistics (LSS) of large-dimensional sample covariance matrix when the population covariance matrices are not uniformly bounded, which is a nontrivial extension of the Bai-Silverstein theorem (BST) (2004). The latter has strongly stimulated the development of high-dimensional statistics, especially the application of random matrix theory to statistics. However, the assumption of uniform boundedness of the population covariance matrices is found strongly limited to the applications of BST. The aim of this paper is to remove the blockages to the applications of BST. The new CLT, allows the spiked eigenvalues to exist and tend to infinity. It is interesting to note that the roles of either spiked eigenvalues or the bulk eigenvalues or both of the two are dominating in the CLT. Moreover, the results are checked by simulation studies with various population settings. The CLT for LSS is then applied for testing the hypothesis that a covariance matrix $ \bSi $ is equal to an identity matrix. For this, the asymptotic distributions for the corrected likelihood ratio test (LRT) and Nagao's trace test (NT) under alternative are derived, and we also propose the asymptotic power of LRT and NT under certain alternatives.

preprint2022arXiv

Invariance principle and CLT for the spiked eigenvalues of large-dimensional Fisher matrices and applications

This paper aims to derive asymptotical distributions of the spiked eigenvalues of the large-dimensional spiked Fisher matrices without Gaussian assumption and the restrictive assumptions on covariance matrices. We first establish invariance principle for the spiked eigenvalues of the Fisher matrix. That is, we show that the limiting distributions of the spiked eigenvalues are invariant over a large class of population distributions satisfying certain conditions. Using the invariance principle, we further established a central limit theorem (CLT) for the spiked eigenvalues. As some interesting applications, we use the CLT to derive the power functions of Roy Maximum root test for linear hypothesis in linear models and the test in signal detection. We conduct some Monte Carlo simulation to compare the proposed test with existing ones.

preprint2022arXiv

The limiting spectral distribution of large dimensional general information-plus-noise type matrices

Let $ X_{n} $ be $ n\times N $ random complex matrices, $R_{n}$ and $T_{n}$ be non-random complex matrices with dimensions $n\times N$ and $n\times n$, respectively. We assume that the entries of $ X_{n} $ are independent and identically distributed, $ T_{n} $ are nonnegative definite Hermitian matrices and $T_{n}R_{n}R_{n}^{*}= R_{n}R_{n}^{*}T_{n} $. The general information-plus-noise type matrices are defined by $C_{n}=\frac{1}{N}T_{n}^{\frac{1}{2}} \left( R_{n} +X_{n}\right) \left(R_{n}+X_{n}\right)^{*}T_{n}^{\frac{1}{2}} $. In this paper, we establish the limiting spectral distribution of the large dimensional general information-plus-noise type matrices $C_{n}$. Specifically, we show that as $n$ and $N$ tend to infinity proportionally, the empirical distribution of the eigenvalues of $C_{n}$ converges weakly to a non-random probability distribution, which is characterized in terms of a system of equations of its Stieltjes transform.

preprint2020arXiv

Modified Pillai's trace statistics for two high-dimensional sample covariance matrices

The goal of this study was to test the equality of two covariance matrices by using modified Pillai's trace statistics under a high-dimensional framework, i.e., the dimension and sample sizes go to infinity proportionally. In this paper, we introduce two modified Pillai's trace statistics and obtain their asymptotic distributions under the null hypothesis. The benefits of the proposed statistics include the following: (1) the sample size can be smaller than the dimensions; (2) the limiting distributions of the proposed statistics are universal; and (3) we do not restrict the structure of the population covariance matrices. The theoretical results are established under mild and practical assumptions, and their properties are demonstrated numerically by simulations and a real data analysis.

preprint2020arXiv

Strong consistency of the AIC, BIC, $C_p$ and KOO methods in high-dimensional multivariate linear regression

Variable selection is essential for improving inference and interpretation in multivariate linear regression. Although a number of alternative regressor selection criteria have been suggested, the most prominent and widely used are the Akaike information criterion (AIC), Bayesian information criterion (BIC), Mallow's $C_p$, and their modifications. However, for high-dimensional data, experience has shown that the performance of these classical criteria is not always satisfactory. In the present article, we begin by presenting the necessary and sufficient conditions (NSC) for the strong consistency of the high-dimensional AIC, BIC, and $C_p$, based on which we can identify some reasons for their poor performance. Specifically, we show that under certain mild high-dimensional conditions, if the BIC is strongly consistent, then the AIC is strongly consistent, but not vice versa. This result contradicts the classical understanding. In addition, we consider some NSC for the strong consistency of the high-dimensional kick-one-out (KOO) methods introduced by Zhao et al. (1986) and Nishii et al. (1988). Furthermore, we propose two general methods based on the KOO methods and prove their strong consistency. The proposed general methods remove the penalties while simultaneously reducing the conditions for the dimensions and sizes of the regressors. A simulation study supports our consistency conclusions and shows that the convergence rates of the two proposed general KOO methods are much faster than those of the original methods.

preprint2010arXiv

Functional CLT for sample covariance matrices

Using Bernstein polynomial approximations, we prove the central limit theorem for linear spectral statistics of sample covariance matrices, indexed by a set of functions with continuous fourth order derivatives on an open interval including $[(1-\sqrt{y})^2,(1+\sqrt{y})^2]$, the support of the Marucenko--Pastur law. We also derive the explicit expressions for asymptotic mean and covariance functions.