Source author record

Yongcheng Qi

Yongcheng Qi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Catalog footprint

What is connected

10 works
4 topics
4 close collaborators

Published work

10 published items

preprint, 2022, arXiv

Empirical likelihood method for complete independence test on high dimensional data

Given a random sample of size $n$ from a $p$ dimensional random vector, where both $n$ and $p$ are large, we are interested in testing whether the $p$ components of the random vector are mutually independent. This is the so-called complete independence test. In the multivariate normal case, it is equivalent to testing whether the correlation matrix is an identity matrix. In this paper, we propose a one-sided empirical likelihood method for the complete independence test for multivariate normal data based on squared sample correlation coefficients. The limiting distribution for our one-sided empirical likelihood test statistic is proved to be $Z^2I(Z>0)$ when both $n$ and $p$ tend to infinity, where $Z$ is a standard normal random variable. In order to improve the power of the empirical likelihood test statistic, we also introduce a rescaled empirical likelihood test statistic. We carry out an extensive simulation study to compare the performance of the rescaled empirical likelihood method and two other statistics which are related to the sum of squared sample correlation coefficients.

preprint, 2022, arXiv

Limiting distributions of the likelihood ratio test statistics for independence of normal random vectors

Consider the likelihood ratio test (LRT) statistics for the independence of sub-vectors from a $p$-variate normal random vector. We derive the limiting distributions of the LRT statistics based on a random sample of size $n$. It is well known that the limit is a chi-square distribution when the dimension of the data or the number of parameters is fixed. In a recent work by Qi, Wang and Zhang (Ann Inst Stat Math (2019) 71: 911--946), it was shown that the LRT statistics are asymptotically normal under the condition that the lengths of the normal random sub-vectors are relatively balanced if the dimension $p$ goes to infinity with the sample size $n$. In this paper, we investigate the limiting distributions of the LRT statistic under general conditions. We find all types of limiting distributions and obtain necessary and sufficient conditions for the LRT statistic to converge to a normal distribution when $p$ goes to infinity. We also investigate the limiting distribution of the adjusted LRT statistic proposed in Qi, Wang and Zhang (2019). Moreover, we present simulation results to compare the performance of the classical chi-square approximation, the normal and non-normal approximations to the LRT statistics, the chi-square approximation to the adjusted test statistic, and some other test statistics.

preprint, 2022, arXiv

Spectral Radii of Products of Random Rectangular Matrices

We consider $m$ independent random rectangular matrices whose entries are independent and identically distributed standard complex Gaussian random variables. Assume the product of the $m$ rectangular matrices is an $n$ by $n$ square matrix. The maximum absolute value of the $n$ eigenvalues of the product matrix is called the spectral radius. In this paper, we study the limiting spectral radii of the product when $m$ changes with $n$ and can even diverge. We give a complete description of the limiting distribution of the spectral radius. Our results reduce to those in Jiang and Qi [26] when the rectangular matrices are square ones.

preprint, 2021, arXiv

Pearson's goodness-of-fit tests for sparse distributions

Pearson's chi-squared test is widely used to test the goodness of fit between categorical data and a given discrete distribution function. When the number of sets of the categorical data, say $k$, is a fixed integer, Pearson's chi-squared test statistic converges in distribution to a chi-squared distribution with $k-1$ degrees of freedom when the sample size $n$ goes to infinity. In real applications, the number $k$ often changes with $n$ and may even be much larger than $n$. Using martingale techniques, we prove that Pearson's chi-squared test statistic converges to the normal distribution under quite general conditions. We also propose a new test statistic which, based on our simulation study, is more powerful than the chi-squared test statistic. A real application to lottery data is provided to illustrate our methodology.
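As a rough illustration of the statistic this abstract discusses, the sketch below computes Pearson's chi-squared statistic for a vector of cell counts and then centers and scales it by the mean $k-1$ and variance $2(k-1)$ of a chi-squared distribution with $k-1$ degrees of freedom. That normalization is an assumption made here for illustration; the abstract does not state the normalization used in the paper. The counts and null probabilities are hypothetical.

```python
import math

def pearson_chi2(counts, probs):
    """Pearson's statistic: sum over cells of (observed - expected)^2 / expected."""
    n = sum(counts)
    return sum((o - n * p) ** 2 / (n * p) for o, p in zip(counts, probs))

def standardized(stat, k):
    """Center and scale by the chi-squared(k-1) mean and variance.

    Assumption for illustration only; not taken from the paper.
    """
    return (stat - (k - 1)) / math.sqrt(2 * (k - 1))

counts = [18, 22, 20, 25, 15]  # hypothetical observed counts, n = 100
probs = [0.2] * 5              # hypothetical null distribution, k = 5

stat = pearson_chi2(counts, probs)
z = standardized(stat, len(counts))
print(stat, z)
```

With a fixed, small $k$ as in this toy example, the statistic would normally be compared with chi-squared quantiles; the standardized value is only meaningful as a normal approximation when $k$ is large.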

preprint, 2020, arXiv

Limiting Spectral Radii of Circular Unitary Matrices under Light Truncation

Consider a truncated circular unitary matrix which is a $p_n$ by $p_n$ submatrix of an $n$ by $n$ circular unitary matrix after deleting the last $n-p_n$ columns and rows. Jiang and Qi (2017) and Gui and Qi (2018) study the limiting distributions of the maximum absolute value of the eigenvalues (known as the spectral radius) of the truncated matrix. Some limiting distributions for the spectral radius of the truncated circular unitary matrix have been obtained under the following conditions: (1) $p_n/n$ is bounded away from $0$ and $1$; (2) $p_n\to\infty$ and $p_n/n\to 0$ as $n\to\infty$; (3) $(n-p_n)/n\to 0$ and $(n-p_n)/(\log n)^3\to\infty$ as $n\to\infty$; (4) $n-p_n\to\infty$ and $(n-p_n)/\log n\to 0$ as $n\to\infty$; and (5) $n-p_n=k\ge 1$ is a fixed integer. The spectral radius converges in distribution to the Gumbel distribution under the first four conditions and to a reversed Weibull distribution under the fifth condition. Apparently, the conditions above do not cover the case when $n-p_n$ is of order between $\log n$ and $(\log n)^3$. In this paper, we prove that the spectral radius converges in distribution to the Gumbel distribution in this case as well, as conjectured by Gui and Qi (2018).

preprint, 2014, arXiv

Spectral Radii of Large Non-Hermitian Random Matrices

By using the independence structure of points following a determinantal point process, we study the radii of the spherical ensemble, the truncation of the circular unitary ensemble and the product ensemble with parameters $n$ and $k$. The limiting distributions of the three radii are obtained. They are not the Tracy-Widom distribution. In particular, for the product ensemble, we show that the limiting distribution has a transition phenomenon: when $k/n \to 0$, $k/n \to a \in (0,\infty)$ and $k/n \to \infty$, the limiting distribution is the Gumbel distribution, a new distribution $\mu$ and the logarithmic normal distribution, respectively. The cumulative distribution function (cdf) of $\mu$ is the infinite product of some normal distribution functions. Another new distribution $\nu$ is also obtained for the spherical ensemble such that the cdf of $\nu$ is the infinite product of the cdfs of some Poisson-distributed random variables.

preprint, 2014, arXiv

Test for a Mean Vector with Fixed or Divergent Dimension

Testing whether a mean vector with a fixed dimension has a specified value has a long history. Some well-known tests include the Hotelling $T^2$-test and the empirical likelihood ratio test proposed by Owen [Biometrika 75 (1988) 237-249; Ann. Statist. 18 (1990) 90-120]. Recently, the Hotelling $T^2$-test has been modified to work for a high-dimensional mean, and the empirical likelihood method for a mean has been shown to be valid when the dimension of the mean vector goes to infinity. However, the asymptotic distributions of these tests depend on whether the dimension of the mean vector is fixed or goes to infinity. In this paper, we propose to split the sample into two parts and then to apply the empirical likelihood method to two equations instead of $d$ equations, where $d$ is the dimension of the underlying random vector. The asymptotic distribution of the new test is independent of the dimension of the mean vector. A simulation study shows that the new test has a very stable size with respect to the dimension of the mean vector, and is much more powerful than the modified Hotelling $T^2$-test.

preprint, 2012, arXiv

A Characterization of a New Type of Strong Law of Large Numbers

By applying results obtained from the new versions of the classical Levy, Ottaviani, and Hoffmann-Jorgensen (1974) inequalities proved by Li and Rosalsky (2013) and by using techniques developed by Hechner and Heinkel (2010), we provide a characterization of a new type of strong law of large numbers for independent and identically distributed real-valued random variables. Versions of this strong law of large numbers are also presented in a Banach space setting.

preprint, 2010, arXiv

A Refinement of the Kolmogorov-Marcinkiewicz-Zygmund Strong Law of Large Numbers

For the partial sums formed from a sequence of i.i.d. random variables having a finite absolute p'th moment for some p in (0,2), we extend the recent and striking discovery of Hechner and Heinkel (Journal of Theoretical Probability (2010)) concerning "complete moment convergence" to the two cases 0<p<1 and p=1. Moreover, for 0<p<2, we obtain "almost sure convergence" analogues of these "complete moment convergence" results and these "almost sure convergence" analogues may be regarded as being a refinement of the celebrated Kolmogorov-Marcinkiewicz-Zygmund strong law of large numbers. Versions of the above results in a Banach space setting are also presented.

preprint, 2010, arXiv

On Jiang's asymptotic distribution of the largest entry of a sample correlation matrix

Let $\{X, X_{k,i}; i \geq 1, k \geq 1\}$ be a double array of nondegenerate i.i.d. random variables and let $\{p_{n}; n \geq 1\}$ be a sequence of positive integers such that $n/p_{n}$ is bounded away from $0$ and $\infty$. This paper is devoted to the solution to an open problem posed in Li, Liu, and Rosalsky (2010) on the asymptotic distribution of the largest entry $L_{n} = \max_{1 \leq i < j \leq p_{n}} \left| \hat{\rho}^{(n)}_{i,j} \right|$ of the sample correlation matrix $\mathbf{\Gamma}_{n} = \left( \hat{\rho}_{i,j}^{(n)} \right)_{1 \leq i, j \leq p_{n}}$ where $\hat{\rho}^{(n)}_{i,j}$ denotes the Pearson correlation coefficient between $(X_{1, i},..., X_{n,i})'$ and $(X_{1, j},..., X_{n,j})'$. We show under the assumption $\mathbb{E}X^{2} < \infty$ that the following three statements are equivalent: \begin{align*} & {\bf (1)} \quad \lim_{n \to \infty} n^{2} \int_{(n \log n)^{1/4}}^{\infty} \left( F^{n-1}(x) - F^{n-1}\left(\frac{\sqrt{n \log n}}{x} \right) \right) dF(x) = 0, \\ & {\bf (2)} \quad \left( \frac{n}{\log n} \right)^{1/2} L_{n} \stackrel{\mathbb{P}}{\rightarrow} 2, \\ & {\bf (3)} \quad \lim_{n \rightarrow \infty} \mathbb{P} \left( n L_{n}^{2} - a_{n} \leq t \right) = \exp \left\{ - \frac{1}{\sqrt{8 \pi}} e^{-t/2} \right\}, \quad -\infty < t < \infty \end{align*} where $F(x) = \mathbb{P}(|X| \leq x), x \geq 0$ and $a_{n} = 4 \log p_{n} - \log \log p_{n}$, $n \geq 2$. To establish this result, we present six interesting new lemmas which may be beneficial to the further study of the sample correlation matrix.
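To make the quantities in this abstract concrete, here is a minimal sketch that computes the largest off-diagonal absolute sample correlation $L_n$ and the centered statistic $n L_n^2 - a_n$ with $a_n = 4 \log p_n - \log \log p_n$, as defined above. The function names and the tiny data set are hypothetical and chosen only for illustration; the sketch says nothing about when the limit in statement (3) applies.

```python
import math

def pearson_corr(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def largest_entry(columns):
    """L_n: maximum absolute correlation over all pairs of distinct columns."""
    p = len(columns)
    return max(abs(pearson_corr(columns[i], columns[j]))
               for i in range(p) for j in range(i + 1, p))

def centered_statistic(columns):
    """n * L_n^2 - a_n, with a_n = 4 log p - log log p (requires p >= 2)."""
    n, p = len(columns[0]), len(columns)
    a_n = 4 * math.log(p) - math.log(math.log(p))
    return n * largest_entry(columns) ** 2 - a_n

# Hypothetical data: p = 3 columns, each holding n = 4 observations.
columns = [[1, 2, 3, 4], [2, 4, 6, 9], [4, 3, 2, 1]]
l_n = largest_entry(columns)
print(l_n, centered_statistic(columns))
```

The first and third columns are exact linear reversals of each other, so this toy input has a perfectly correlated pair; real use would involve large $n$ and $p_n$ of comparable size, as the abstract assumes.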