Source author record

Zeng Li

Zeng Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Methodology, Applications, Computer Vision, math.ST, Statistics Theory, Artificial Intelligence, eess.IV, Machine Learning

Catalog footprint

What is connected

8 works

8 topics

4 close collaborators


Preprint, 2026, arXiv

Masked Next-Scale Prediction for Self-supervised Scene Text Recognition

Scene Text Recognition requires modeling visual structures that evolve from coarse layouts to fine-grained character strokes. Training such models relies on large amounts of annotated data. Recent self-supervised approaches, such as Masked Image Modeling (MIM), alleviate this dependency by leveraging large-scale unlabeled data. Yet most existing MIM methods operate at a single spatial scale and fail to capture the hierarchical nature of scene text. In this work, we introduce Masked Next-Scale Prediction (MNSP), a unified self-supervised framework designed to explicitly model cross-scale structural evolution. The framework incorporates Next-Scale Prediction (NSP), which learns hierarchical representations by predicting higher-resolution features from lower-resolution contexts. Naive scale prediction, however, tends to produce spatially diffuse attention, directing the model toward background regions rather than textual structures. MNSP resolves this limitation by jointly learning cross-scale prediction and masked image reconstruction. NSP captures global layout priors across resolutions, while masked reconstruction imposes strong local constraints that guide attention toward informative text regions. A Multi-scale Linguistic Alignment module further maintains semantic consistency across different resolutions. Extensive experiments demonstrate that MNSP achieves state-of-the-art performance, reaching 86.2% average accuracy on the challenging Union14M benchmark and 96.7% across six standard datasets. Additional analyses show that our method improves robustness under extreme scale and layout variations. Code is available at https://github.com/CzhczhcHczh/MNSP

Preprint, 2022, arXiv

On eigenvalues of a high-dimensional Kendall's rank correlation matrix with dependence

This paper investigates the limiting spectral distribution of a high-dimensional Kendall's rank correlation matrix. The underlying population is allowed to have a general dependence structure. The resulting limit no longer follows the generalized Marčenko-Pastur law and is new; it is the first result on rank correlation matrices with dependence. As an application, we study Kendall's rank correlation matrix for multivariate normal distributions with a general covariance matrix. From these results, we gain further insight into Kendall's rank correlation matrix and its connections with the sample covariance and correlation matrices.
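For orientation, the object studied in this abstract can be computed numerically. The sketch below assumes the standard pairwise-sign definition of Kendall's rank correlation matrix, $K = \binom{n}{2}^{-1}\sum_{i<j}\mathrm{sign}(x_i-x_j)\,\mathrm{sign}(x_i-x_j)^t$, and uses independent standard normal data purely as a placeholder; the function name `kendall_tau_matrix` and the sizes `n`, `p` are illustrative choices, not taken from the paper.

```python
import numpy as np

def kendall_tau_matrix(X):
    """Kendall's rank correlation matrix of an (n, p) data array X."""
    n, p = X.shape
    i, j = np.triu_indices(n, k=1)      # all index pairs with i < j
    S = np.sign(X[i] - X[j])            # one sign vector per pair, shape (n(n-1)/2, p)
    return S.T @ S / len(i)             # average of the outer products of sign vectors

# Placeholder data: independent N(0, 1) entries (an assumption for illustration).
rng = np.random.default_rng(0)
n, p = 200, 100
X = rng.standard_normal((n, p))

K = kendall_tau_matrix(X)
eigvals = np.linalg.eigvalsh(K)         # empirical spectrum of the p x p matrix K
```

A histogram of `eigvals` gives the empirical spectral distribution whose large-$n$, large-$p$ limit the paper characterizes; replacing the placeholder data with dependent columns is the setting the abstract addresses.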

Preprint, 2022, arXiv

On singular values of large dimensional lag-tau sample autocorrelation matrices

We study the limiting behavior of the singular values of a lag-$τ$ sample auto-correlation matrix $\mathbf{R}_τ^ε$ of the error term $ε$ in the high-dimensional factor model. We establish the limiting spectral distribution (LSD), which characterizes the global spectrum of $\mathbf{R}_τ^ε$, and derive the limit of its largest singular value. All the asymptotic results are derived under the high-dimensional asymptotic regime where the data dimension and sample size go to infinity proportionally. Under mild assumptions, we show that the LSD of $\mathbf{R}_τ^ε$ is the same as that of the lag-$τ$ sample auto-covariance matrix. Based on this asymptotic equivalence, we additionally show that the largest singular value of $\mathbf{R}_τ^ε$ converges almost surely to the right end point of the support of its LSD. Our results take a first step toward identifying the number of factors in factor analysis using lag-$τ$ sample auto-correlation matrices. Our theoretical results are also fully supported by numerical experiments.

Preprint, 2022, arXiv

Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation

We observe that human poses exhibit strong group-wise structural correlation and spatial coupling between keypoints due to the biological constraints of different body parts. This group-wise structural correlation can be explored to improve the accuracy and robustness of human pose estimation. In this work, we develop a self-constrained prediction-verification network to characterize and learn the structural correlation between keypoints during training. During the inference stage, the feedback information from the verification network allows us to perform further optimization of pose prediction, which significantly improves the performance of human pose estimation. Specifically, we partition the keypoints into groups according to the biological structure of the human body. Within each group, the keypoints are further partitioned into two subsets: high-confidence base keypoints and low-confidence terminal keypoints. We develop a self-constrained prediction-verification network to perform forward and backward predictions between these keypoint subsets. One fundamental challenge in pose estimation, as well as in generic prediction tasks, is that there is no mechanism to verify whether the obtained pose estimation or prediction results are accurate, since the ground truth is not available. Once successfully learned, the verification network serves as an accuracy verification module for the forward pose prediction. During the inference stage, it can be used to guide the local optimization of the pose estimation results of low-confidence keypoints, with the self-constrained loss on high-confidence keypoints as the objective function. Our extensive experimental results on the MS COCO and CrowdPose benchmark datasets demonstrate that the proposed method can significantly improve the pose estimation results.

Preprint, 2021, arXiv

Sparse Linear Spectral Unmixing of Hyperspectral images using Expectation-Propagation

This paper presents a novel Bayesian approach for hyperspectral image unmixing. The observed pixels are modeled by a linear combination of material signatures weighted by their corresponding abundances. A spike-and-slab abundance prior is adopted to promote sparse mixtures, and an Ising prior model is used to capture spatial correlation of the mixture support across pixels. We approximate the posterior distribution of the abundances using the expectation-propagation (EP) method. We show that it can significantly reduce the computational complexity of the unmixing stage while also providing uncertainty measures, compared to the expensive Monte Carlo strategies traditionally considered for uncertainty quantification. Moreover, many variational parameters within each EP factor can be updated in parallel, which enables mapping onto efficient algorithmic architectures based on graphics processing units (GPUs). Under the same approximate Bayesian framework, we then extend the proposed algorithm to semi-supervised unmixing, whereby the abundances are viewed as latent variables and the expectation-maximization (EM) algorithm is used to refine the endmember matrix. Experimental results on synthetic data and real hyperspectral data illustrate the benefits of the proposed framework over state-of-the-art linear unmixing methods.

Preprint, 2020, arXiv

Provable More Data Hurt in High Dimensional Least Squares Estimator

This paper investigates the finite-sample prediction risk of the high-dimensional least squares estimator. We derive the central limit theorem for the prediction risk when both the sample size and the number of features tend to infinity. Furthermore, the finite-sample distribution and the confidence interval of the prediction risk are provided. Our theoretical results demonstrate the sample-wise nonmonotonicity of the prediction risk and confirm the "more data hurt" phenomenon.
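The sample-wise nonmonotonicity described here can be illustrated with a small Monte Carlo experiment. The sketch below is not the paper's setup: it assumes isotropic Gaussian features, a fixed unit-norm coefficient vector, Gaussian noise, and the minimum-norm least squares solution (via the pseudoinverse) when the sample size `n` is below the dimension `p`. The function name `mean_prediction_risk` and all parameter values are hypothetical.

```python
import numpy as np

def mean_prediction_risk(n, p, sigma=0.5, trials=200, seed=0):
    """Monte Carlo estimate of the excess prediction risk of least squares."""
    rng = np.random.default_rng(seed)
    beta = np.ones(p) / np.sqrt(p)             # assumed true coefficients, unit norm
    risks = []
    for _ in range(trials):
        X = rng.standard_normal((n, p))        # assumed isotropic Gaussian design
        y = X @ beta + sigma * rng.standard_normal(n)
        beta_hat = np.linalg.pinv(X) @ y       # OLS if n > p, minimum-norm fit if n < p
        # For isotropic test points x, E[(x^t (beta_hat - beta))^2] = ||beta_hat - beta||^2.
        risks.append(np.sum((beta_hat - beta) ** 2))
    return float(np.mean(risks))

p = 50
risk_by_n = {n: mean_prediction_risk(n, p) for n in (25, 40, 200)}
```

Under these assumptions the estimated risk at `n = 40` exceeds the risk at `n = 25`, even though more data are used, and it falls again once `n` is well above `p`.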

Preprint, 2016, arXiv

Testing the Sphericity of a covariance matrix when the dimension is much larger than the sample size

This paper focuses on the prominent sphericity test when the dimension $p$ is much larger than the sample size $n$. The classical likelihood ratio test (LRT) is no longer applicable when $p\gg n$. Therefore, a Quasi-LRT is proposed, and the asymptotic distribution of the test statistic under the null is established when $p/n\rightarrow\infty$, $n\rightarrow\infty$. Meanwhile, John's test is found to possess a powerful dimension-proof property: it keeps exactly the same limiting distribution under the null in any $(n,p)$-asymptotic regime, i.e. $p/n\rightarrow c\in[0,\infty]$, $n\rightarrow\infty$. All asymptotic results are derived for a general population with finite fourth-order moment. Numerical experiments are carried out for comparison.
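As a rough illustration of John's test in the $p\gg n$ regime, the statistic is commonly written as $U = p^{-1}\,\mathrm{tr}\big[(S/(p^{-1}\mathrm{tr}\,S) - I_p)^2\big] = p\,\mathrm{tr}(S^2)/(\mathrm{tr}\,S)^2 - 1$. The sketch below assumes mean-zero observations and the uncentered sample covariance $S = X^tX/n$ (both are simplifying assumptions, and the paper's exact normalization may differ). It evaluates the traces through the $n\times n$ Gram matrix, which avoids forming the $p\times p$ matrix $S$. The name `john_statistic` is illustrative.

```python
import numpy as np

def john_statistic(X):
    """John's U statistic for an (n, p) array of mean-zero observations."""
    n, p = X.shape
    G = X @ X.T / n                  # n x n Gram matrix, same nonzero eigenvalues as S
    tr_S = np.trace(G)               # tr(S) = tr(G)
    tr_S2 = np.sum(G * G)            # tr(S^2) = tr(G^2) = squared Frobenius norm of G
    return p * tr_S2 / tr_S**2 - 1.0

# Placeholder spherical data with p much larger than n (sizes are arbitrary).
rng = np.random.default_rng(0)
n, p = 20, 2000
X = rng.standard_normal((n, p))
U = john_statistic(X)
```

Turning `U` into a test requires centering and scaling according to the null limiting distribution derived in the paper, which is not reproduced here.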

Preprint, 2014, arXiv

On singular value distribution of large dimensional auto-covariance matrices

Let $(\varepsilon_j)_{j\geq 0}$ be a sequence of independent $p$-dimensional random vectors and $τ\geq1$ a given integer. From a sample $\varepsilon_1,\cdots,\varepsilon_{T+τ-1},\varepsilon_{T+τ}$ of the sequence, the so-called lag-$τ$ auto-covariance matrix is $C_τ=T^{-1}\sum_{j=1}^T\varepsilon_{τ+j}\varepsilon_{j}^t$. When the dimension $p$ is large compared to the sample size $T$, this paper establishes the limit of the singular value distribution of $C_τ$ assuming that $p$ and $T$ grow to infinity proportionally and the sequence satisfies a Lindeberg condition on fourth order moments. Compared to existing asymptotic results on sample covariance matrices developed in random matrix theory, the case of an auto-covariance matrix is much more involved due to the fact that the summands are dependent and the matrix $C_τ$ is not symmetric. Several new techniques are introduced for the derivation of the main theorem.
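The definition of $C_τ$ translates directly into a few lines of array code. The sketch below stores the sample as rows of a $(T+τ)\times p$ array and uses independent standard normal entries as placeholder data; the function name `lag_autocovariance` and the sizes `p`, `T`, `tau` are illustrative assumptions, not values from the paper.

```python
import numpy as np

def lag_autocovariance(eps, tau):
    """C_tau = T^{-1} sum_{j=1}^T eps_{tau+j} eps_j^t for eps of shape (T + tau, p)."""
    T = eps.shape[0] - tau
    lead = eps[tau:]                 # rows eps_{tau+j}, j = 1..T
    lag = eps[:T]                    # rows eps_j, j = 1..T
    return lead.T @ lag / T          # p x p matrix, not symmetric in general

# Placeholder sample: independent N(0, 1) entries with p / T = 0.5.
rng = np.random.default_rng(0)
p, T, tau = 100, 200, 1
eps = rng.standard_normal((T + tau, p))

C = lag_autocovariance(eps, tau)
sing_vals = np.linalg.svd(C, compute_uv=False)   # singular values, in descending order
```

Because $C_τ$ is not symmetric, the natural spectrum to examine is that of its singular values, as in `sing_vals`, rather than its eigenvalues.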
