Source author record

Chao Gao

Chao Gao appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.ST Statistics Theory Machine Learning cond-mat.quant-gas Methodology math.OC Social and Information Networks Applications Artificial Intelligence Computation Computational Complexity cond-mat.str-el physics.ins-det physics.optics Quantitative Methods

Catalog footprint

What is connected

31works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Adaptive Confidence Intervals in Efron's Gaussian Two-Groups Model

Robust uncertainty quantification is increasingly important in modern data analysis and is often formalized under Huber's model, which allows an $\varepsilon$-fraction of arbitrary corruptions. In many experimental sciences, however, the measurement protocol is well controlled, and contamination is more plausibly introduced upstream. Motivated by this noise-oblivious nature of adversaries, we study confidence intervals for the null location parameter $θ$ in Efron's Gaussian two-groups model, where an unknown fraction $\varepsilon$ of observations have arbitrarily shifted means, but all samples share the same law of additive Gaussian measurement noise with variance $σ^2$. We characterize the minimax-optimal length among confidence intervals with a prescribed coverage level uniformly over the unknown contamination proportion and all noise-oblivious adversaries. Although prior work has shown that the minimax point estimation rate of theta does not deteriorate when $\varepsilon$ becomes unknown, our results reveal that, with a given $σ^2$, the minimax-optimal length of confidence intervals that are adaptive to unknown $\varepsilon$ is of order $σ(n^{-1/4}+\varepsilon^{1/2}/\max\{1, \log(en \varepsilon^2)\}^{1/2})$, which is polynomially worse than the optimal length when $\varepsilon$ is known. When the variance $σ^2$ is also unknown, we show a further degradation: no adaptive confidence interval can be shorter than $Ω(σn^{-1/8})$. Algorithmically, we introduce a Fourier-based certification procedure built on Carathéodory's positive-semidefiniteness constraints. By scanning candidate points and accepting those whose residual characteristic function is certifiably consistent with a Gaussian location mixture, our algorithm attains the minimax lower bound in the known-variance setting and is computable in polynomial time.

preprint2022arXiv

Optimal Orthogonal Group Synchronization and Rotation Group Synchronization

We study the statistical estimation problem of orthogonal group synchronization and rotation group synchronization. The model is $Y_{ij} = Z_i^* Z_j^{*T} + σW_{ij}\in\mathbb{R}^{d\times d}$ where $W_{ij}$ is a Gaussian random matrix and $Z_i^*$ is either an orthogonal matrix or a rotation matrix, and each $Y_{ij}$ is observed independently with probability $p$. We analyze an iterative polar decomposition algorithm for the estimation of $Z^*$ and show it has an error of $(1+o(1))\frac{σ^2 d(d-1)}{2np}$ when initialized by spectral methods. A matching minimax lower bound is further established which leads to the optimality of the proposed algorithm as it achieves the exact minimax risk.

preprint2022arXiv

SDP Achieves Exact Minimax Optimality in Phase Synchronization

We study the phase synchronization problem with noisy measurements $Y=z^*z^{*H}+σW\in\mathbb{C}^{n\times n}$, where $z^*$ is an $n$-dimensional complex unit-modulus vector and $W$ is a complex-valued Gaussian random matrix. It is assumed that each entry $Y_{jk}$ is observed with probability $p$. We prove that an SDP relaxation of the MLE achieves the error bound $(1+o(1))\frac{σ^2}{2np}$ under a normalized squared $\ell_2$ loss. This result matches the minimax lower bound of the problem, and even the leading constant is sharp. The analysis of the SDP is based on an equivalent non-convex programming whose solution can be characterized as a fixed point of the generalized power iteration lifted to a higher dimensional space. This viewpoint unifies the proofs of the statistical optimality of three different methods: MLE, SDP, and generalized power method. The technique is also applied to the analysis of the SDP for $\mathbb{Z}_2$ synchronization, and we achieve the minimax optimal error $\exp\left(-(1-o(1))\frac{np}{2σ^2}\right)$ with a sharp constant in the exponent.

preprint2022arXiv

Uncertainty quantification in the Bradley-Terry-Luce model

The Bradley-Terry-Luce (BTL) model is a benchmark model for pairwise comparisons between individuals. Despite recent progress on the first-order asymptotics of several popular procedures, the understanding of uncertainty quantification in the BTL model remains largely incomplete, especially when the underlying comparison graph is sparse. In this paper, we fill this gap by focusing on two estimators that have received much recent attention: the maximum likelihood estimator (MLE) and the spectral estimator. Using a unified proof strategy, we derive sharp and uniform non-asymptotic expansions for both estimators in the sparsest possible regime (up to some poly-logarithmic factors) of the underlying comparison graph. These expansions allow us to obtain: (i) finite-dimensional central limit theorems for both estimators; (ii) construction of confidence intervals for individual ranks; (iii) optimal constant of $\ell_2$ estimation, which is achieved by the MLE but not by the spectral estimator. Our proof is based on a self-consistent equation of the second-order remainder vector and a novel leave-two-out analysis.

preprint2021arXiv

Exact Minimax Estimation for Phase Synchronization

We study the phase synchronization problem with measurements $Y=z^*z^{*H}+σW\in\mathbb{C}^{n\times n}$, where $z^*$ is an $n$-dimensional complex unit-modulus vector and $W$ is a complex-valued Gaussian random matrix. It is assumed that each entry $Y_{jk}$ is observed with probability $p$. We prove that the minimax lower bound of estimating $z^*$ under the squared $\ell_2$ loss is $(1-o(1))\frac{σ^2}{2p}$. We also show that both generalized power method and maximum likelihood estimator achieve the error bound $(1+o(1))\frac{σ^2}{2p}$. Thus, $\frac{σ^2}{2p}$ is the exact asymptotic minimax error of the problem. Our upper bound analysis involves a precise characterization of the statistical property of the power iteration. The lower bound is derived through an application of van Trees' inequality.

preprint2021arXiv

On Computation Complexity of True Proof Number Search

We point out that the computation of true \emph{proof} and \emph{disproof} numbers for proof number search in arbitrary directed acyclic graphs is NP-hard, an important theoretical result for proof number search. The proof requires a reduction from SAT, which demonstrates that finding true proof/disproof number for arbitrary DAG is at least as hard as deciding if arbitrary SAT instance is satisfiable, thus NP-hard.

preprint2021arXiv

Optimal Full Ranking from Pairwise Comparisons

We consider the problem of ranking $n$ players from partial pairwise comparison data under the Bradley-Terry-Luce model. For the first time in the literature, the minimax rate of this ranking problem is derived with respect to the Kendall's tau distance that measures the difference between two rank vectors by counting the number of inversions. The minimax rate of ranking exhibits a transition between an exponential rate and a polynomial rate depending on the magnitude of the signal-to-noise ratio of the problem. To the best of our knowledge, this phenomenon is unique to full ranking and has not been seen in any other statistical estimation problem. To achieve the minimax rate, we propose a divide-and-conquer ranking algorithm that first divides the $n$ players into groups of similar skills and then computes local MLE within each group. The optimality of the proposed algorithm is established by a careful approximate independence argument between the two steps.

preprint2020arXiv

Bayesian Model Selection with Graph Structured Sparsity

We propose a general algorithmic framework for Bayesian model selection. A spike-and-slab Laplacian prior is introduced to model the underlying structural assumption. Using the notion of effective resistance, we derive an EM-type algorithm with closed-form iterations to efficiently explore possible candidates for Bayesian model selection. The deterministic nature of the proposed algorithm makes it more scalable to large-scale and high-dimensional data sets compared with existing stochastic search algorithms. When applied to sparse linear regression, our framework recovers the EMVS algorithm [Rockova and George, 2014] as a special case. We also discuss extensions of our framework using tools from graph algebra to incorporate complex Bayesian models such as biclustering and submatrix localization. Extensive simulation studies and real data applications are conducted to demonstrate the superior performance of our methods over its frequentist competitors such as $\ell_0$ or $\ell_1$ penalization.

preprint2020arXiv

Convergence Rates of Empirical Bayes Posterior Distributions: A Variational Perspective

We study the convergence rates of empirical Bayes posterior distributions for nonparametric and high-dimensional inference. We show that as long as the hyperparameter set is discrete, the empirical Bayes posterior distribution induced by the maximum marginal likelihood estimator can be regarded as a variational approximation to a hierarchical Bayes posterior distribution. This connection between empirical Bayes and variational Bayes allows us to leverage the recent results in the variational Bayes literature, and directly obtains the convergence rates of empirical Bayes posterior distributions from a variational perspective. For a more general hyperparameter set that is not necessarily discrete, we introduce a new technique called "prior decomposition" to deal with prior distributions that can be written as convex combinations of probability measures whose supports are low-dimensional subspaces. This leads to generalized versions of the classical "prior mass and testing" conditions for the convergence rates of empirical Bayes. Our theory is applied to a number of statistical estimation problems including nonparametric density estimation and sparse linear regression.

preprint2020arXiv

Model Repair: Robust Recovery of Over-Parameterized Statistical Models

A new type of robust estimation problem is introduced where the goal is to recover a statistical model that has been corrupted after it has been estimated from data. Methods are proposed for "repairing" the model using only the design and not the response values used to fit the model in a supervised learning setting. Theory is developed which reveals that two important ingredients are necessary for model repair---the statistical model must be over-parameterized, and the estimator must incorporate redundancy. In particular, estimators based on stochastic gradient descent are seen to be well suited to model repair, but sparse estimators are not in general repairable. After formulating the problem and establishing a key technical lemma related to robust estimation, a series of results are presented for repair of over-parameterized linear models, random feature models, and artificial neural networks. Simulation studies are presented that corroborate and illustrate the theoretical findings.

preprint2020arXiv

Optimal estimation of variance in nonparametric regression with random design

Consider the heteroscedastic nonparametric regression model with random design \begin{align*} Y_i = f(X_i) + V^{1/2}(X_i)\varepsilon_i, \quad i=1,2,\ldots,n, \end{align*} with $f(\cdot)$ and $V(\cdot)$ $α$- and $β$-Hölder smooth, respectively. We show that the minimax rate of estimating $V(\cdot)$ under both local and global squared risks is of the order \begin{align*} n^{-\frac{8αβ}{4αβ+ 2α+ β}} \vee n^{-\frac{2β}{2β+1}}, \end{align*} where $a\vee b := \max\{a,b\}$ for any two real numbers $a,b$. This result extends the fixed design rate $n^{-4α} \vee n^{-2β/(2β+1)}$ derived in Wang et al. [2008] in a non-trivial manner, as indicated by the appearances of both $α$ and $β$ in the first term. In the special case of constant variance, we show that the minimax rate is $n^{-8α/(4α+1)}\vee n^{-1}$ for variance estimation, which further implies the same rate for quadratic functional estimation and thus unifies the minimax rate under the nonparametric regression model with those under the density model and the white noise model. To achieve the minimax rate, we develop a U-statistic-based local polynomial estimator and a lower bound that is constructed over a specified distribution family of randomness designed for both $\varepsilon_i$ and $X_i$.

preprint2019arXiv

Universal Dynamics of a Degenerate Bose Gas Quenched to Unitarity

Motivated by an unexpected experimental observation from the Cambridge group, [Eigen {\it et al.,} Nature {\bf563}, 221 (2018)], we study the evolution of the momentum distribution of a degenerate Bose gas quenched from the weakly interacting to the unitarity regime. For the two-body problem, we establish a relation that connects the momentum distribution at a long time to a sub-leading term in the initial wave function. For the many-body problem, we employ the time-dependent Bogoliubov variational wave function and find that, in certain momentum regimes, the momentum distribution at long times displays the same exponential behavior found by the experiment. Moreover, we find that this behavior is universal and independent of the short-range details of the interaction potential. Consistent with the relation found in the two-body problem, we also numerically show that this exponential form is hidden in the same sub-leading term of the Bogoliubov wave function in the initial stages. Our results establish a consistent picture to understand the universal dynamics observed in the Cambridge experiment.

preprint2018arXiv

Critical behavior of order parameter at the nonequilibrium phase transition of the Ising model

After a quench of transverse field, the asymptotic long-time state of Ising model displays a transition from a ferromagnetic phase to a paramagnetic phase as the post-quench field strength increases, which is revealed by the vanishing of the order parameter defined as the averaged magnetization over time. We estimate the critical behavior of the magnetization at this nonequilibrium phase transition by using mean-field approximation. In the vicinity of the critical field, the magnetization vanishes as the inverse of a logarithmic function, which is significantly distinguished from the critical behavior of order parameter at the corresponding equilibrium phase transition, i.e. a power-law function.

preprint2016arXiv

Community Detection in Degree-Corrected Block Models

Community detection is a central problem of network data analysis. Given a network, the goal of community detection is to partition the network nodes into a small number of clusters, which could often help reveal interesting structures. The present paper studies community detection in Degree-Corrected Block Models (DCBMs). We first derive asymptotic minimax risks of the problem for a misclassification proportion loss under appropriate conditions. The minimax risks are shown to depend on degree-correction parameters, community sizes, and average within and between community connectivities in an intuitive and interpretable way. In addition, we propose a polynomial time algorithm to adaptively perform consistent and even asymptotically optimal community detection in DCBMs.

preprint2016arXiv

Exact Exponent in Optimal Rates for Crowdsourcing

In many machine learning applications, crowdsourcing has become the primary means for label collection. In this paper, we study the optimal error rate for aggregating labels provided by a set of non-expert workers. Under the classic Dawid-Skene model, we establish matching upper and lower bounds with an exact exponent $mI(π)$ in which $m$ is the number of workers and $I(π)$ the average Chernoff information that characterizes the workers' collective ability. Such an exact characterization of the error exponent allows us to state a precise sample size requirement $m>\frac{1}{I(π)}\log\frac{1}ε$ in order to achieve an $ε$ misclassification error. In addition, our results imply the optimality of various EM algorithms for crowdsourcing initialized by consistent estimators.

preprint2016arXiv

Minimax Optimal Convergence Rates for Estimating Ground Truth from Crowdsourced Labels

Crowdsourcing has become a primary means for label collection in many real-world machine learning applications. A classical method for inferring the true labels from the noisy labels provided by crowdsourcing workers is Dawid-Skene estimator. In this paper, we prove convergence rates of a projected EM algorithm for the Dawid-Skene estimator. The revealed exponent in the rate of convergence is shown to be optimal via a lower bound argument. Our work resolves the long standing issue of whether Dawid-Skene estimator has sound theoretical guarantees besides its good performance observed in practice. In addition, a comparative study with majority voting illustrates both advantages and pitfalls of the Dawid-Skene estimator.

preprint2016arXiv

Rate exact Bayesian adaptation with modified block priors

A novel block prior is proposed for adaptive Bayesian estimation. The prior does not depend on the smoothness of the function or the sample size. It puts sufficient prior mass near the true signal and automatically concentrates on its effective dimension. A rate-optimal posterior contraction is obtained in a general framework, which includes density estimation, white noise model, Gaussian sequence model, Gaussian regression and spectral density estimation.

preprint2016arXiv

Sparse CCA: Adaptive Estimation and Computational Barriers

Canonical correlation analysis is a classical technique for exploring the relationship between two sets of variables. It has important applications in analyzing high dimensional datasets originated from genomics, imaging and other fields. This paper considers adaptive minimax and computationally tractable estimation of leading sparse canonical coefficient vectors in high dimensions. First, we establish separate minimax estimation rates for canonical coefficient vectors of each set of random variables under no structural assumption on marginal covariance matrices. Second, we propose a computationally feasible estimator to attain the optimal rates adaptively under an additional sample size condition. Finally, we show that a sample size condition of this kind is needed for any randomized polynomial-time estimator to be consistent, assuming hardness of certain instances of the Planted Clique detection problem. The result is faithful to the Gaussian models used in the paper. As a byproduct, we obtain the first computational lower bounds for sparse PCA under the Gaussian single spiked covariance model.

preprint2015arXiv

Achieving Optimal Misclassification Proportion in Stochastic Block Model

Community detection is a fundamental statistical problem in network data analysis. Many algorithms have been proposed to tackle this problem. Most of these algorithms are not guaranteed to achieve the statistical optimality of the problem, while procedures that achieve information theoretic limits for general parameter spaces are not computationally tractable. In this paper, we present a computationally feasible two-stage method that achieves optimal statistical performance in misclassification proportion for stochastic block model under weak regularity conditions. Our two-stage procedure consists of a generic refinement step that can take a wide range of weakly consistent community detection procedures as initializer, to which the refinement stage applies and outputs a community assignment achieving optimal misclassification proportion with high probability. The practical effectiveness of the new algorithm is demonstrated by competitive numerical results.

preprint2015arXiv

Minimax estimation in sparse canonical correlation analysis

Canonical correlation analysis is a widely used multivariate statistical technique for exploring the relation between two sets of variables. This paper considers the problem of estimating the leading canonical correlation directions in high-dimensional settings. Recently, under the assumption that the leading canonical correlation directions are sparse, various procedures have been proposed for many high-dimensional applications involving massive data sets. However, there has been few theoretical justification available in the literature. In this paper, we establish rate-optimal nonasymptotic minimax estimation with respect to an appropriate loss function for a wide range of model spaces. Two interesting phenomena are observed. First, the minimax rates are not affected by the presence of nuisance parameters, namely the covariance matrices of the two sets of random variables, though they need to be estimated in the canonical correlation analysis problem. Second, we allow the presence of the residual canonical correlation directions. However, they do not influence the minimax rates under a mild condition on eigengap. A generalized sin-theta theorem and an empirical process bound for Gaussian quadratic forms under rank constraint are used to establish the minimax upper bounds, which may be of independent interest.

preprint2015arXiv

Portable Microwave Frequency Dissemination in Free Space and Implications on Ground-Satellite Synchronization

Frequency dissemination and synchronization in free space plays an important role in global navigation satellite system, radio astronomy and synthetic aperture radar. In this paper, we demonstrate a portable radio frequency dissemination scheme via free space using microwave antennas. The setup has a good environment adaptability and high dissemination stability. The frequency signal is disseminated at different distances ranging from 10 to 640 m with a fixed 10 Hz locking bandwidth, and the scaling law of dissemination stability on distance and averaging time is discussed. The preliminary extrapolation shows that the dissemination stability may reach $1\times10^{-12}/s$ in ground-to-satellite synchronization, which far exceeds all present methods, and is worthy for further study.

preprint2015arXiv

Posterior Contraction Rates of the Phylogenetic Indian Buffet Processes

By expressing prior distributions as general stochastic processes, nonparametric Bayesian methods provide a flexible way to incorporate prior knowledge and constrain the latent structure in statistical inference. The Indian buffet process (IBP) is such an example that can be used to define a prior distribution on infinite binary features, where the exchangeability among subjects is assumed. The phylogenetic Indian buffet process (pIBP), a derivative of IBP, enables the modeling of non-exchangeability among subjects through a stochastic process on a rooted tree, which is similar to that used in phylogenetics, to describe relationships among the subjects. In this paper, we study the theoretical properties of IBP and pIBP under a binary factor model. We establish the posterior contraction rates for both IBP and pIBP and substantiate the theoretical results through simulation studies. This is the first work addressing the frequentist property of the posterior behaviors of IBP and pIBP. We also demonstrated its practical usefulness by applying pIBP prior to a real data example arising in the field of cancer genomics where the exchangeability among subjects is violated.

preprint2015arXiv

Rate-optimal graphon estimation

Network analysis is becoming one of the most active research areas in statistics. Significant advances have been made recently on developing theories, methodologies and algorithms for analyzing networks. However, there has been little fundamental study on optimal estimation. In this paper, we establish optimal rate of convergence for graphon estimation. For the stochastic block model with $k$ clusters, we show that the optimal rate under the mean squared error is $n^{-1}\log k+k^2/n^2$. The minimax upper bound improves the existing results in literature through a technique of solving a quadratic equation. When $k\leq\sqrt{n\log n}$, as the number of the cluster $k$ grows, the minimax rate grows slowly with only a logarithmic order $n^{-1}\log k$. A key step to establish the lower bound is to construct a novel subset of the parameter space and then apply Fano's lemma, from which we see a clear distinction of the nonparametric graphon estimation problem from classical nonparametric regression, due to the lack of identifiability of the order of nodes in exchangeable random graph models. As an immediate application, we consider nonparametric graphon estimation in a Hölder class with smoothness $α$. When the smoothness $α\geq1$, the optimal rate of convergence is $n^{-1}\log n$, independent of $α$, while for $α\in(0,1)$, the rate is $n^{-2α/(α+1)}$, which is, to our surprise, identical to the classical nonparametric rate.

preprint2015arXiv

Rate-optimal posterior contraction for sparse PCA

Principal component analysis (PCA) is possibly one of the most widely used statistical tools to recover a low-rank structure of the data. In the high-dimensional settings, the leading eigenvector of the sample covariance can be nearly orthogonal to the true eigenvector. A sparse structure is then commonly assumed along with a low rank structure. Recently, minimax estimation rates of sparse PCA were established under various interesting settings. On the other side, Bayesian methods are becoming more and more popular in high-dimensional estimation, but there is little work to connect frequentist properties and Bayesian methodologies for high-dimensional data analysis. In this paper, we propose a prior for the sparse PCA problem and analyze its theoretical properties. The prior adapts to both sparsity and rank. The posterior distribution is shown to contract to the truth at optimal minimax rates. In addition, a computationally efficient strategy for the rank-one case is discussed.

preprint2015arXiv

Revealing the origin of super-Efimov states in the hyperspherical formalism

Super-Efimov states are a new kind of universal three-body bound states predicted for three identical fermions with $p$-wave resonant interactions in two dimensions by a recent field-theoretic calculation [Phys.~Rev.~Lett.~\textbf{110}, 235301 (2013)]. The binding energies of these states obey a dramatic double exponential scaling $E_n=E_*\exp(-2 e^{πn/s_0+θ})$ with universal scaling $s_0=4/3$ and three-body parameters $E_*$ and $θ$. We use the hyperspherical formalism and show that the super-Efimov states originate from an emergent effective potential $-1/4ρ^2-(s_0^2+1/4)/ρ^2\ln^2\left(ρ\right)$ at large hyperradius $ρ$. Moreover, for pairwise interparticle potentials with van der Waals tails, our numerical calculation indicates that the three-body parameters $E_*$ and $θ$ are also universal and the ground super-Efimov state shall cross the threshold when the $2$D $p$-wave scattering area is about $-42.0\, l_\text{vdW}^2$ with $l_\text{vdW}$ the van der Waals length.

preprint2014arXiv

Bernstein-von Mises Theorems for Functionals of Covariance Matrix

We provide a general theoretical framework to derive Bernstein-von Mises theorems for matrix functionals. The conditions on functionals and priors are explicit and easy to check. Results are obtained for various functionals including entries of covariance matrix, entries of precision matrix, quadratic forms, log-determinant, eigenvalues in the Bayesian Gaussian covariance/precision matrix estimation setting, as well as for Bayesian linear and quadratic discriminant analysis.

preprint2014arXiv

Fiber-based ultra-stable frequency synchronization using client-side, 1f-2f active compensation method

We demonstrate a frequency synchronization scheme with the phase noise compensation function placed at the client site. One transmitting module hence can be linked with multiple client sites. As a performance test, using two separate 50 km fiber spools, we recover the 100 MHz disseminated reference frequencies at two remote sites, separately. Relative frequency stabilities between two recovered frequency signals of 2.8E-14/s and 2.5E-17/day are obtained. This scalable scheme is suitable for the applications of frequency dissemination with a star-topology, such as SKA and DSN.

preprint2014arXiv

Three Identical Fermions with Resonant p-wave Interactions in Two Dimensions

A new kind of "super-Efimov" states of binding energies scaling as $\ln|E_n|\sim-e^{3nπ/4}$ were predicted by a field theory calculation for three fermions with resonant $p$-wave interactions in two dimensions [Phys. Rev. Lett. \textbf{110}, 235301 (2013)]. However, the universality of these "super-Efimov" states has not been proved independently. In this Letter, we study the three fermion system through the hyperspherical formalism. Within the adiabatic approximation, we find that at $p$-wave resonances, the low energy physics of states of angular momentum $\ell=\pm1$ crucially depends on the value of an emergent dimensionless parameter $Y$ determined by the detail of the inter-particle potential. Only if $Y$ is exactly zero, the predicted "super-Efimov" states exist. If $Y>0$, the scaling of the bound states changes to $\ln|E_n|\sim-(nπ)^2/2Y$, while there are no shallow bound states if $Y<0$.

preprint2013arXiv

Sparse CCA via Precision Adjusted Iterative Thresholding

Sparse Canonical Correlation Analysis (CCA) has received considerable attention in high-dimensional data analysis to study the relationship between two sets of random variables. However, there has been remarkably little theoretical statistical foundation on sparse CCA in high-dimensional settings despite active methodological and applied research activities. In this paper, we introduce an elementary sufficient and necessary characterization such that the solution of CCA is indeed sparse, propose a computationally efficient procedure, called CAPIT, to estimate the canonical directions, and show that the procedure is rate-optimal under various assumptions on nuisance parameters. The procedure is applied to a breast cancer dataset from The Cancer Genome Atlas project. We identify methylation probes that are associated with genes, which have been previously characterized as prognosis signatures of the metastasis of breast cancer.

preprint2012arXiv

Breathing mode of two-dimensional atomic Fermi gases in harmonic traps

For two-dimensional (2D) atomic Fermi gases in harmonic traps, the SO(2,1) symmetry is broken by the interatomic interaction explicitly via the contact correlation operator. Consequently the frequency of the breathing mode $ω_B$ of the 2D Fermi gas can be different from $2ω_0$, with $ω_0$ the trapping frequency of harmonic potentials. At zero temperature, we use the sum rules of density correlation functions to yield upper bounds for $ω_B$. We further calculate $ω_B$ through the Euler equations in the hydrodynamic regime. The obtained value of $ω_B$ satisfies the upper bounds and shows deviation from $2ω_0$ which can be as large as about 8%.

preprint2010arXiv

Spin-Orbit Coupled Spinor Bose-Einstein Condensates

An effective spin-orbit coupling can be generated in cold atom system by engineering atom-light interactions. In this letter we study spin-1/2 and spin-1 Bose-Einstein condensates with Rashba spin-orbit coupling, and find that the condensate wave function will develop non-trivial structures. From numerical simulation we have identified two different phases. In one phase the ground state is a single plane wave, and often we find the system splits into domains and an array of vortices plays the role as domain wall. In this phase, time-reversal symmetry is broken. In the other phase the condensate wave function is a standing wave and it forms spin stripe. The transition between them is driven by interactions between bosons. We also provide an analytical understanding of these results and determines the transition point between the two phases.

Chao Gao

What is connected

Connect this record

See the researcher in context

Building this map preview

31 published item(s)

Adaptive Confidence Intervals in Efron's Gaussian Two-Groups Model

Optimal Orthogonal Group Synchronization and Rotation Group Synchronization

SDP Achieves Exact Minimax Optimality in Phase Synchronization

Uncertainty quantification in the Bradley-Terry-Luce model

Exact Minimax Estimation for Phase Synchronization

On Computation Complexity of True Proof Number Search

Optimal Full Ranking from Pairwise Comparisons

Bayesian Model Selection with Graph Structured Sparsity

Convergence Rates of Empirical Bayes Posterior Distributions: A Variational Perspective

Model Repair: Robust Recovery of Over-Parameterized Statistical Models

Optimal estimation of variance in nonparametric regression with random design

Universal Dynamics of a Degenerate Bose Gas Quenched to Unitarity

Critical behavior of order parameter at the nonequilibrium phase transition of the Ising model

Community Detection in Degree-Corrected Block Models

Exact Exponent in Optimal Rates for Crowdsourcing

Minimax Optimal Convergence Rates for Estimating Ground Truth from Crowdsourced Labels

Rate exact Bayesian adaptation with modified block priors

Sparse CCA: Adaptive Estimation and Computational Barriers

Achieving Optimal Misclassification Proportion in Stochastic Block Model

Minimax estimation in sparse canonical correlation analysis

Portable Microwave Frequency Dissemination in Free Space and Implications on Ground-Satellite Synchronization

Posterior Contraction Rates of the Phylogenetic Indian Buffet Processes

Rate-optimal graphon estimation

Rate-optimal posterior contraction for sparse PCA

Revealing the origin of super-Efimov states in the hyperspherical formalism

Bernstein-von Mises Theorems for Functionals of Covariance Matrix

Fiber-based ultra-stable frequency synchronization using client-side, 1f-2f active compensation method

Three Identical Fermions with Resonant p-wave Interactions in Two Dimensions

Sparse CCA via Precision Adjusted Iterative Thresholding

Breathing mode of two-dimensional atomic Fermi gases in harmonic traps

Spin-Orbit Coupled Spinor Bose-Einstein Condensates