Source author record

Cristina Butucea

Cristina Butucea appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Topics: math.ST, Statistics Theory, quant-ph, Machine Learning, math-ph, math.MP, math.PR, Methodology

Catalog footprint

What is connected

22 works

8 topics

4 close collaborators

preprint, 2022, arXiv

Interactive versus non-interactive locally differentially private estimation: Two elbows for the quadratic functional

Local differential privacy has recently received increasing attention from the statistics community as a valuable tool to protect the privacy of individual data owners without the need for a trusted third party. Similar to the classical notion of randomized response, the idea is that data owners randomize their true information locally and only release the perturbed data. Many different protocols for such local perturbation procedures can be designed. In most estimation problems studied in the literature so far, however, no significant difference in terms of minimax risk between purely non-interactive protocols and protocols that allow for some amount of interaction between individual data providers could be observed. In this paper we show that for estimating the integrated square of a density, sequentially interactive procedures improve substantially over the best possible non-interactive procedure in terms of minimax rate of estimation. In particular, in the non-interactive scenario we identify an elbow in the minimax rate at $s=\frac34$, whereas in the sequentially interactive scenario the elbow is at $s=\frac12$. This is markedly different both from the case of direct observations, where the elbow is well known to be at $s=\frac14$, and from the case where Laplace noise is added to the original data, where an elbow at $s=\frac94$ is obtained. We also provide adaptive estimators that achieve the optimal rate up to log-factors, draw connections to non-parametric goodness-of-fit testing and estimation of more general integral functionals, and conduct a series of numerical experiments. The fact that a particular locally differentially private, but interactive, mechanism improves over the simple non-interactive one is also of great importance for practical implementations of local differential privacy.
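The local randomization idea mentioned in this abstract can be illustrated with classical randomized response for a single binary attribute. This is only a minimal sketch of that textbook mechanism, not the interactive or non-interactive procedures of the paper; the function names and the privacy parameter name `alpha` are assumptions made for illustration.

```python
import math
import random
from typing import Sequence


def keep_probability(alpha: float) -> float:
    """Probability of releasing the true bit; the odds keep/flip equal e^alpha."""
    return math.exp(alpha) / (1.0 + math.exp(alpha))


def randomize(bit: int, alpha: float, rng: random.Random) -> int:
    """Each data owner releases the true bit with probability p, else its flip."""
    return bit if rng.random() < keep_probability(alpha) else 1 - bit


def debiased_mean(released: Sequence[int], alpha: float) -> float:
    """Unbiased estimate of the true proportion of ones from released bits."""
    p = keep_probability(alpha)
    observed = sum(released) / len(released)
    # E[observed] = p * theta + (1 - p) * (1 - theta); solve for theta.
    return (observed - (1.0 - p)) / (2.0 * p - 1.0)
```

Only the released bits reach the analyst, who corrects for the known flipping probability. A smaller `alpha` means stronger privacy and a noisier estimate, since the denominator `2p - 1` shrinks towards zero.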

preprint, 2022, arXiv

Phase transitions for support recovery under local differential privacy

We address the problem of variable selection in a high-dimensional but sparse mean model, under the additional constraint that only privatised data are available for inference. The original data are vectors with independent entries having a symmetric, strongly log-concave distribution on $\mathbb{R}$. For this purpose, we adopt a recent generalisation of classical minimax theory to the framework of local $α$-differential privacy. We provide lower and upper bounds on the rate of convergence for the expected Hamming loss over classes of at most $s$-sparse vectors whose non-zero coordinates are separated from $0$ by a constant $a>0$. As corollaries, we derive necessary and sufficient conditions (up to log factors) for exact recovery and for almost full recovery. When we restrict our attention to non-interactive mechanisms that act independently on each coordinate, our lower bound shows that, contrary to the non-private setting, both exact and almost full recovery are impossible whatever the value of $a$ in the high-dimensional regime such that $nα^2/d^2\lesssim 1$. However, in the regime $nα^2/d^2\gg \log(d)$ we can exhibit a critical value $a^*$ (up to a logarithmic factor) such that exact and almost full recovery are possible for all $a\gg a^*$ and impossible for $a\leq a^*$. We show that these results can be improved when allowing for all non-interactive (that act globally on all coordinates) locally $α$-differentially private mechanisms, in the sense that phase transitions occur at lower levels.

preprint, 2021, arXiv

Fast Non-Asymptotic Testing And Support Recovery For Large Sparse Toeplitz Covariance Matrices

We consider $n$ independent $p$-dimensional Gaussian vectors with covariance matrix having Toeplitz structure. We test that these vectors have independent components against a stationary distribution with sparse Toeplitz covariance matrix, and also select the support of non-zero entries. We assume that the non-zero values can occur in the recent past (time-lag less than $p/2$). We build test procedures that combine sum-type and scan-type procedures, but are computationally fast, and show their non-asymptotic behaviour under one-sided (only positive correlations) and two-sided alternatives, respectively. We also exhibit a selector of significant lags and bound the Hamming-loss risk of the estimated support. These results can be extended to the case of nearly Toeplitz covariance structure and to sub-Gaussian vectors. Numerical results illustrate the excellent behaviour of both test procedures and support selectors: the larger the dimension $p$, the faster the rates.

preprint, 2021, arXiv

Variable selection, monotone likelihood ratio and group sparsity

In the pivotal variable selection problem, we derive the exact non-asymptotic minimax selector over the class of all $s$-sparse vectors, which is also the Bayes selector with respect to the uniform prior. While this optimal selector is, in general, not realizable in polynomial time, we show that its tractable counterpart (the scan selector) attains the minimax expected Hamming risk to within a factor 2, and is also exact minimax with respect to the probability of wrong recovery. As a consequence, we establish explicit lower bounds under the monotone likelihood ratio property and we obtain a tight characterization of the minimax risk in terms of the best separable selector risk. We apply these general results to derive necessary and sufficient conditions for exact and almost full recovery in the location model with light tail distributions and in the problem of group variable selection under Gaussian noise.

preprint, 2020, arXiv

Locally private non-asymptotic testing of discrete distributions is faster using interactive mechanisms

We find separation rates for testing multinomial or more general discrete distributions under the constraint of local differential privacy. We construct efficient randomized algorithms and test procedures, in both the case where only non-interactive privacy mechanisms are allowed and also in the case where all sequentially interactive privacy mechanisms are allowed. The separation rates are faster in the latter case. We prove general information theoretical bounds that allow us to establish the optimality of our algorithms among all pairs of privacy mechanisms and test procedures, in most usual cases. Considered examples include testing uniform, polynomially and exponentially decreasing distributions.

preprint, 2016, arXiv

Adaptive test for large covariance matrices with missing observations

We observe $n$ independent $p$-dimensional Gaussian vectors with missing coordinates, that is, each value (which is assumed standardized) is observed with probability $a>0$. We investigate the problem of minimax nonparametric testing that the high-dimensional covariance matrix $Σ$ of the underlying Gaussian distribution is the identity matrix, using these partially observed vectors. Here, $n$ and $p$ tend to infinity and $a>0$ tends to 0, asymptotically. We assume that $Σ$ belongs to a Sobolev-type ellipsoid with parameter $α>0$. When $α$ is known, we give an asymptotically minimax consistent test procedure and find the minimax separation rates $\tilde φ_{n,p}= (a^2 n \sqrt{p})^{-\frac{2α}{4α+1}}$, under some additional constraints on $n$, $p$ and $a$. We show that, in the particular case of Toeplitz covariance matrices, the minimax separation rates are faster, $\tilde φ_{n,p}= (a^2 n p)^{-\frac{2α}{4α+1}}$. We note how the "missingness" parameter $a$ deteriorates the rates with respect to the case of fully observed vectors ($a=1$). We also propose adaptive test procedures, that is, procedures free of the parameter $α$ in some interval, and show that the loss of rate is $(\ln \ln (a^2 n\sqrt{p}))^{α/(4α+1)}$ in general and $(\ln \ln (a^2 n p))^{α/(4α+1)}$ for Toeplitz covariance matrices.
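To get a feel for the two separation rates quoted in this abstract, the formulas can be evaluated directly. The sketch below only computes the rate expressions as stated (constants and the additional constraints on $n$, $p$ and $a$ are ignored); the function name and argument names are hypothetical.

```python
import math


def separation_rate(n: int, p: int, a: float, alpha: float,
                    toeplitz: bool = False) -> float:
    """Evaluate (a^2 n sqrt(p))^(-2 alpha / (4 alpha + 1)) in the general case,
    or (a^2 n p)^(-2 alpha / (4 alpha + 1)) in the Toeplitz case."""
    effective_size = a ** 2 * n * (p if toeplitz else math.sqrt(p))
    return effective_size ** (-2.0 * alpha / (4.0 * alpha + 1.0))


# Fully observed vectors (a = 1) versus half of the coordinates missing.
full = separation_rate(n=100, p=16, a=1.0, alpha=1.0)
half = separation_rate(n=100, p=16, a=0.5, alpha=1.0)
toep = separation_rate(n=100, p=16, a=1.0, alpha=1.0, toeplitz=True)
```

A smaller rate means that alternatives closer to the identity can be detected, so `toep < full < half` reflects both the gain from Toeplitz structure and the loss caused by missing observations.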

preprint, 2016, arXiv

Fast adaptive estimation of log-additive exponential models in Kullback-Leibler divergence

We study the problem of nonparametric estimation of density functions with a product form on the domain $\triangle=\{( x_1, \ldots, x_d)\in \mathbb{R}^d, 0\leq x_1\leq \dots \leq x_d \leq 1\}$. Such densities appear in the random truncation model as the joint density function of observations. They are also obtained as maximum entropy distributions of order statistics with given marginals. We propose an estimation method based on the approximation of the logarithm of the density by a carefully chosen family of basis functions. We show that the method achieves a fast convergence rate in probability with respect to the Kullback-Leibler divergence for densities whose logarithm belongs to a Sobolev function class with known regularity. In the case when the regularity is unknown, we propose an estimation procedure using convex aggregation of the log-densities to obtain adaptability. The performance of this method is illustrated in a simulation study.

preprint, 2016, arXiv

Optimal exponential bounds for aggregation of estimators for the Kullback-Leibler loss

We study the problem of model selection type aggregation with respect to the Kullback-Leibler divergence for various probabilistic models. Rather than considering a convex combination of the initial estimators $f_1, \ldots, f_N$, our aggregation procedures rely on the convex combination of the logarithms of these functions. The first method is designed for probability density estimation as it gives an aggregate estimator that is also a proper density function, whereas the second method concerns spectral density estimation and has no such mass-conserving feature. We select the aggregation weights based on a penalized maximum likelihood criterion. We give sharp oracle inequalities that hold with high probability, with a remainder term that is decomposed into a bias and a variance part. We also show the optimality of the remainder terms by providing the corresponding lower bound results.
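The difference between combining estimators and combining their logarithms can be seen on a finite grid. The following is a minimal sketch, assuming strictly positive estimators evaluated on a common grid and weights that are already given; the penalized maximum likelihood choice of the weights described in the abstract is not implemented, and the function name is hypothetical.

```python
import math
from typing import List, Sequence


def log_convex_aggregate(estimators: Sequence[Sequence[float]],
                         weights: Sequence[float]) -> List[float]:
    """Normalised exp(sum_j w_j * log f_j) on a common finite grid."""
    if min(weights) < 0 or abs(sum(weights) - 1.0) > 1e-12:
        raise ValueError("weights must be non-negative and sum to one")
    grid_size = len(estimators[0])
    unnormalised = [
        math.exp(sum(w * math.log(f[k]) for w, f in zip(weights, estimators)))
        for k in range(grid_size)
    ]
    # Renormalising makes the aggregate a proper probability vector again.
    total = sum(unnormalised)
    return [u / total for u in unnormalised]


aggregate = log_convex_aggregate([[0.5, 0.5], [0.2, 0.8]], [0.5, 0.5])
```

With equal weights the aggregate is the normalised geometric mean of the two inputs, which differs from their arithmetic mean `[0.35, 0.65]`.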

preprint, 2016, arXiv

Sharp minimax tests for large covariance matrices and adaptation

We consider the detection problem of correlations in a $p$-dimensional Gaussian vector, when we observe $n$ independent, identically distributed random vectors, for $n$ and $p$ large. We assume that the covariance matrix varies in some ellipsoid with parameter $α>1/2$ and total energy bounded by $L>0$. We propose a test procedure based on a U-statistic of order 2 which is weighted in an optimal way. The weights are the solution of an optimization problem; they are constant on each diagonal and non-null only for the $T$ first diagonals, where $T=o(p)$. We show that this test statistic is asymptotically Gaussian distributed under the null hypothesis and also under the alternative hypothesis for matrices close to the detection boundary. We prove upper bounds for the total error probability of our test procedure, for $α>1/2$ and under the assumption $T=o(p)$ which implies that $n=o(p^{2α})$. We illustrate via a numerical study the behavior of our test procedure. Moreover, we prove lower bounds for the maximal type II error and the total error probabilities. Thus we obtain the asymptotic and the sharp asymptotically minimax separation rate $\tilde φ = (C(α, L) n^2 p)^{-α/(4α+1)}$, for $α>3/2$ and for $α>1$ together with the additional assumption $p= o(n^{4α-1})$, respectively. We deduce rate asymptotic minimax results for testing the inverse of the covariance matrix. We construct an adaptive test procedure with respect to the parameter $α$ and show that it attains the rate $\tilde ψ= (n^2 p / \ln\ln(n \sqrt{p}))^{-α/(4α+1)}$.

preprint, 2015, arXiv

Adaptive variable selection in nonparametric sparse additive models

We consider the problem of recovery of an unknown multivariate signal $f$ observed in a $d$-dimensional Gaussian white noise model of intensity $\varepsilon$. We assume that $f$ belongs to a class of smooth functions ${\cal F}^d\subset L_2([0,1]^d)$ and has an additive sparse structure determined by the parameter $s$, the number of non-zero univariate components contributing to $f$. We are interested in the case when $d=d_\varepsilon \to \infty$ as $\varepsilon \to 0$ and the parameter $s$ stays "small" relative to $d$. With these assumptions, the recovery problem at hand becomes that of determining which sparse additive components are non-zero. Attempting to reconstruct most non-zero components of $f$, but not all of them, we arrive at the problem of almost full variable selection in high-dimensional regression. For two different choices of ${\cal F}^d$, we establish conditions under which almost full variable selection is possible, and provide a procedure that gives almost full variable selection. The procedure is optimal (in the asymptotically minimax sense) in selecting most non-zero components of $f$. Moreover, it is adaptive in the parameter $s$.

preprint, 2015, arXiv

Maximum entropy distribution of order statistics with given marginals

We consider distributions of ordered random vectors with given one-dimensional marginal distributions. We give an elementary necessary and sufficient condition for the existence of such a distribution with finite entropy. In this case, we give explicitly the density of the unique distribution which achieves the maximal entropy and compute the value of its entropy. This density is the unique one which has a product form on its support and the given one-dimensional marginals. The proof relies on the study of copulas with given one-dimensional marginal distributions for its order statistics.

preprint, 2015, arXiv

Sharp minimax tests for large Toeplitz covariance matrices with repeated observations

We observe a sample of $n$ independent $p$-dimensional Gaussian vectors with Toeplitz covariance matrix $Σ= [σ_{|i-j|}]_{1 \leq i,j \leq p}$ and $σ_0=1$. We consider the problem of testing the hypothesis that $Σ$ is the identity matrix asymptotically when $n \to \infty$ and $p \to \infty$. We suppose that the covariances $σ_k$ decrease either polynomially ($\sum_{k \geq 1} k^{2α} σ^2_{k} \leq L$ for $α>1/4$ and $L>0$) or exponentially ($\sum_{k \geq 1} e^{2Ak} σ^2_{k} \leq L$ for $A,L>0$). We consider a test procedure based on a weighted U-statistic of order 2, with optimal weights chosen as the solution of an extremal problem. We give the asymptotic normality of the test statistic under the null hypothesis for fixed $n$ and $p \to +\infty$ and the asymptotic behavior of the type I error probability of our test procedure. We also show that the maximal type II error probability either tends to $0$ or is bounded from above. In the latter case, the upper bound is given using the asymptotic normality of our test statistic under alternatives close to the separation boundary. Our assumptions imply mild conditions: $n=o(p^{2α-1/2})$ (in the polynomial case), $n=o(e^p)$ (in the exponential case). We prove both rate optimality and sharp optimality of our results, for $α>1$ in the polynomial case and for any $A>0$ in the exponential case. A simulation study illustrates the good behavior of our procedure, in particular for small $n$, large $p$.

preprint, 2015, arXiv

Spectral thresholding quantum tomography for low rank states

The estimation of high dimensional quantum states is an important statistical problem arising in current quantum technology applications. A key example is the tomography of multiple ions states, employed in the validation of state preparation in ion trap experiments \cite{Haffner2005}. Since full tomography becomes unfeasible even for a small number of ions, there is a need to investigate lower dimensional statistical models which capture prior information about the state, and to devise estimation methods tailored to such models. In this paper we propose several new methods aimed at the efficient estimation of low rank states in multiple ions tomography. All methods consist in first computing the least squares estimator, followed by its truncation to an appropriately chosen smaller rank. The latter is done by setting eigenvalues below a certain "noise level" to zero, while keeping the rest unchanged, or normalising them appropriately. We show that (up to logarithmic factors in the space dimension) the mean square error of the resulting estimators scales as $r\cdot d/N$ where $r$ is the rank, $d=2^k$ is the dimension of the Hilbert space, and $N$ is the number of quantum samples. Furthermore we establish a lower bound for the asymptotic minimax risk which shows that the above scaling is optimal. The performance of the estimators is analysed in an extensive simulation study, with emphasis on the dependence on the state rank and the number of measurement repetitions. We find that all estimators perform significantly better than the least squares estimator, with the "physical estimator" (which is a bona fide density matrix) slightly outperforming the other estimators.
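The truncation step described in this abstract acts on the spectrum of the least squares estimator. The sketch below shows only that step on a list of eigenvalues assumed to have been computed elsewhere; rebuilding the matrix from the eigenvectors is omitted. Dividing by the sum of the kept eigenvalues is an assumption standing in for "normalising them appropriately", and the function name is hypothetical.

```python
from typing import List, Sequence


def threshold_spectrum(eigenvalues: Sequence[float], noise_level: float,
                       renormalise: bool = False) -> List[float]:
    """Set eigenvalues below noise_level to zero; optionally rescale the rest
    so that they sum to one (unit trace)."""
    kept = [ev if ev >= noise_level else 0.0 for ev in eigenvalues]
    if renormalise:
        total = sum(kept)
        if total <= 0.0:
            raise ValueError("no eigenvalue exceeds the noise level")
        kept = [ev / total for ev in kept]
    return kept


spectrum = [0.70, 0.25, 0.04, 0.01]
truncated = threshold_spectrum(spectrum, noise_level=0.05)
estimated_rank = sum(1 for ev in truncated if ev > 0.0)
```

Here the two smallest eigenvalues are treated as noise, so the truncated spectrum has rank 2; with `renormalise=True` the remaining eigenvalues are rescaled to restore unit trace.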

preprint, 2014, arXiv

Semiparametric topographical mixture models with symmetric errors

Motivated by the analysis of Positron Emission Tomography (PET) imaging data considered in Bowen et al. (2012), we introduce a semiparametric topographical mixture model able to capture the characteristics of dichotomous shifted response-type experiments. We propose a local estimation procedure, based on the symmetry of the local noise, for the proportion and location functions involved in the proposed model. We establish under mild conditions the minimax properties and asymptotic normality of our estimators, and conduct Monte Carlo simulations to examine their finite sample performance. Finally, a statistical analysis of the PET imaging data in Bowen et al. (2012) illustrates the proposed method.

preprint, 2013, arXiv

Detection of a sparse submatrix of a high-dimensional noisy matrix

We observe an $N\times M$ matrix $Y_{ij}=s_{ij}+ξ_{ij}$ with $ξ_{ij}\sim {\mathcal {N}}(0,1)$ i.i.d. in $i,j$, and $s_{ij}\in \mathbb {R}$. We test the null hypothesis $s_{ij}=0$ for all $i,j$ against the alternative that there exists some submatrix of size $n\times m$ with significant elements in the sense that $s_{ij}\ge a>0$. We propose a test procedure and compute the asymptotic detection boundary $a$ so that the maximal testing risk tends to 0 as $M\to\infty$, $N\to\infty$, $p=n/N\to0$, $q=m/M\to0$. We prove that this boundary is asymptotically sharp minimax under some additional constraints. Relations with other testing problems are discussed. We propose a testing procedure which adapts to unknown $(n,m)$ within some given set and compute the adaptive sharp rates. The implementation of our test procedure on synthetic data shows excellent behavior for sparse, not necessarily square matrices. We extend our sharp minimax results in different directions: first, to Gaussian matrices with unknown variance, next, to matrices of random variables having a distribution from an exponential family (non-Gaussian) and, finally, to a two-sided alternative for matrices with Gaussian elements.

preprint, 2013, arXiv

Maximum entropy copula with given diagonal section

We consider copulas with a given diagonal section and compute the explicit density of the unique optimal copula which maximizes the entropy. In this sense, this copula is the least informative among the copulas with a given diagonal section. We give an explicit criterion on the diagonal section for the existence of the optimal copula and give a closed formula for its entropy. We also provide examples for some diagonal sections of usual bivariate copulas and illustrate the differences between them and the maximum entropy copula with the same diagonal section.

preprint, 2013, arXiv

Rank penalized estimation of a quantum system

We introduce a new method to reconstruct the density matrix $ρ$ of a system of $n$ qubits and estimate its rank $d$ from data obtained by quantum state tomography measurements repeated $m$ times. The procedure consists in minimizing the risk of a linear estimator $\hat ρ$ of $ρ$ penalized by given rank (from 1 to $2^n$), where $\hat ρ$ is previously obtained by the moment method. We obtain simultaneously an estimator of the rank and the resulting density matrix associated to this rank. We establish an upper bound for the error of the penalized estimator, evaluated with the Frobenius norm, which is of order $dn(4/3)^n /m$, and consistency for the estimator of the rank. The proposed methodology is computationally efficient and is illustrated with some example states and real experimental data sets.

preprint, 2013, arXiv

Sharp detection of smooth signals in a high-dimensional sparse matrix with indirect observations

We consider a matrix-valued Gaussian sequence model, that is, we observe a sequence of high-dimensional $M \times N$ matrices of heterogeneous Gaussian random variables $x_{ij,k}$ for $i \in\{1,...,M\}$, $j \in \{1,...,N\}$ and $k \in \mathbb{Z}$. The standard deviation of our observations is $\varepsilon k^s$ for some $\varepsilon >0$ and $s \geq 0$. We give sharp rates for the detection of a sparse submatrix of size $m \times n$ with active components. A component $(i,j)$ is said to be active if the sequence $\{x_{ij,k}\}_k$ has mean $\{θ_{ij,k}\}_k$ within a Sobolev ellipsoid of smoothness $τ>0$ and total energy $\sum_k θ^2_{ij,k}$ larger than some $r^2_\varepsilon$. Our rates involve relationships between $m$, $n$, $M$ and $N$ tending to infinity such that $m/M$, $n/N$ and $\varepsilon$ tend to 0, such that a test procedure that we construct has asymptotic minimax risk tending to 0. We prove corresponding lower bounds under additional assumptions on the relative size of the submatrix in the large matrix of observations. Except for these additional conditions our rates are asymptotically sharp. Lower bounds for hypothesis testing problems mean that no test procedure can distinguish between the null hypothesis (no signal) and the alternative, i.e. the minimax risk for testing tends to 1.

preprint, 2013, arXiv

Sharp Variable Selection of a Sparse Submatrix in a High-Dimensional Noisy Matrix

We observe an $N\times M$ matrix of independent, identically distributed Gaussian random variables which are centered except for elements of some submatrix of size $n\times m$ where the mean is larger than some $a>0$. The submatrix is sparse in the sense that $n/N$ and $m/M$ tend to 0, whereas $n$, $m$, $N$ and $M$ tend to infinity. We consider the problem of selecting the random variables with significantly large mean values. We give sufficient conditions on $a$ as a function of $n$, $m$, $N$ and $M$ and construct a uniformly consistent procedure in order to do sharp variable selection. We also prove the minimax lower bounds under necessary conditions which are complementary to the previous conditions. The critical values $a^*$ separating the necessary and sufficient conditions are sharp (we show exact constants). We note a gap between the critical values $a^*$ for selection of variables and that of detecting that such a submatrix exists given by Butucea and Ingster (2012). When $a^*$ is in this gap, consistent detection is possible but no consistent selector of the corresponding variables can be found.

preprint, 2011, arXiv

Semiparametric mixtures of symmetric distributions

We consider in this paper the semiparametric mixture of two distributions equal up to a shift parameter. The model is said to be semiparametric in the sense that the mixed distribution is not supposed to belong to a parametric family. In order to ensure the identifiability of the model, it is assumed that the mixed distribution is symmetric, the model being then defined by the mixing proportion, two location parameters, and the probability density function of the mixed distribution. We propose a new class of M-estimators of these parameters based on a Fourier approach, and prove that they are square root consistent under mild regularity conditions. Their finite-sample properties are illustrated by a Monte Carlo study, and a benchmark real dataset is also studied with our method.

preprint, 2010, arXiv

Quantum U-statistics

The notion of a $U$-statistic for an $n$-tuple of identical quantum systems is introduced in analogy to the classical (commutative) case: given a selfadjoint `kernel' $K$ acting on $(\mathbb{C}^{d})^{\otimes r}$ with $r<n$, we define the symmetric operator $U_{n}= {n \choose r}^{-1} \sum_{β} K^{(β)}$ with $K^{(β)}$ being the kernel acting on the subset $β$ of $\{1,\dots ,n\}$. If the systems are prepared in the i.i.d. state $ρ^{\otimes n}$ it is shown that the sequence of properly normalised $U$-statistics converges in moments to a linear combination of Hermite polynomials in canonical variables of a CCR algebra defined through the Quantum Central Limit Theorem. In the special cases of non-degenerate kernels and kernels of order $2$ it is shown that the convergence holds in the stronger distribution sense. Two types of applications in quantum statistics are described: testing beyond the two simple hypotheses scenario, and quantum metrology with interacting hamiltonians.
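For readers unfamiliar with the classical (commutative) case that the quantum construction mirrors, a classical U-statistic averages a symmetric kernel over all size-$r$ subsets of the sample. The sketch below is that classical definition only, not the operator-valued version of the paper; the function and kernel names are chosen for illustration.

```python
from itertools import combinations
from math import comb
from typing import Callable, Sequence


def u_statistic(sample: Sequence[float], kernel: Callable[..., float],
                r: int) -> float:
    """Average of a symmetric kernel over all subsets of size r of the sample."""
    total = sum(kernel(*subset) for subset in combinations(sample, r))
    return total / comb(len(sample), r)


def variance_kernel(x: float, y: float) -> float:
    """Order-2 kernel whose U-statistic is the unbiased sample variance."""
    return 0.5 * (x - y) ** 2


value = u_statistic([1.0, 2.0, 3.0, 4.0], variance_kernel, r=2)
```

With this kernel the U-statistic coincides with the usual unbiased sample variance, here $5/3$ for the sample $1, 2, 3, 4$.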

preprint, 2007, arXiv

Minimax and adaptive estimation of the Wigner function in quantum homodyne tomography with noisy data

We estimate the quantum state of a light beam from results of quantum homodyne measurements performed on identically prepared quantum systems. The state is represented through the Wigner function, a generalized probability density on $\mathbb{R}^2$ which may take negative values and must respect intrinsic positivity constraints imposed by quantum physics. The effect of the losses due to detection inefficiencies, which are always present in a real experiment, is the addition to the tomographic data of independent Gaussian noise. We construct a kernel estimator for the Wigner function, prove that it is minimax efficient for the pointwise risk over a class of infinitely differentiable functions, and implement it for numerical results. We construct adaptive estimators, that is, estimators that do not depend on the smoothness parameters, and prove that in some setups they attain the minimax rates for the corresponding smoothness class.
