Source author record

Jan Johannes

Jan Johannes appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

math.ST Statistics Theory Computation math.NA Methodology

Catalog footprint

What is connected

18 works

5 topics

4 close collaborators

preprint, 2022, arXiv

Adaptive pointwise density estimation under local differential privacy

We consider the estimation of a density at a fixed point under a local differential privacy constraint, where the observations are anonymised before being available for statistical inference. We propose both a privatised version of a projection density estimator as well as a kernel density estimator and derive their minimax rates under a privacy constraint. There is a twofold deterioration of the minimax rates due to the anonymisation, which we show to be unavoidable by providing lower bounds. In both estimation procedures a tuning parameter has to be chosen. We suggest a variant of the classical Goldenshluger-Lepski method for choosing the bandwidth and the cut-off dimension, respectively, and analyse its performance. It provides adaptive minimax-optimal (up to log-factors) estimators. We discuss in detail how the lower and upper bound depend on the privacy constraints, which in turn is reflected by a modification of the adaptive method.
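As a rough illustration of a privatised kernel estimator of the kind described above, the following Python sketch uses the standard Laplace mechanism: each data holder releases only a noisy kernel evaluation at the point of interest, and the analyst averages the released values. The Epanechnikov kernel, the function names `privatise` and `private_kde_at_point`, and the noise calibration are assumptions made for this sketch, not the construction from the preprint.

```python
import numpy as np

K_SUP = 0.75  # supremum of the Epanechnikov kernel (assumed kernel choice)


def epanechnikov(u):
    """Epanechnikov kernel: nonnegative, bounded by 0.75, supported on [-1, 1]."""
    return 0.75 * np.clip(1.0 - np.asarray(u, dtype=float) ** 2, 0.0, None)


def privatise(x, x0, h, alpha, rng):
    """Release a noisy kernel evaluation for every raw observation in x.

    The raw contribution K((x0 - x) / h) / h lies in [0, K_SUP / h], so two
    different inputs change it by at most K_SUP / h. Laplace noise with scale
    (K_SUP / h) / alpha is the textbook way to make each released value
    alpha-locally differentially private.
    """
    contrib = epanechnikov((x0 - np.asarray(x, dtype=float)) / h) / h
    scale = K_SUP / (h * alpha)
    return contrib + rng.laplace(0.0, scale, size=np.shape(contrib))


def private_kde_at_point(z):
    """The analyst only sees the privatised values z and averages them."""
    return float(np.mean(z))
```

In this sketch the variance of the added noise is 2 * K_SUP**2 / (h**2 * alpha**2), so a small bandwidth `h` or a strict privacy level `alpha` inflates the noise. This is one way to see why the bandwidth choice has to account for the privacy constraint.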

preprint, 2021, arXiv

Data-driven aggregation in circular deconvolution

In a circular deconvolution model we consider the fully data-driven density estimation of a circular random variable where the density of the additive independent measurement error is unknown. We have at hand two independent i.i.d. samples, one of the contaminated version of the variable of interest, and the other of the additive noise. We show optimality, in an oracle and minimax sense, of a fully data-driven weighted sum of orthogonal series density estimators. Two shapes of random weights are considered, one motivated by a Bayesian approach and the other by a well-known model selection method. We derive non-asymptotic upper bounds for the quadratic risk and the maximal quadratic risk over Sobolev-like ellipsoids of the fully data-driven estimator. We compute rates which can be obtained in different configurations for the smoothness of the density of interest and the error density. The rates (strictly) match the optimal oracle or minimax rates for a large variety of cases, and feature otherwise at most a deterioration by a logarithmic factor. We illustrate the performance of the fully data-driven weighted sum of orthogonal series estimators by a simulation study.

preprint, 2020, arXiv

Adaptive minimax testing for circular convolution

Given observations from a circular random variable contaminated by an additive measurement error, we consider the problem of minimax optimal goodness-of-fit testing in a non-asymptotic framework. We propose direct and indirect testing procedures using a projection approach. The structure of the optimal tests depends on regularity and ill-posedness parameters of the model, which are unknown in practice. Therefore, adaptive testing strategies that perform optimally over a wide range of regularity and ill-posedness classes simultaneously are investigated. Considering a multiple testing procedure, we obtain adaptive, i.e. assumption-free, procedures and analyse their performance. Compared with the non-adaptive tests, their radii of testing face a deterioration by a log-factor. We show that for testing of uniformity this loss is unavoidable by providing a lower bound. The results are illustrated considering Sobolev spaces and ordinary or super smooth error densities.

preprint, 2020, arXiv

Adaptive minimax testing in inverse Gaussian sequence space models

In the inverse Gaussian sequence space model with additional noisy observations of the operator, we derive nonasymptotic minimax radii of testing for ellipsoid-type alternatives simultaneously for both the signal detection problem (testing against zero) and the goodness-of-fit testing problem (testing against a prescribed sequence) without any regularity assumption on the null hypothesis. The radii are the maximum of two terms, each of which only depends on one of the noise levels. Interestingly, the term involving the noise level of the operator explicitly depends on the null hypothesis and vanishes in the signal detection case. The minimax radii are established by first showing a lower bound for arbitrary null hypotheses and noise levels. For the upper bound we consider two testing procedures, a direct test based on estimating the energy in the image space and an indirect test. Under mild assumptions, we prove that the testing radius of the indirect test achieves the lower bound, which shows the minimax optimality of the radius and the test. We highlight the assumptions under which the direct test also performs optimally. Furthermore, we apply a classical Bonferroni method for making both the indirect and the direct test adaptive with respect to the regularity of the alternative. The radii of the adaptive tests are deteriorated by an additional log-factor, which we show to be unavoidable. The results are illustrated considering Sobolev spaces and mildly or severely ill-posed inverse problems.

preprint, 2020, arXiv

Data-driven aggregation in non-parametric density estimation on the real line

We study non-parametric estimation of an unknown density with support in R (respectively R+). The proposed estimation procedure is based on the projection on finite dimensional subspaces spanned by the Hermite (respectively the Laguerre) functions. The focus of this paper is to introduce a data-driven aggregation approach in order to deal with the upcoming bias-variance trade-off. Our novel procedure integrates the usual model selection method as a limit case. We show the oracle- and the minimax-optimality of the data-driven aggregated density estimator and hence its adaptivity. We present results of a simulation study which allow us to compare the finite sample performance of the data-driven estimator using model selection with that of the new aggregation.

preprint, 2020, arXiv

Minimax testing and quadratic functional estimation for circular convolution

In a circular convolution model, we aim to infer on the density of a circular random variable using observations contaminated by an additive measurement error. We highlight the interplay of the two problems: optimal testing and quadratic functional estimation. Under general regularity assumptions, we determine an upper bound for the minimax risk of estimation for the quadratic functional. The upper bound consists of two terms, one that mimics a classical bias-variance trade-off and a second that causes the typical elbow effect in quadratic functional estimation. Using a minimax optimal estimator of the quadratic functional as a test statistic, we derive an upper bound for the nonasymptotic minimax radius of testing for nonparametric alternatives. Interestingly, the term causing the elbow effect in the estimation case vanishes in the radius of testing. We provide a matching lower bound for the testing problem. By showing that any lower bound for the testing problem also yields a lower bound for the quadratic functional estimation problem, we obtain a lower bound for the risk of estimation. Lastly, we prove a matching lower bound for the term causing the elbow effect in the estimation problem. The results are illustrated considering Sobolev spaces and ordinary or super smooth error densities.

preprint, 2020, arXiv

Spectral cut-off regularisation for density estimation under multiplicative measurement errors

We study the non-parametric estimation of an unknown density f with support on R+ based on an i.i.d. sample with multiplicative measurement errors. The proposed fully data-driven procedure is based on the estimation of the Mellin transform of the density f, a regularisation of the inverse of the Mellin transform by a spectral cut-off and a data-driven model selection in order to deal with the upcoming bias-variance trade-off. We introduce and discuss further Mellin-Sobolev spaces which characterize the regularity of the unknown density f through the decay of its Mellin transform. Additionally, we show minimax-optimality over Mellin-Sobolev spaces of the data-driven density estimator and hence its adaptivity.

preprint, 2016, arXiv

Adaptive non-parametric estimation in the presence of dependence

We consider non-parametric estimation problems in the presence of dependent data, notably non-parametric regression with random design and non-parametric density estimation. The proposed estimation procedure is based on a dimension reduction. The minimax optimal rate of convergence of the estimator is derived assuming a sufficiently weak dependence characterized by fast decreasing mixing coefficients. We illustrate these results by considering classical smoothness assumptions. However, the proposed estimator requires an optimal choice of a dimension parameter depending on certain characteristics of the function of interest, which are not known in practice. The main issue addressed in our work is an adaptive choice of this dimension parameter combining model selection and Lepski's method. It is inspired by the recent work of Goldenshluger and Lepski (2011). We show that this data-driven estimator can attain the lower risk bound up to a constant provided a fast decay of the mixing coefficients.

preprint, 2016, arXiv

Adaptive non-parametric instrumental regression in the presence of dependence

We consider the estimation of a structural function which models a non-parametric relationship between a response and an endogenous regressor given an instrument in the presence of dependence in the data generating process. Assuming an independent and identically distributed (i.i.d.) sample it has been shown in Johannes and Schwarz (2010) that a least squares estimator based on dimension reduction and thresholding can attain minimax-optimal rates of convergence up to a constant. As this estimation procedure requires an optimal choice of a dimension parameter with regard to, among others, certain characteristics of the unknown structural function, we investigate its fully data-driven choice based on a combination of model selection and Lepski's method inspired by Goldenshluger and Lepski (2011). For the resulting fully data-driven thresholded least squares estimator a non-asymptotic oracle risk bound is derived by considering either an i.i.d. sample or by dismissing the independence assumption. In both cases the derived risk bounds coincide up to a constant assuming sufficiently weak dependence characterised by a fast decay of the mixing coefficients. Employing the risk bounds, the minimax optimality up to a constant of the estimator is established over a variety of classes of structural functions.

preprint, 2016, arXiv

Functional linear instrumental regression under second order stationarity

We consider the problem of estimating the slope parameter in functional linear instrumental regression, where in the presence of an instrument W, i.e., an exogenous random function, a scalar response Y is modeled in dependence of an endogenous random function X. Assuming second order stationarity jointly for X and W a nonparametric estimator of the functional slope parameter and its derivatives is proposed based on an n-sample of (Y,X,W). In this paper the minimax optimal rate of convergence of the estimator is derived assuming that the slope parameter belongs to the well-known Sobolev space of periodic functions. We discuss the cases that the cross-covariance operator associated to the random functions X and W is finitely, infinitely or in some general form smoothing.

preprint, 2015, arXiv

Adaptive Bayesian estimation in indirect Gaussian sequence space models

In an indirect Gaussian sequence space model lower and upper bounds are derived for the concentration rate of the posterior distribution of the parameter of interest shrinking to the parameter value $\theta^\circ$ that generates the data. While this establishes posterior consistency, the concentration rate depends on both $\theta^\circ$ and a tuning parameter which enters the prior distribution. We first provide an oracle optimal choice of the tuning parameter, i.e., optimized for each $\theta^\circ$ separately. The optimal choice of the prior distribution allows us to derive an oracle optimal concentration rate of the associated posterior distribution. Moreover, for a given class of parameters and a suitable choice of the tuning parameter, we show that the resulting uniform concentration rate over the given class is optimal in a minimax sense. Finally, we construct a hierarchical prior that is adaptive. This means that, given a parameter $\theta^\circ$ or a class of parameters, respectively, the posterior distribution contracts at the oracle rate or at the minimax rate over the class. Notably, the hierarchical prior depends neither on $\theta^\circ$ nor on the given class. Moreover, convergence of the fully data-driven Bayes estimator at the oracle or at the minimax rate is established.

preprint, 2013, arXiv

Adaptive functional linear regression

We consider the estimation of the slope function in functional linear regression, where scalar responses are modeled in dependence of random functions. Cardot and Johannes [J. Multivariate Anal. 101 (2010) 395-408] have shown that a thresholded projection estimator can attain up to a constant minimax-rates of convergence in a general framework which allows us to cover the prediction problem with respect to the mean squared prediction error as well as the estimation of the slope function and its derivatives. This estimation procedure, however, requires an optimal choice of a tuning parameter with regard to certain characteristics of the slope function and the covariance operator associated with the functional regressor. As this information is usually inaccessible in practice, we investigate a fully data-driven choice of the tuning parameter which combines model selection and Lepski's method. It is inspired by the recent work of Goldenshluger and Lepski [Ann. Statist. 39 (2011) 1608-1632]. The tuning parameter is selected as minimizer of a stochastic penalized contrast function imitating Lepski's method among a random collection of admissible values. This choice of the tuning parameter depends only on the data and we show that within the general framework the resulting data-driven thresholded projection estimator can attain minimax-rates up to a constant over a variety of classes of slope functions and covariance operators. The results are illustrated considering different configurations which cover in particular the prediction problem as well as the estimation of the slope and its derivatives. A simulation study shows the reasonable performance of the fully data-driven estimation procedure.

preprint, 2013, arXiv

Iterative Estimation of Solutions to Noisy Nonlinear Operator Equations in Nonparametric Instrumental Regression

This paper discusses the solution of nonlinear integral equations with noisy integral kernels as they appear in nonparametric instrumental regression. We propose a regularized Newton-type iteration and establish convergence and convergence rate results. A particular emphasis is on instrumental regression models where the usual conditional mean assumption is replaced by a stronger independence assumption. We demonstrate for the case of a binary instrument that our approach allows the correct estimation of regression functions which are not identifiable with the standard model. This is illustrated in computed examples with simulated data.

preprint, 2012, arXiv

Adaptive Gaussian inverse regression with partially unknown operator

This work deals with the ill-posed inverse problem of reconstructing a function $f$ given implicitly as the solution of $g = Af$, where $A$ is a compact linear operator with unknown singular values and known eigenfunctions. We observe the function $g$ and the singular values of the operator subject to Gaussian white noise with respective noise levels $\varepsilon$ and $\sigma$. We develop a minimax theory in terms of both noise levels and propose an orthogonal series estimator attaining the minimax rates. This estimator requires the optimal choice of a dimension parameter depending on certain characteristics of $f$ and $A$. This work addresses the fully data-driven choice of the dimension parameter combining model selection with Lepski's method. We show that the fully data-driven estimator preserves minimax optimality over a wide range of classes for $f$ and $A$ and noise levels $\varepsilon$ and $\sigma$. The results are illustrated considering Sobolev spaces and mildly and severely ill-posed inverse problems.
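To make the idea of an orthogonal series estimator with a dimension parameter concrete, here is a minimal Python sketch in coefficient (sequence) form. It assumes noisy coefficients `y` of $g$ and noisy singular values `x` of $A$, and it keeps only the first `m` coefficients whose observed singular value is not dominated by its noise level. The function name `series_estimator` and the threshold rule `x**2 >= sigma**2` are assumptions chosen for illustration and may differ from the estimator in the preprint. The data-driven choice of `m` is not shown.

```python
import numpy as np


def series_estimator(y, x, m, sigma):
    """Estimate the coefficients of f from noisy coefficients of g = Af.

    y     : noisy coefficients of g
    x     : noisy observations of the singular values of A
    m     : dimension parameter (number of leading coefficients kept)
    sigma : noise level of the singular-value observations

    Coefficients beyond the first m, or with x_j**2 < sigma**2 (assumed
    threshold rule), are set to zero so that division by a singular value
    that is mostly noise is avoided.
    """
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    coef = np.zeros_like(y)
    keep = (np.arange(y.size) < m) & (x ** 2 >= sigma ** 2)
    coef[keep] = y[keep] / x[keep]
    return coef
```

A small `m` gives a stable but biased reconstruction, while a large `m` divides by small, noisy singular values. This is the trade-off that the dimension parameter controls.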

preprint, 2012, arXiv

Partially adaptive nonparametric instrumental regression

We consider the problem of estimating the structural function in nonparametric instrumental regression, where in the presence of an instrument W a response Y is modeled in dependence of an endogenous explanatory variable Z. The proposed estimator is based on dimension reduction and additional thresholding. The minimax optimal rate of convergence of the estimator is derived assuming that the structural function belongs to some ellipsoids which are in a certain sense linked to the conditional expectation operator of Z given W. We illustrate these results by considering classical smoothness assumptions. However, the proposed estimator requires an optimal choice of a dimension parameter depending on certain characteristics of the unknown structural function and the conditional expectation operator of Z given W, which are not known in practice. The main issue addressed in our work is an adaptive choice of this dimension parameter using a model selection approach under the restriction that the conditional expectation operator of Z given W is smoothing in a certain sense. In this situation we develop a penalized minimum contrast estimator with randomized penalty and collection of models. We show that this data-driven estimator can attain the lower risk bound up to a constant over a wide range of smoothness classes for the structural function.

preprint, 2011, arXiv

Adaptive estimation of functionals in nonparametric instrumental regression

We consider the problem of estimating the value l(ϕ) of a linear functional, where the structural function ϕ models a nonparametric relationship in presence of instrumental variables. We propose a plug-in estimator which is based on a dimension reduction technique and additional thresholding. It is shown that this estimator is consistent and can attain the minimax optimal rate of convergence under additional regularity conditions. This, however, requires an optimal choice of the dimension parameter m depending on certain characteristics of the structural function ϕ and the joint distribution of the regressor and the instrument, which are unknown in practice. We propose a fully data driven choice of m which combines model selection and Lepski's method. We show that the adaptive estimator attains the optimal rate of convergence up to a logarithmic factor. The theory in this paper is illustrated by considering classical smoothness assumptions and we discuss examples such as pointwise estimation or estimation of averages of the structural function ϕ.

preprint, 2011, arXiv

Adaptive estimation of linear functionals in functional linear models

We consider the estimation of the value of a linear functional of the slope parameter in functional linear regression, where scalar responses are modeled in dependence of random functions. In Johannes and Schenk [2010] it has been shown that a plug-in estimator based on dimension reduction and additional thresholding can attain minimax optimal rates of convergence up to a constant. However, this estimation procedure requires an optimal choice of a tuning parameter with regard to certain characteristics of the slope function and the covariance operator associated with the functional regressor. As these are unknown in practice, we investigate a fully data-driven choice of the tuning parameter based on a combination of model selection and Lepski's method, which is inspired by the recent work of Goldenshluger and Lepski [2011]. The tuning parameter is selected as the minimizer of a stochastic penalized contrast function imitating Lepski's method among a random collection of admissible values. We show that this adaptive procedure attains the lower bound for the minimax risk up to a logarithmic factor over a wide range of classes of slope functions and covariance operators. In particular, our theory covers point-wise estimation as well as the estimation of local averages of the slope parameter.

preprint, 2009, arXiv

Adaptive estimation in circular functional linear models

We consider the problem of estimating the slope parameter in circular functional linear regression, where scalar responses Y1,...,Yn are modeled in dependence of 1-periodic, second order stationary random functions X1,...,Xn. We consider an orthogonal series estimator of the slope function, by replacing the first m theoretical coefficients of its development in the trigonometric basis by adequate estimators. We propose a model selection procedure for m in a set of admissible values, by defining a contrast function minimized by our estimator and a theoretical penalty function; this first step assumes the degree of ill-posedness to be known. Then we generalize the procedure to a random set of admissible m's and a random penalty function. The resulting estimator is completely data-driven and automatically reaches what is known to be the optimal minimax rate of convergence, in terms of a general weighted L2-risk. This means that we provide adaptive estimators of both the slope function and its derivatives.
