Researcher profile

Jan Johannes

Jan Johannes contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
2topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2022arXiv

Adaptive pointwise density estimation under local differential privacy

We consider the estimation of a density at a fixed point under a local differential privacy constraint, where the observations are anonymised before being available for statistical inference. We propose both a privatised version of a projection density estimator as well as a kernel density estimator and derive their minimax rates under a privacy constraint. There is a twofold deterioration of the minimax rates due to the anonymisation, which we show to be unavoidable by providing lower bounds. In both estimation procedures a tuning parameter has to be chosen. We suggest a variant of the classical Goldenshluger-Lepski method for choosing the bandwidth and the cut-off dimension, respectively, and analyse its performance. It provides adaptive minimax-optimal (up to log-factors) estimators. We discuss in detail how the lower and upper bound depend on the privacy constraints, which in turn is reflected by a modification of the adaptive method.

preprint2021arXiv

Data-driven aggregation in circular deconvolution

In a circular deconvolution model we consider the fully data driven density estimation of a circular random variable where the density of the additive independent measurement error is unknown. We have at hand two independent iid samples, one of the contaminated version of the variable of interest, and the other of the additive noise. We show optimality,in an oracle and minimax sense, of a fully data-driven weighted sum of orthogonal series density estimators. Two shapes of random weights are considered, one motivated by a Bayesian approach and the other by a well known model selection method. We derive non-asymptotic upper bounds for the quadratic risk and the maximal quadratic risk over Sobolev-like ellipsoids of the fully data-driven estimator. We compute rates which can be obtained in different configurations for the smoothness of the density of interest and the error density. The rates (strictly) match the optimal oracle or minimax rates for a large variety of cases, and feature otherwise at most a deterioration by a logarithmic factor. We illustrate the performance of the fully data-driven weighted sum of orthogonal series estimators by a simulation study.

preprint2020arXiv

Adaptive minimax testing for circular convolution

Given observations from a circular random variable contaminated by an additive measurement error, we consider the problem of minimax optimal goodness-of-fit testing in a non-asymptotic framework. We propose direct and indirect testing procedures using a projection approach. The structure of the optimal tests depends on regularity and ill-posedness parameters of the model, which are unknown in practice. Therefore, adaptive testing strategies that perform optimally over a wide range of regularity and ill-posedness classes simultaneously are investigated. Considering a multiple testing procedure, we obtain adaptive i.e. assumption-free procedures and analyse their performance. Compared with the non-adaptive tests, their radii of testing face a deterioration by a log-factor. We show that for testing of uniformity this loss is unavoidable by providing a lower bound. The results are illustrated considering Sobolev spaces and ordinary or super smooth error densities.

preprint2020arXiv

Adaptive minimax testing in inverse Gaussian sequence space models

In the inverse Gaussian sequence space model with additional noisy observations of the operator, we derive nonasymptotic minimax radii of testing for ellipsoid-type alternatives simultaneously for both the signal detection problem (testing against zero) and the goodness-of-fit testing problem (testing against a prescribed sequence) without any regularity assumption on the null hypothesis. The radii are the maximum of two terms, each of which only depends on one of the noise levels. Interestingly, the term involving the noise level of the operator explicitly depends on the null hypothesis and vanishes in the signal detection case. The minimax radii are established by first showing a lower bound for arbitrary null hypotheses and noise levels. For the upper bound we consider two testing procedures, a direct test based on estimating the energy in the image space and an indirect test. Under mild assumptions, we prove that the testing radius of the indirect test achieves the lower bound, which shows the minimax optimality of the radius and the test. We highlight the assumptions under which the direct test also performs optimally. Furthermore, we apply a classical Bonferroni method for making both the indirect and the direct test adaptive with respect to the regularity of the alternative. The radii of the adaptive tests are deteriorated by an additional log-factor, which we show to be unavoidable. The results are illustrated considering Sobolev spaces and mildly or severely ill-posed inverse problems.

preprint2020arXiv

Data-driven aggregation in non-parametric density estimation on the real line

We study non-parametric estimation of an unknown density with support in R (respectively R+). The proposed estimation procedure is based on the projection on finite dimensional subspaces spanned by the Hermite (respectively the Laguerre) functions. The focus of this paper is to introduce a data-driven aggregation approach in order to deal with the upcoming bias-variance trade-off. Our novel procedure integrates the usual model selection method as a limit case. We show the oracle- and the minimax-optimality of the data-driven aggregated density estimator and hence its adaptivity. We present results of a simulation study which allow to compare the finite sample performance of the data-driven estimators using model selection compared to the new aggregation.

preprint2020arXiv

Minimax testing and quadratic functional estimation for circular convolution

In a circular convolution model, we aim to infer on the density of a circular random variable using observations contaminated by an additive measurement error. We highlight the interplay of the two problems: optimal testing and quadratic functional estimation. Under general regularity assumptions, we determine an upper bound for the minimax risk of estimation for the quadratic functional. The upper bound consists of two terms, one that mimics a classical bias-variance trade-off and a second that causes the typical elbow effect in quadratic functional estimation. Using a minimax optimal estimator of the quadratic functional as a test statistic, we derive an upper bound for the nonasymptotic minimax radius of testing for nonparametric alternatives. Interestingly, the term causing the elbow effect in the estimation case vanishes in the radius of testing. We provide a matching lower bound for the testing problem. By showing that any lower bound for the testing problem also yields a lower bound for the quadratic functional estimation problem, we obtain a lower bound for the risk of estimation. Lastly, we prove a matching lower bound for the term causing the elbow effect in the estimation problem. The results are illustrated considering Sobolev spaces and ordinary or super smooth error densities.

preprint2020arXiv

Spectral cut-off regularisation for density estimation under multiplicative measurement errors

We study the non-parametric estimation of an unknown density f with support on R+ based on an i.i.d. sample with multiplicative measurement errors. The proposed fully data driven procedure is based on the estimation of the Mellin transform of the density f , a regularisation of the inverse of the Mellin transform by a spectral cut-off and a data-driven model selection in order to deal with the upcoming bias-variance trade-off. We introduce and discuss further Mellin-Sobolev spaces which characterize the regularity of the unknown density f through the decay of its Mellin transform. Additionally, we show minimax-optimality over Mellin-Sobolev spaces of the data-driven density estimator and hence its adaptivity.

preprint2013arXiv

Adaptive functional linear regression

We consider the estimation of the slope function in functional linear regression, where scalar responses are modeled in dependence of random functions. Cardot and Johannes [J. Multivariate Anal. 101 (2010) 395-408] have shown that a thresholded projection estimator can attain up to a constant minimax-rates of convergence in a general framework which allows us to cover the prediction problem with respect to the mean squared prediction error as well as the estimation of the slope function and its derivatives. This estimation procedure, however, requires an optimal choice of a tuning parameter with regard to certain characteristics of the slope function and the covariance operator associated with the functional regressor. As this information is usually inaccessible in practice, we investigate a fully data-driven choice of the tuning parameter which combines model selection and Lepski's method. It is inspired by the recent work of Goldenshluger and Lepski [Ann. Statist. 39 (2011) 1608-1632]. The tuning parameter is selected as minimizer of a stochastic penalized contrast function imitating Lepski's method among a random collection of admissible values. This choice of the tuning parameter depends only on the data and we show that within the general framework the resulting data-driven thresholded projection estimator can attain minimax-rates up to a constant over a variety of classes of slope functions and covariance operators. The results are illustrated considering different configurations which cover in particular the prediction problem as well as the estimation of the slope and its derivatives. A simulation study shows the reasonable performance of the fully data-driven estimation procedure.

preprint2012arXiv

Adaptive Gaussian inverse regression with partially unknown operator

This work deals with the ill-posed inverse problem of reconstructing a function $f$ given implicitly as the solution of $g = Af$, where $A$ is a compact linear operator with unknown singular values and known eigenfunctions. We observe the function $g$ and the singular values of the operator subject to Gaussian white noise with respective noise levels $\varepsilon$ and $σ$. We develop a minimax theory in terms of both noise levels and propose an orthogonal series estimator attaining the minimax rates. This estimator requires the optimal choice of a dimension parameter depending on certain characteristics of $f$ and $A$. This work addresses the fully data-driven choice of the dimension parameter combining model selection with Lepski's method. We show that the fully data-driven estimator preserves minimax optimality over a wide range of classes for $f$ and $A$ and noise levels $\varepsilon$ and $σ$. The results are illustrated considering Sobolev spaces and mildly and severely ill-posed inverse problems.

preprint2011arXiv

Adaptive estimation of functionals in nonparametric instrumental regression

We consider the problem of estimating the value l(ϕ) of a linear functional, where the structural function ϕ models a nonparametric relationship in presence of instrumental variables. We propose a plug-in estimator which is based on a dimension reduction technique and additional thresholding. It is shown that this estimator is consistent and can attain the minimax optimal rate of convergence under additional regularity conditions. This, however, requires an optimal choice of the dimension parameter m depending on certain characteristics of the structural function ϕ and the joint distribution of the regressor and the instrument, which are unknown in practice. We propose a fully data driven choice of m which combines model selection and Lepski's method. We show that the adaptive estimator attains the optimal rate of convergence up to a logarithmic factor. The theory in this paper is illustrated by considering classical smoothness assumptions and we discuss examples such as pointwise estimation or estimation of averages of the structural function ϕ.

preprint2011arXiv

Adaptive estimation of linear functionals in functional linear models

We consider the estimation of the value of a linear functional of the slope parameter in functional linear regression, where scalar responses are modeled in dependence of random functions. In Johannes and Schenk [2010] it has been shown that a plug-in estimator based on dimension reduction and additional thresholding can attain minimax optimal rates of convergence up to a constant. However, this estimation procedure requires an optimal choice of a tuning parameter with regard to certain characteristics of the slope function and the covariance operator associated with the functional regressor. As these are unknown in practice, we investigate a fully data-driven choice of the tuning parameter based on a combination of model selection and Lepski's method, which is inspired by the recent work of Goldenshluger and Lepski [2011]. The tuning parameter is selected as the minimizer of a stochastic penalized contrast function imitating Lepski's method among a random collection of admissible values. We show that this adaptive procedure attains the lower bound for the minimax risk up to a logarithmic factor over a wide range of classes of slope functions and covariance operators. In particular, our theory covers point-wise estimation as well as the estimation of local averages of the slope parameter.

preprint2009arXiv

Adaptive estimation in circular functional linear models

We consider the problem of estimating the slope parameter in circular functional linear regression, where scalar responses Y1,...,Yn are modeled in dependence of 1-periodic, second order stationary random functions X1,...,Xn. We consider an orthogonal series estimator of the slope function, by replacing the first m theoretical coefficients of its development in the trigonometric basis by adequate estimators. Wepropose a model selection procedure for m in a set of admissible values, by defining a contrast function minimized by our estimator and a theoretical penalty function; this first step assumes the degree of ill posedness to be known. Then we generalize the procedure to a random set of admissible m's and a random penalty function. The resulting estimator is completely data driven and reaches automatically what is known to be the optimal minimax rate of convergence, in term of a general weighted L2-risk. This means that we provide adaptive estimators of both the slope function and its derivatives.