Source author record

Richard Nickl

Richard Nickl appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.ST Statistics Theory math.PR math.AP math.NA Methodology Numerical Analysis Computation math.FA

Catalog footprint

What is connected

25works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

On gradient stability in nonlinear PDE models and inference in interacting particle systems

We consider general parameter to solution maps $θ\mapsto \mathcal G(θ)$ of non-linear partial differential equations and describe an approach based on a Banach space version of the implicit function theorem to verify the gradient stability condition of Nickl&Wang (JEMS 2024) for the underlying non-linear inverse problem, providing also injectivity estimates and corresponding statistical identifiability results. We illustrate our methods in two examples involving a non-linear reaction diffusion system as well as a McKean--Vlasov interacting particle model, both with periodic boundary conditions. We apply our results to prove the polynomial time convergence of a Langevin-type algorithm sampling the posterior measure of the interaction potential arising from a discrete aggregate measurement of the interacting particle system.

preprint2022arXiv

On polynomial-time computation of high-dimensional posterior measures by Langevin-type algorithms

The problem of generating random samples of high-dimensional posterior distributions is considered. The main results consist of non-asymptotic computational guarantees for Langevin-type MCMC algorithms which scale polynomially in key quantities such as the dimension of the model, the desired precision level, and the number of available statistical measurements. As a direct consequence, it is shown that posterior mean vectors as well as optimisation based maximum a posteriori (MAP) estimates are computable in polynomial time, with high probability under the distribution of the data. These results are complemented by statistical guarantees for recovery of the ground truth parameter generating the data. Our results are derived in a general high-dimensional non-linear regression setting (with Gaussian process priors) where posterior measures are not necessarily log-concave, employing a set of local `geometric' assumptions on the parameter space, and assuming that a good initialiser of the algorithm is available. The theory is applied to a representative non-linear example from PDEs involving a steady-state Schrödinger equation.

preprint2020arXiv

Consistency of Bayesian inference with Gaussian process priors in an elliptic inverse problem

For $\mathcal{O}$ a bounded domain in $\mathbb{R}^d$ and a given smooth function $g:\mathcal{O}\to\mathbb{R}$, we consider the statistical nonlinear inverse problem of recovering the conductivity $f>0$ in the divergence form equation $$ \nabla\cdot(f\nabla u)=g\ \textrm{on}\ \mathcal{O}, \quad u=0\ \textrm{on}\ \partial\mathcal{O}, $$ from $N$ discrete noisy point evaluations of the solution $u=u_f$ on $\mathcal O$. We study the statistical performance of Bayesian nonparametric procedures based on a flexible class of Gaussian (or hierarchical Gaussian) process priors, whose implementation is feasible by MCMC methods. We show that, as the number $N$ of measurements increases, the resulting posterior distributions concentrate around the true parameter generating the data, and derive a convergence rate $N^{-λ}, λ>0,$ for the reconstruction error of the associated posterior means, in $L^2(\mathcal{O})$-distance.

preprint2020arXiv

Consistent Inversion of Noisy Non-Abelian X-Ray Transforms

For $M$ a simple surface, the non-linear statistical inverse problem of recovering a matrix field $Φ: M \to \mathfrak{so}(n)$ from discrete, noisy measurements of the $SO(n)$-valued scattering data $C_Φ$ of a solution of a matrix ODE is considered ($n\geq 2$). Injectivity of the map $Φ\mapsto C_Φ$ was established by [Paternain, Salo, Uhlmann; Geom.Funct.Anal. 2012]. A statistical algorithm for the solution of this inverse problem based on Gaussian process priors is proposed, and it is shown how it can be implemented by infinite-dimensional MCMC methods. It is further shown that as the number $N$ of measurements of point-evaluations of $C_Φ$ increases, the statistical error in the recovery of $Φ$ converges to zero in $L^2(M)$-distance at a rate that is algebraic in $1/N$, and approaches $1/\sqrt N$ for smooth matrix fields $Φ$. The proof relies, among other things, on a new stability estimate for the inverse map $C_Φ\to Φ$. Key applications of our results are discussed in the case $n=3$ to polarimetric neutron tomography, see [Desai et al., Nature Sc.Rep. 2018] and [Hilger et al., Nature Comm. 2018]

preprint2020arXiv

On statistical Calderón problems

For $D$ a bounded domain in $\mathbb R^d, d \ge 2,$ with smooth boundary $\partial D$, the non-linear inverse problem of recovering the unknown conductivity $γ$ determining solutions $u=u_{γ, f}$ of the partial differential equation \begin{equation*} \begin{split} \nabla \cdot(γ\nabla u)&=0 \quad \text{ in }D, \\ u&=f \quad \text { on } \partial D, \end{split} \end{equation*} from noisy observations $Y$ of the Dirichlet-to-Neumann map \[f \mapsto Λ_γ(f) = {γ\frac{\partial u_{γ,f}}{\partial ν}}\Big|_{\partial D},\] with $\partial/\partial ν$ denoting the outward normal derivative, is considered. The data $Y$ consists of $Λ_γ$ corrupted by additive Gaussian noise at noise level $\varepsilon>0$, and a statistical algorithm $\hat γ(Y)$ is constructed which is shown to recover $γ$ in supremum-norm loss at a statistical convergence rate of the order $\log(1/\varepsilon)^{-δ}$ as $\varepsilon \to 0$. It is further shown that this convergence rate is optimal, up to the precise value of the exponent $δ>0$, in an information theoretic sense. The estimator $\hat γ(Y)$ has a Bayesian interpretation in terms of the posterior mean of a suitable Gaussian process prior and can be computed by MCMC methods.

preprint2019arXiv

Nonparametric statistical inference for drift vector fields of multi-dimensional diffusions

The problem of determining a periodic Lipschitz vector field $b=(b_1, \dots, b_d)$ from an observed trajectory of the solution $(X_t: 0 \le t \le T)$ of the multi-dimensional stochastic differential equation \begin{equation*} dX_t = b(X_t)dt + dW_t, \quad t \geq 0, \end{equation*} where $W_t$ is a standard $d$-dimensional Brownian motion, is considered. Convergence rates of a penalised least squares estimator, which equals the maximum a posteriori (MAP) estimate corresponding to a high-dimensional Gaussian product prior, are derived. These results are deduced from corresponding contraction rates for the associated posterior distributions. The rates obtained are optimal up to log-factors in $L^2$-loss in any dimension, and also for supremum norm loss when $d \le 4$. Further, when $d \le 3$, nonparametric Bernstein-von Mises theorems are proved for the posterior distributions of $b$. From this we deduce functional central limit theorems for the implied estimators of the invariant measure $μ_b$. The limiting Gaussian process distributions have a covariance structure that is asymptotically optimal from an information-theoretic point of view.

preprint2016arXiv

Inference on covariance operators via concentration inequalities: k-sample tests, classification, and clustering via Rademacher complexities

We propose a novel approach to the analysis of covariance operators making use of concentration inequalities. First, non-asymptotic confidence sets are constructed for such operators. Then, subsequent applications including a k sample test for equality of covariance, a functional data classifier, and an expectation-maximization style clustering algorithm are derived and tested on both simulated and phoneme data.

preprint2016arXiv

Nonparametric Bayesian posterior contraction rates for discretely observed scalar diffusions

We consider nonparametric Bayesian inference in a reflected diffusion model $dX_t = b (X_t)dt + σ(X_t) dW_t,$ with discretely sampled observations $X_0, X_Δ, \dots, X_{nΔ}$. We analyse the nonlinear inverse problem corresponding to the `low frequency sampling' regime where $Δ>0$ is fixed and $n \to \infty$. A general theorem is proved that gives conditions for prior distributions $Π$ on the diffusion coefficient $σ$ and the drift function $b$ that ensure minimax optimal contraction rates of the posterior distribution over Hölder-Sobolev smoothness classes. These conditions are verified for natural examples of nonparametric random wavelet series priors. For the proofs we derive new concentration inequalities for empirical processes arising from discretely observed diffusions that are of independent interest.

preprint2015arXiv

A sharp adaptive confidence ball for self-similar functions

In the nonparametric Gaussian sequence space model an $\ell^2$-confidence ball $C_n$ is constructed that adapts to unknown smoothness and Sobolev-norm of the infinite-dimensional parameter to be estimated. The confidence ball has exact and honest asymptotic coverage over appropriately defined `self-similar' parameter spaces. It is shown by information-theoretic methods that this `self-similarity' condition is weakest possible.

preprint2015arXiv

Discussion of "Frequentist coverage of adaptive nonparametric Bayesian credible sets"

Discussion of "Frequentist coverage of adaptive nonparametric Bayesian credible sets" by Szabó, van der Vaart and van Zanten [arXiv:1310.4489v5].

preprint2015arXiv

On signal detection and confidence sets for low rank inference problems

We consider the signal detection problem in the Gaussian design trace regression model with low rank alternative hypotheses. We derive the precise (Ingster-type) detection boundary for the Frobenius and the nuclear norm. We then apply these results to show that honest confidence sets for the unknown matrix parameter that adapt to all low rank sub-models in nuclear norm do not exist. This shows that recently obtained positive results in (Carpentier, Eisert, Gross and Nickl, 2015) for confidence sets in low rank recovery problems are essentially optimal.

preprint2014arXiv

High-frequency Donsker theorems for Lévy measures

Donsker-type functional limit theorems are proved for empirical processes arising from discretely sampled increments of a univariate Lévy process. In the asymptotic regime the sampling frequencies increase to infinity and the limiting object is a Gaussian process that can be obtained from the composition of a Brownian motion with a covariance operator determined by the Lévy measure. The results are applied to derive the asymptotic distribution of natural estimators for the distribution function of the Lévy jump measure. As an application we deduce Kolmogorov-Smirnov type tests and confidence bands.

preprint2014arXiv

On the Bernstein-von Mises phenomenon for nonparametric Bayes procedures

We continue the investigation of Bernstein-von Mises theorems for nonparametric Bayes procedures from [Ann. Statist. 41 (2013) 1999-2028]. We introduce multiscale spaces on which nonparametric priors and posteriors are naturally defined, and prove Bernstein-von Mises theorems for a variety of priors in the setting of Gaussian nonparametric regression and in the i.i.d. sampling model. From these results we deduce several applications where posterior-based inference coincides with efficient frequentist procedures, including Donsker- and Kolmogorov-Smirnov theorems for the random posterior cumulative distribution functions. We also show that multiscale posterior credible bands for the regression or density function are optimal frequentist confidence bands.

preprint2013arXiv

Confidence sets in sparse regression

The problem of constructing confidence sets in the high-dimensional linear model with $n$ response variables and $p$ parameters, possibly $p\ge n$, is considered. Full honest adaptive inference is possible if the rate of sparse estimation does not exceed $n^{-1/4}$, otherwise sparse adaptive confidence sets exist only over strict subsets of the parameter spaces for which sparse estimators exist. Necessary and sufficient conditions for the existence of confidence sets that adapt to a fixed sparsity level of the parameter vector are given in terms of minimal $\ell^2$-separation conditions on the parameter space. The design conditions cover common coherence assumptions used in models for sparsity, including (possibly correlated) sub-Gaussian designs.

preprint2013arXiv

Nonparametric Bernstein-von Mises theorems in Gaussian white noise

Bernstein-von Mises theorems for nonparametric Bayes priors in the Gaussian white noise model are proved. It is demonstrated how such results justify Bayes methods as efficient frequentist inference procedures in a variety of concrete nonparametric problems. Particularly Bayesian credible sets are constructed that have asymptotically exact $1-α$ frequentist coverage level and whose $L^2$-diameter shrinks at the minimax rate of convergence (within logarithmic factors) over Hölder balls. Other applications include general classes of linear and nonlinear functionals and credible bands for auto-convolutions. The assumptions cover nonconjugate product priors defined on general orthonormal bases of $L^2$ satisfying weak conditions.

preprint2012arXiv

A Donsker Theorem for Lévy Measures

Given $n$ equidistant realisations of a Lévy process $(L_t,\,t\ge 0)$, a natural estimator $\hat N_n$ for the distribution function $N$ of the Lévy measure is constructed. Under a polynomial decay restriction on the characteristic function $ϕ$, a Donsker-type theorem is proved, that is, a functional central limit theorem for the process $\sqrt n (\hat N_n -N)$ in the space of bounded functions away from zero. The limit distribution is a generalised Brownian bridge process with bounded and continuous sample paths whose covariance structure depends on the Fourier-integral operator ${\cal F}^{-1}[1/ϕ(-\cdot)]$. The class of Lévy processes covered includes several relevant examples such as compound Poisson, Gamma and self-decomposable processes. Main ideas in the proof include establishing pseudo-locality of the Fourier-integral operator and recent techniques from smoothed empirical processes.

preprint2012arXiv

Adaptive confidence sets in L^2

The problem of constructing confidence sets that are adaptive in L^2-loss over a continuous scale of Sobolev classes of probability densities is considered. Adaptation holds, where possible, with respect to both the radius of the Sobolev ball and its smoothness degree, and over maximal parameter spaces for which adaptation is possible. Two key regimes of parameter constellations are identified: one where full adaptation is possible, and one where adaptation requires critical regions be removed. Techniques used to derive these results include a general nonparametric minimax test for infinite-dimensional null- and alternative hypotheses, and new lower bounds for L^2-adaptive confidence sets.

preprint2012arXiv

On adaptive inference and confidence bands

The problem of existence of adaptive confidence bands for an unknown density $f$ that belongs to a nested scale of Hölder classes over $\mathbb{R}$ or $[0,1]$ is considered. Whereas honest adaptive inference in this problem is impossible already for a pair of Hölder balls $Σ(r),Σ(s),r\ne s$, of fixed radius, a nonparametric distinguishability condition is introduced under which adaptive confidence bands can be shown to exist. It is further shown that this condition is necessary and sufficient for the existence of honest asymptotic confidence bands, and that it is strictly weaker than similar analytic conditions recently employed in Giné and Nickl [Ann. Statist. 38 (2010) 1122--1170]. The exceptional sets for which honest inference is not possible have vanishingly small probability under natural priors on Hölder balls $Σ(s)$. If no upper bound for the radius of the Hölder balls is known, a price for adaptation has to be paid, and near-optimal adaptation is possible for standard procedures. The implications of these findings for a general theory of adaptive inference are discussed.

preprint2012arXiv

Rates of contraction for posterior distributions in $\bolds{L^r}$-metrics, $\bolds{1\le r\le\infty}$

The frequentist behavior of nonparametric Bayes estimates, more specifically, rates of contraction of the posterior distributions to shrinking $L^r$-norm neighborhoods, $1\le r\le\infty$, of the unknown parameter, are studied. A theorem for nonparametric density estimation is proved under general approximation-theoretic assumptions on the prior. The result is applied to a variety of common examples, including Gaussian process, wavelet series, normal mixture and histogram priors. The rates of contraction are minimax-optimal for $1\le r\le2$, but deteriorate as $r$ increases beyond 2. In the case of Gaussian nonparametric regression a Gaussian prior is devised for which the posterior contracts at the optimal rate in all $L^r$-norms, $1\le r\le\infty$.

preprint2012arXiv

Spatially Adaptive Density Estimation by Localised Haar Projections

Given a random sample from some unknown density $f_0: \mathbb R \to [0, \infty)$ we devise Haar wavelet estimators for $f_0$ with variable resolution levels constructed from localised test procedures (as in Lepski, Mammen, and Spokoiny (1997, Ann. Statist.)). We show that these estimators adapt to spatially heterogeneous smoothness of $f_0$, simultaneously for every point $x$ in a fixed interval, in sup-norm loss. The thresholding constants involved in the test procedures can be chosen in practice under the idealised assumption that the true density is locally constant in a neighborhood of the point $x$ of estimation, and an information theoretic justification of this practice is given.

preprint2011arXiv

Adaptive estimation of a distribution function and its density in sup-norm loss by wavelet and spline projections

Given an i.i.d. sample from a distribution $F$ on $\mathbb{R}$ with uniformly continuous density $p_0$, purely data-driven estimators are constructed that efficiently estimate $F$ in sup-norm loss and simultaneously estimate $p_0$ at the best possible rate of convergence over Hölder balls, also in sup-norm loss. The estimators are obtained by applying a model selection procedure close to Lepski's method with random thresholds to projections of the empirical measure onto spaces spanned by wavelets or $B$-splines. The random thresholds are based on suprema of Rademacher processes indexed by wavelet or spline projection kernels. This requires Bernstein-type analogs of the inequalities in Koltchinskii [Ann. Statist. 34 (2006) 2593-2656] for the deviation of suprema of empirical processes from their Rademacher symmetrizations.

preprint2011arXiv

Concentration Inequalities and Confidence Bands for Needlet Density Estimators on Compact Homogeneous Manifolds

Let $X_1,...,X_n$ be a random sample from some unknown probability density $f$ defined on a compact homogeneous manifold $\mathbf M$ of dimension $d \ge 1$. Consider a 'needlet frame' $\{ϕ_{j η}\}$ describing a localised projection onto the space of eigenfunctions of the Laplace operator on $\mathbf M$ with corresponding eigenvalues less than $2^{2j}$, as constructed in \cite{GP10}. We prove non-asymptotic concentration inequalities for the uniform deviations of the linear needlet density estimator $f_n(j)$ obtained from an empirical estimate of the needlet projection $\sum_ηϕ_{j η} \int f ϕ_{j η}$ of $f$. We apply these results to construct risk-adaptive estimators and nonasymptotic confidence bands for the unknown density $f$. The confidence bands are adaptive over classes of differentiable and H\"{older}-continuous functions on $\mathbf M$ that attain their Hölder exponents.

preprint2011arXiv

Global uniform risk bounds for wavelet deconvolution estimators

We consider the statistical deconvolution problem where one observes $n$ replications from the model $Y=X+ε$, where $X$ is the unobserved random signal of interest and $ε$ is an independent random error with distribution $ϕ$. Under weak assumptions on the decay of the Fourier transform of $ϕ,$ we derive upper bounds for the finite-sample sup-norm risk of wavelet deconvolution density estimators $f_n$ for the density $f$ of $X$, where $f:\mathbb{R}\to \mathbb{R}$ is assumed to be bounded. We then derive lower bounds for the minimax sup-norm risk over Besov balls in this estimation problem and show that wavelet deconvolution density estimators attain these bounds. We further show that linear estimators adapt to the unknown smoothness of $f$ if the Fourier transform of $ϕ$ decays exponentially and that a corresponding result holds true for the hard thresholding wavelet estimator if $ϕ$ decays polynomially. We also analyze the case where $f$ is a "supersmooth"/analytic density. We finally show how our results and recent techniques from Rademacher processes can be applied to construct global confidence bands for the density $f$.

preprint2010arXiv

Confidence bands in density estimation

Given a sample from some unknown continuous density $f:\mathbb{R}\to\mathbb{R}$, we construct adaptive confidence bands that are honest for all densities in a "generic" subset of the union of $t$-Hölder balls, $0<t\le r$, where $r$ is a fixed but arbitrary integer. The exceptional ("nongeneric") set of densities for which our results do not hold is shown to be nowhere dense in the relevant Hölder-norm topologies. In the course of the proofs we also obtain limit theorems for maxima of linear wavelet and kernel density estimators, which are of independent interest.

preprint2010arXiv

Efficient Simulation-Based Minimum Distance Estimation and Indirect Inference

Given a random sample from a parametric model, we show how indirect inference estimators based on appropriate nonparametric density estimators (i.e., simulation-based minimum distance estimators) can be constructed that, under mild assumptions, are asymptotically normal with variance-covarince matrix equal to the Cramer-Rao bound.

Richard Nickl

What is connected

Connect this record

See the researcher in context

Building this map preview

25 published item(s)

On gradient stability in nonlinear PDE models and inference in interacting particle systems

On polynomial-time computation of high-dimensional posterior measures by Langevin-type algorithms

Consistency of Bayesian inference with Gaussian process priors in an elliptic inverse problem

Consistent Inversion of Noisy Non-Abelian X-Ray Transforms

On statistical Calderón problems

Nonparametric statistical inference for drift vector fields of multi-dimensional diffusions

Inference on covariance operators via concentration inequalities: k-sample tests, classification, and clustering via Rademacher complexities

Nonparametric Bayesian posterior contraction rates for discretely observed scalar diffusions

A sharp adaptive confidence ball for self-similar functions

Discussion of "Frequentist coverage of adaptive nonparametric Bayesian credible sets"

On signal detection and confidence sets for low rank inference problems

High-frequency Donsker theorems for Lévy measures

On the Bernstein-von Mises phenomenon for nonparametric Bayes procedures

Confidence sets in sparse regression

Nonparametric Bernstein-von Mises theorems in Gaussian white noise

A Donsker Theorem for Lévy Measures

Adaptive confidence sets in L^2

On adaptive inference and confidence bands

Rates of contraction for posterior distributions in $\bolds{L^r}$-metrics, $\bolds{1\le r\le\infty}$

Spatially Adaptive Density Estimation by Localised Haar Projections

Adaptive estimation of a distribution function and its density in sup-norm loss by wavelet and spline projections

Concentration Inequalities and Confidence Bands for Needlet Density Estimators on Compact Homogeneous Manifolds

Global uniform risk bounds for wavelet deconvolution estimators

Confidence bands in density estimation

Efficient Simulation-Based Minimum Distance Estimation and Indirect Inference