Researcher profile

Richard Nickl

Richard Nickl contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2026arXiv

On gradient stability in nonlinear PDE models and inference in interacting particle systems

We consider general parameter to solution maps $θ\mapsto \mathcal G(θ)$ of non-linear partial differential equations and describe an approach based on a Banach space version of the implicit function theorem to verify the gradient stability condition of Nickl&Wang (JEMS 2024) for the underlying non-linear inverse problem, providing also injectivity estimates and corresponding statistical identifiability results. We illustrate our methods in two examples involving a non-linear reaction diffusion system as well as a McKean--Vlasov interacting particle model, both with periodic boundary conditions. We apply our results to prove the polynomial time convergence of a Langevin-type algorithm sampling the posterior measure of the interaction potential arising from a discrete aggregate measurement of the interacting particle system.

preprint2022arXiv

On polynomial-time computation of high-dimensional posterior measures by Langevin-type algorithms

The problem of generating random samples of high-dimensional posterior distributions is considered. The main results consist of non-asymptotic computational guarantees for Langevin-type MCMC algorithms which scale polynomially in key quantities such as the dimension of the model, the desired precision level, and the number of available statistical measurements. As a direct consequence, it is shown that posterior mean vectors as well as optimisation based maximum a posteriori (MAP) estimates are computable in polynomial time, with high probability under the distribution of the data. These results are complemented by statistical guarantees for recovery of the ground truth parameter generating the data. Our results are derived in a general high-dimensional non-linear regression setting (with Gaussian process priors) where posterior measures are not necessarily log-concave, employing a set of local `geometric' assumptions on the parameter space, and assuming that a good initialiser of the algorithm is available. The theory is applied to a representative non-linear example from PDEs involving a steady-state Schrödinger equation.

preprint2020arXiv

Consistency of Bayesian inference with Gaussian process priors in an elliptic inverse problem

For $\mathcal{O}$ a bounded domain in $\mathbb{R}^d$ and a given smooth function $g:\mathcal{O}\to\mathbb{R}$, we consider the statistical nonlinear inverse problem of recovering the conductivity $f>0$ in the divergence form equation $$ \nabla\cdot(f\nabla u)=g\ \textrm{on}\ \mathcal{O}, \quad u=0\ \textrm{on}\ \partial\mathcal{O}, $$ from $N$ discrete noisy point evaluations of the solution $u=u_f$ on $\mathcal O$. We study the statistical performance of Bayesian nonparametric procedures based on a flexible class of Gaussian (or hierarchical Gaussian) process priors, whose implementation is feasible by MCMC methods. We show that, as the number $N$ of measurements increases, the resulting posterior distributions concentrate around the true parameter generating the data, and derive a convergence rate $N^{-λ}, λ>0,$ for the reconstruction error of the associated posterior means, in $L^2(\mathcal{O})$-distance.

preprint2020arXiv

Consistent Inversion of Noisy Non-Abelian X-Ray Transforms

For $M$ a simple surface, the non-linear statistical inverse problem of recovering a matrix field $Φ: M \to \mathfrak{so}(n)$ from discrete, noisy measurements of the $SO(n)$-valued scattering data $C_Φ$ of a solution of a matrix ODE is considered ($n\geq 2$). Injectivity of the map $Φ\mapsto C_Φ$ was established by [Paternain, Salo, Uhlmann; Geom.Funct.Anal. 2012]. A statistical algorithm for the solution of this inverse problem based on Gaussian process priors is proposed, and it is shown how it can be implemented by infinite-dimensional MCMC methods. It is further shown that as the number $N$ of measurements of point-evaluations of $C_Φ$ increases, the statistical error in the recovery of $Φ$ converges to zero in $L^2(M)$-distance at a rate that is algebraic in $1/N$, and approaches $1/\sqrt N$ for smooth matrix fields $Φ$. The proof relies, among other things, on a new stability estimate for the inverse map $C_Φ\to Φ$. Key applications of our results are discussed in the case $n=3$ to polarimetric neutron tomography, see [Desai et al., Nature Sc.Rep. 2018] and [Hilger et al., Nature Comm. 2018]

preprint2020arXiv

On statistical Calderón problems

For $D$ a bounded domain in $\mathbb R^d, d \ge 2,$ with smooth boundary $\partial D$, the non-linear inverse problem of recovering the unknown conductivity $γ$ determining solutions $u=u_{γ, f}$ of the partial differential equation \begin{equation*} \begin{split} \nabla \cdot(γ\nabla u)&=0 \quad \text{ in }D, \\ u&=f \quad \text { on } \partial D, \end{split} \end{equation*} from noisy observations $Y$ of the Dirichlet-to-Neumann map \[f \mapsto Λ_γ(f) = {γ\frac{\partial u_{γ,f}}{\partial ν}}\Big|_{\partial D},\] with $\partial/\partial ν$ denoting the outward normal derivative, is considered. The data $Y$ consists of $Λ_γ$ corrupted by additive Gaussian noise at noise level $\varepsilon>0$, and a statistical algorithm $\hat γ(Y)$ is constructed which is shown to recover $γ$ in supremum-norm loss at a statistical convergence rate of the order $\log(1/\varepsilon)^{-δ}$ as $\varepsilon \to 0$. It is further shown that this convergence rate is optimal, up to the precise value of the exponent $δ>0$, in an information theoretic sense. The estimator $\hat γ(Y)$ has a Bayesian interpretation in terms of the posterior mean of a suitable Gaussian process prior and can be computed by MCMC methods.

preprint2019arXiv

Nonparametric statistical inference for drift vector fields of multi-dimensional diffusions

The problem of determining a periodic Lipschitz vector field $b=(b_1, \dots, b_d)$ from an observed trajectory of the solution $(X_t: 0 \le t \le T)$ of the multi-dimensional stochastic differential equation \begin{equation*} dX_t = b(X_t)dt + dW_t, \quad t \geq 0, \end{equation*} where $W_t$ is a standard $d$-dimensional Brownian motion, is considered. Convergence rates of a penalised least squares estimator, which equals the maximum a posteriori (MAP) estimate corresponding to a high-dimensional Gaussian product prior, are derived. These results are deduced from corresponding contraction rates for the associated posterior distributions. The rates obtained are optimal up to log-factors in $L^2$-loss in any dimension, and also for supremum norm loss when $d \le 4$. Further, when $d \le 3$, nonparametric Bernstein-von Mises theorems are proved for the posterior distributions of $b$. From this we deduce functional central limit theorems for the implied estimators of the invariant measure $μ_b$. The limiting Gaussian process distributions have a covariance structure that is asymptotically optimal from an information-theoretic point of view.

preprint2016arXiv

Inference on covariance operators via concentration inequalities: k-sample tests, classification, and clustering via Rademacher complexities

We propose a novel approach to the analysis of covariance operators making use of concentration inequalities. First, non-asymptotic confidence sets are constructed for such operators. Then, subsequent applications including a k sample test for equality of covariance, a functional data classifier, and an expectation-maximization style clustering algorithm are derived and tested on both simulated and phoneme data.

preprint2016arXiv

Nonparametric Bayesian posterior contraction rates for discretely observed scalar diffusions

We consider nonparametric Bayesian inference in a reflected diffusion model $dX_t = b (X_t)dt + σ(X_t) dW_t,$ with discretely sampled observations $X_0, X_Δ, \dots, X_{nΔ}$. We analyse the nonlinear inverse problem corresponding to the `low frequency sampling' regime where $Δ>0$ is fixed and $n \to \infty$. A general theorem is proved that gives conditions for prior distributions $Π$ on the diffusion coefficient $σ$ and the drift function $b$ that ensure minimax optimal contraction rates of the posterior distribution over Hölder-Sobolev smoothness classes. These conditions are verified for natural examples of nonparametric random wavelet series priors. For the proofs we derive new concentration inequalities for empirical processes arising from discretely observed diffusions that are of independent interest.

preprint2014arXiv

High-frequency Donsker theorems for Lévy measures

Donsker-type functional limit theorems are proved for empirical processes arising from discretely sampled increments of a univariate Lévy process. In the asymptotic regime the sampling frequencies increase to infinity and the limiting object is a Gaussian process that can be obtained from the composition of a Brownian motion with a covariance operator determined by the Lévy measure. The results are applied to derive the asymptotic distribution of natural estimators for the distribution function of the Lévy jump measure. As an application we deduce Kolmogorov-Smirnov type tests and confidence bands.

preprint2011arXiv

Adaptive estimation of a distribution function and its density in sup-norm loss by wavelet and spline projections

Given an i.i.d. sample from a distribution $F$ on $\mathbb{R}$ with uniformly continuous density $p_0$, purely data-driven estimators are constructed that efficiently estimate $F$ in sup-norm loss and simultaneously estimate $p_0$ at the best possible rate of convergence over Hölder balls, also in sup-norm loss. The estimators are obtained by applying a model selection procedure close to Lepski's method with random thresholds to projections of the empirical measure onto spaces spanned by wavelets or $B$-splines. The random thresholds are based on suprema of Rademacher processes indexed by wavelet or spline projection kernels. This requires Bernstein-type analogs of the inequalities in Koltchinskii [Ann. Statist. 34 (2006) 2593-2656] for the deviation of suprema of empirical processes from their Rademacher symmetrizations.

preprint2010arXiv

Confidence bands in density estimation

Given a sample from some unknown continuous density $f:\mathbb{R}\to\mathbb{R}$, we construct adaptive confidence bands that are honest for all densities in a &#34;generic&#34; subset of the union of $t$-Hölder balls, $0<t\le r$, where $r$ is a fixed but arbitrary integer. The exceptional (&#34;nongeneric&#34;) set of densities for which our results do not hold is shown to be nowhere dense in the relevant Hölder-norm topologies. In the course of the proofs we also obtain limit theorems for maxima of linear wavelet and kernel density estimators, which are of independent interest.