Source author record

Martin Wahl

Martin Wahl appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.ST Statistics Theory math.PR Machine Learning math.NT

Catalog footprint

What is connected

7works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

A note on the prediction error of principal component regression in high dimensions

We analyze the prediction error of principal component regression (PCR) and prove high probability bounds for the corresponding squared risk conditional on the design. Our first main result shows that PCR performs comparably to the oracle method obtained by replacing empirical principal components by their population counterparts, provided that an effective rank condition holds. On the other hand, if the latter condition is violated, then empirical eigenvalues start to have a significant upward bias, resulting in a self-induced regularization of PCR. Our approach relies on the behavior of empirical eigenvalues, empirical eigenvectors and the excess risk of principal component analysis in high-dimensional regimes.

preprint2022arXiv

Relative perturbation bounds with applications to empirical covariance operators

The goal of this paper is to establish relative perturbation bounds, tailored for empirical covariance operators. Our main results are expansions for empirical eigenvalues and spectral projectors, leading to concentration inequalities and limit theorems. One of the key ingredients is a specific separation measure for population eigenvalues, which we call the relative rank, giving rise to a sharp invariance principle in terms of limit theorems, concentration inequalities and inconsistency results. Our framework is very general, requiring only $p > 4$ moments and allows for a huge variety of dependence structures.

preprint2020arXiv

Analyzing the discrepancy principle for kernelized spectral filter learning algorithms

We investigate the construction of early stopping rules in the nonparametric regression problem where iterative learning algorithms are used and the optimal iteration number is unknown. More precisely, we study the discrepancy principle, as well as modifications based on smoothed residuals, for kernelized spectral filter learning algorithms including gradient descent. Our main theoretical bounds are oracle inequalities established for the empirical estimation error (fixed design), and for the prediction error (random design). From these finite-sample bounds it follows that the classical discrepancy principle is statistically adaptive for slow rates occurring in the hard learning scenario, while the smoothed discrepancy principles are adaptive over ranges of faster rates (resp. higher smoothness parameters). Our approach relies on deviation inequalities for the stopping rules in the fixed design setting, combined with change-of-norm arguments to deal with the random design setting.

preprint2020arXiv

High-probability bounds for the reconstruction error of PCA

We derive high-probability bounds for the reconstruction error of PCA in infinite dimensions. We apply our bounds in the case that the eigenvalues of the covariance operator satisfy polynomial or exponential upper bounds.

preprint2015arXiv

A theory of nonparametric regression in the presence of complex nuisance components

In this paper, we consider the nonparametric random regression model $Y=f_1(X_1)+f_2(X_2)+ε$ and address the problem of estimating the function $f_1$. The term $f_2(X_2)$ is regarded as a nuisance term which can be considerably more complex than $f_1(X_1)$. Under minimal assumptions, we prove several nonasymptotic $L^2(\mathbb{P}^X)$-risk bounds for our estimators of $f_1$. Our approach is geometric and based on considerations in Hilbert spaces. It shows that the performance of our estimators is closely related to geometric quantities, such as minimal angles and Hilbert-Schmidt norms. Our results establish new conditions under which the estimators of $f_1$ have up to first order the same sharp upper bound as the corresponding estimators of $f_1$ in the model $Y=f_1(X_1)+ε$. As an example we apply the results to an additive model in which the number of components is very large or in which the nuisance components are considerably less smooth than $f_1$. In particular, the results apply to an asymptotic scenario in which the number of components is allowed to increase with the sample size.

preprint2015arXiv

Variable selection in high-dimensional additive models based on norms of projections

We consider the problem of variable selection in high-dimensional sparse additive models. We focus on the case that the components belong to nonparametric classes of functions. The proposed method is motivated by geometric considerations in Hilbert spaces and consists of comparing the norms of the projections of the data onto various additive subspaces. Under minimal geometric assumptions, we prove concentration inequalities which lead to new conditions under which consistent variable selection is possible. As an application, we establish conditions under which a single component can be estimated with the rate of convergence corresponding to the situation in which the other components are known.

preprint2013arXiv

On the mod-Gaussian convergence of a sum over primes

We prove mod-Gaussian convergence for a Dirichlet polynomial which approximates $\operatorname{Im}\logζ(1/2+it)$. This Dirichlet polynomial is sufficiently long to deduce Selberg's central limit theorem with an explicit error term. Moreover, assuming the Riemann hypothesis, we apply the theory of the Riemann zeta-function to extend this mod-Gaussian convergence to the complex plane. From this we obtain that $\operatorname{Im}\logζ(1/2+it)$ satisfies a large deviation principle on the critical line. Results about the moments of the Riemann zeta-function follow.

Martin Wahl

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

A note on the prediction error of principal component regression in high dimensions

Relative perturbation bounds with applications to empirical covariance operators

Analyzing the discrepancy principle for kernelized spectral filter learning algorithms

High-probability bounds for the reconstruction error of PCA

A theory of nonparametric regression in the presence of complex nuisance components

Variable selection in high-dimensional additive models based on norms of projections

On the mod-Gaussian convergence of a sum over primes