Source author record

Stefano Vigogna

Stefano Vigogna appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning math.FA math.ST Statistics Theory math.GR

Catalog footprint

What is connected

9works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Conditional regression for single-index models

The single-index model is a statistical model for intrinsic regression where responses are assumed to depend on a single yet unknown linear combination of the predictors, allowing to express the regression function as $ \mathbb{E} [ Y | X ] = f ( \langle v , X \rangle ) $ for some unknown \emph{index} vector $v$ and \emph{link} function $f$. Conditional methods provide a simple and effective approach to estimate $v$ by averaging moments of $X$ conditioned on $Y$, but depend on parameters whose optimal choice is unknown and do not provide generalization bounds on $f$. In this paper we propose a new conditional method converging at $\sqrt{n}$ rate under an explicit parameter characterization. Moreover, we prove that polynomial partitioning estimates achieve the $1$-dimensional min-max rate for regression of Hölder functions when combined to any $\sqrt{n}$-convergent index estimator. Overall this yields an estimator for dimension reduction and regression of single-index models that attains statistical optimality in quasilinear time.

preprint2022arXiv

Multiclass learning with margin: exponential rates with no bias-variance trade-off

We study the behavior of error bounds for multiclass classification under suitable margin conditions. For a wide variety of methods we prove that the classification error under a hard-margin condition decreases exponentially fast without any bias-variance trade-off. Different convergence rates can be obtained in correspondence of different margin assumptions. With a self-contained and instructive analysis we are able to generalize known results from the binary to the multiclass setting.

preprint2021arXiv

Construction and Monte Carlo estimation of wavelet frames generated by a reproducing kernel

We introduce a construction of multiscale tight frames on general domains. The frame elements are obtained by spectral filtering of the integral operator associated with a reproducing kernel. Our construction extends classical wavelets as well as generalized wavelets on both continuous and discrete non-Euclidean structures such as Riemannian manifolds and weighted graphs. Moreover, it allows to study the relation between continuous and discrete frames in a random sampling regime, where discrete frames can be seen as Monte Carlo estimates of the continuous ones. Pairing spectral regularization with learning theory, we show that a sample frame tends to its population counterpart, and derive explicit finite-sample rates on spaces of Sobolev and Besov regularity. Our results prove the stability of frames constructed on empirical data, in the sense that all stochastic discretizations have the same underlying limit regardless of the set of initial training samples.

preprint2021arXiv

Multiscale regression on unknown manifolds

We consider the regression problem of estimating functions on $\mathbb{R}^D$ but supported on a $d$-dimensional manifold $ \mathcal{M} \subset \mathbb{R}^D $ with $ d \ll D $. Drawing ideas from multi-resolution analysis and nonlinear approximation, we construct low-dimensional coordinates on $\mathcal{M}$ at multiple scales, and perform multiscale regression by local polynomial fitting. We propose a data-driven wavelet thresholding scheme that automatically adapts to the unknown regularity of the function, allowing for efficient estimation of functions exhibiting nonuniform regularity at different locations and scales. We analyze the generalization error of our method by proving finite sample bounds in high probability on rich classes of priors. Our estimator attains optimal learning rates (up to logarithmic factors) as if the function was defined on a known Euclidean domain of dimension $d$, instead of an unknown manifold embedded in $\mathbb{R}^D$. The implemented algorithm has quasilinear complexity in the sample size, with constants linear in $D$ and exponential in $d$. Our work therefore establishes a new framework for regression on low-dimensional sets embedded in high dimensions, with fast implementation and strong theoretical guarantees.

preprint2020arXiv

Estimating multi-index models with response-conditional least squares

The multi-index model is a simple yet powerful high-dimensional regression model which circumvents the curse of dimensionality assuming $ \mathbb{E} [ Y | X ] = g(A^\top X) $ for some unknown index space $A$ and link function $g$. In this paper we introduce a method for the estimation of the index space, and study the propagation error of an index space estimate in the regression of the link function. The proposed method approximates the index space by the span of linear regression slope coefficients computed over level sets of the data. Being based on ordinary least squares, our approach is easy to implement and computationally efficient. We prove a tight concentration bound that shows $N^{-1/2}$-convergence, but also faithfully describes the dependence on the chosen partition of level sets, hence giving indications on the hyperparameter tuning. The estimator's competitiveness is confirmed by extensive comparisons with state-of-the-art methods, both on synthetic and real data sets. As a second contribution, we establish minimax optimal generalization bounds for k-nearest neighbors and piecewise polynomial regression when trained on samples projected onto any $N^{-1/2}$-consistent estimate of the index space, thus providing complete and provable estimation of the multi-index model.

preprint2019arXiv

Monte Carlo wavelets: a randomized approach to frame discretization

In this paper we propose and study a family of continuous wavelets on general domains, and a corresponding stochastic discretization that we call Monte Carlo wavelets. First, using tools from the theory of reproducing kernel Hilbert spaces and associated integral operators, we define a family of continuous wavelets by spectral calculus. Then, we propose a stochastic discretization based on Monte Carlo estimates of integral operators. Using concentration of measure results, we establish the convergence of such a discretization and derive convergence rates under natural regularity assumptions.

preprint2014arXiv

Coorbit spaces with voice in a Fréchet space

We set up a new general coorbit space theory for reproducing representations of a locally compact second countable group $G$ that are not necessarily irreducible nor integrable. Our basic assumption is that the kernel associated with the voice transform belongs to a Fréchet space $\mathcal T$ of functions on $G$, which generalizes the classical choice $\mathcal T=L_w^1(G)$. Our basic example is $ \mathcal T=\bigcap_{p\in(1,+\infty)} L^p(G)$, or a weighted versions of it. By means of this choice it is possible to treat, for instance, Paley-Wiener spaces and coorbit spaces related to Shannon wavelets and Schrödingerlets.

preprint2014arXiv

Geometric classification of semidirect products in the maximal parabolic subgroup of $\operatorname{Sp}(2,\mathbb{R})$

We classify up to conjugation by $\operatorname{GL}(2,\mathbb{R})$ (more precisely, block diagonal symplectic matrices) all the semidirect products inside the maximal parabolic of $\operatorname{Sp}(2,\mathbb{R})$ by means of an essentially geometric argument. This classification has already been established without geometry, under a stricter notion of equivalence, namely conjugation by arbitrary symplectic matrices. The present approach might be useful in higher dimensions and provides some insight.

preprint2014arXiv

Intrinsic Localization of Anisotropic Frames II: $α$-Molecules

This article is a continuation of the recent paper [Grohs, Intrinsic localization of anisotropic frames, ACHA, 2013], where off-diagonal-decay properties (often referred to as 'localization' in the literature) of Moore-Penrose pseudoinverses of (bi-infinite) matrices are established, whenever the latter possess similar off-diagonal-decay properties. This problem is especially interesting if the matrix arises as a discretization of an operator with respect to a frame or basis. Previous work on this problem has been restricted to wavelet- or Gabor frames. In the previous work we extended these results to frames of parabolic molecules, including curvelets or shearlets as special cases. The present paper extends and unifies these results by establishing analogous properties for frames of $α$-molecules as introduced in recent work [Grohs, Keiper, Kutyniok, Schäfer, Alpha molecules: curvelets, shearlets, ridgelets, and beyond, Proc. SPIE. 8858, 2013]. Since wavelets, curvelets, shearlets, ridgelets and hybrid shearlets all constitute instances of $α$-molecules, our results establish localization properties for all these systems simultaneously.

Stefano Vigogna

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

Conditional regression for single-index models

Multiclass learning with margin: exponential rates with no bias-variance trade-off

Construction and Monte Carlo estimation of wavelet frames generated by a reproducing kernel

Multiscale regression on unknown manifolds

Estimating multi-index models with response-conditional least squares

Monte Carlo wavelets: a randomized approach to frame discretization

Coorbit spaces with voice in a Fréchet space

Geometric classification of semidirect products in the maximal parabolic subgroup of $\operatorname{Sp}(2,\mathbb{R})$

Intrinsic Localization of Anisotropic Frames II: $α$-Molecules