Source author record

David Belius

David Belius appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

math.PR, Computer Vision, Machine Learning, math-ph, math.MP, cond-mat.dis-nn, cond-mat.stat-mech, eess.IV

Catalog footprint

What is connected

12 works

8 topics

4 close collaborators

preprint, 2022, arXiv

Complexity of local maxima of given radial derivative for mixed $p$-spin Hamiltonians

We study the number of local maxima with given radial derivative of spherical mixed $p$-spin models and prove that the second moment matches the square of the first moment on exponential scale for arbitrary mixtures and any radial derivative. This is surprising, since for the number of local maxima with given radial derivative and given energy the corresponding result is only true for specific mixtures [Sub17; BSZ20]. We use standard Kac-Rice computations to derive formulas for the first and second moment at exponential scale, and then find a remarkable analytic argument that shows that the second moment formula is bounded by twice the first moment formula in this general setting. This also leads to a new proof of a central inequality used to prove concentration of the number of critical points of pure $p$-spin models of given energy in [Sub17], and removes the need for the computer-assisted argument used in that paper for $3 \leq p \leq 10$.

preprint, 2022, arXiv

High temperature TAP upper bound for the free energy of mean field spin glasses

This work proves an upper bound for the free energy of the Sherrington-Kirkpatrick model and its generalizations in terms of the Thouless-Anderson-Palmer (TAP) energy. The result applies to models with spherical or Ising spins and any mixed $p$-spin Hamiltonian with external field or with a non-linear spike term. The bound is expected to be tight to leading order at high temperature, and is non-trivial in the presence of an external field. For the proof a geometric microcanonical method is employed, in which one covers the spin space with sets, each of which is centered at a magnetization vector $m$ and whose contribution to the partition function is bounded in terms of the TAP energy at $m$.

preprint, 2022, arXiv

Learning Multiscale Convolutional Dictionaries for Image Reconstruction

Convolutional neural networks (CNNs) have been tremendously successful in solving imaging inverse problems. To understand their success, an effective strategy is to construct simpler and mathematically more tractable convolutional sparse coding (CSC) models that share essential ingredients with CNNs. Existing CSC methods, however, underperform leading CNNs in challenging inverse problems. We hypothesize that the performance gap may be attributed in part to how they process images at different spatial scales: While many CNNs use multiscale feature representations, existing CSC models mostly rely on single-scale dictionaries. To close the performance gap, we thus propose a multiscale convolutional dictionary structure. The proposed dictionary structure is derived from the U-Net, arguably the most versatile and widely used CNN for image-to-image learning problems. We show that incorporating the proposed multiscale dictionary in an otherwise standard CSC framework yields performance competitive with state-of-the-art CNNs across a range of challenging inverse problems including CT and MRI reconstruction. Our work thus demonstrates the effectiveness and scalability of the multiscale CSC approach in solving challenging inverse problems.

preprint, 2022, arXiv

Phase diagram for the TAP energy of the $p$-spin spherical mean field spin glass model

We solve the Thouless-Anderson-Palmer (TAP) variational principle associated to the spherical pure $p$-spin mean field spin glass Hamiltonian and present a detailed phase diagram. In the high temperature phase the maximum of the variational principle is the annealed free energy of the model. In the low temperature phase the maximum, for which we give a formula, is strictly smaller. The high temperature phase consists of three subphases. (1) In the first phase $m=0$ is the unique relevant TAP maximizer. (2) In the second phase there are exponentially many TAP maximizers, but $m=0$ remains dominant. (3) In the third phase, after the so-called dynamic phase transition, $m=0$ is no longer a relevant TAP maximizer, and exponentially many non-zero relevant TAP solutions add up to give the annealed free energy. Finally, in the low temperature phase, a subexponential number of TAP maximizers of near-maximal TAP energy dominate.

preprint, 2021, arXiv

Triviality of the geometry of mixed $p$-spin spherical Hamiltonians with external field

We study isotropic Gaussian random fields on the high-dimensional sphere with an added deterministic linear term, also known as mixed $p$-spin Hamiltonians with external field. We prove that if the external field is sufficiently strong, then the resulting function has trivial geometry, that is, only two critical points. This contrasts with the situation of no or weak external field, where these functions typically have an exponential number of critical points. We give an explicit threshold $h_c$ for the magnitude of the external field necessary for trivialization and conjecture $h_c$ to be sharp. The Kac-Rice formula is our main tool. Our work extends [Fyo15], which identified the trivial regime for the special case of pure $p$-spin Hamiltonians with random external field.

preprint, 2020, arXiv

On the Empirical Neural Tangent Kernel of Standard Finite-Width Convolutional Neural Network Architectures

The Neural Tangent Kernel (NTK) is an important milestone in the ongoing effort to build a theory for deep learning. Its prediction that sufficiently wide neural networks behave as kernel methods, or equivalently as random feature models, has been confirmed empirically for certain wide architectures. It remains an open question how well NTK theory models standard neural network architectures of widths common in practice, trained on complex datasets such as ImageNet. We study this question empirically for two well-known convolutional neural network architectures, namely AlexNet and LeNet, and find that their behavior deviates significantly from their finite-width NTK counterparts. For wider versions of these networks, where the number of channels and widths of fully-connected layers are increased, the deviation decreases.
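The empirical NTK mentioned in this abstract is the Gram matrix of parameter gradients of the network output, $\Theta(x,x') = \langle \nabla_\theta f(x), \nabla_\theta f(x') \rangle$. The following is a minimal sketch of that quantity for a toy two-layer ReLU network at random initialization. It is not the paper's experiment: the architecture, the widths, the inputs, the seed, and all function names are assumptions chosen for illustration, and nothing here involves AlexNet, LeNet, or training.

```python
import math
import random


def init_params(width, dim, rng):
    """Random parameters for f(x) = (1/sqrt(width)) * sum_j a_j * relu(w_j . x)."""
    w = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(width)]
    a = [rng.gauss(0.0, 1.0) for _ in range(width)]
    return w, a


def param_gradient(params, x):
    """Gradient of f(x) with respect to all parameters, flattened into one list."""
    w, a = params
    scale = 1.0 / math.sqrt(len(a))
    grad = []
    for w_j, a_j in zip(w, a):
        pre = sum(wi * xi for wi, xi in zip(w_j, x))
        active = 1.0 if pre > 0 else 0.0
        grad.append(scale * max(pre, 0.0))                  # d f / d a_j
        grad.extend(scale * a_j * active * xi for xi in x)  # d f / d w_j
    return grad


def empirical_ntk(params, xs):
    """Gram matrix of parameter gradients: K[i][k] = <grad f(x_i), grad f(x_k)>."""
    grads = [param_gradient(params, x) for x in xs]
    return [[sum(g * h for g, h in zip(gi, gk)) for gk in grads] for gi in grads]


rng = random.Random(0)  # fixed seed, an arbitrary choice for reproducibility
xs = [[1.0, 0.0], [0.0, 1.0], [0.6, 0.8]]  # hypothetical unit-norm inputs
for width in (10, 1000):
    kernel = empirical_ntk(init_params(width, 2, rng), xs)
    print(width, [round(v, 3) for v in kernel[0]])
```

Rerunning with different seeds at each width gives a rough feel for how much the kernel fluctuates from one initialization to the next in this toy model; the paper's comparison for trained convolutional networks is a separate and much larger experiment.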

preprint, 2020, arXiv

Tightness for the Cover Time of the two dimensional sphere

Let $C^*_{\varepsilon,S^2}$ denote the cover time of the two dimensional sphere by a Wiener sausage of radius $\varepsilon$. We prove that $$\sqrt{C^{*}_{\varepsilon,S^2}} - \sqrt{\frac{2A_{S^2}}{\pi}}\left(\log \varepsilon^{-1} - \frac{1}{4}\log\log \varepsilon^{-1}\right)$$ is tight, where $A_{S^2}=4\pi$ denotes the Riemannian area of $S^2$.

preprint, 2016, arXiv

Maximum of the characteristic polynomial of random unitary matrices

It was recently conjectured by Fyodorov, Hiary and Keating that the maximum of the characteristic polynomial on the unit circle of an $N\times N$ random unitary matrix sampled from the Haar measure grows like $CN/(\log N)^{3/4}$ for some random variable $C$. In this paper, we verify the leading order of this conjecture, that is, we prove that with high probability the maximum lies in the range $[N^{1 - \varepsilon},N^{1 + \varepsilon}]$, for arbitrarily small $\varepsilon$. The method is based on identifying an approximate branching random walk in the Fourier decomposition of the characteristic polynomial, and uses techniques developed to describe the extremes of branching random walks and of other log-correlated random fields. A key technical input is the asymptotic analysis of Toeplitz determinants with dimension-dependent symbols. The original argument for these asymptotics followed the general idea that the statistical mechanics of $1/f$-noise random energy models is governed by a freezing transition. We also prove the conjectured freezing of the free energy for random unitary matrices.

preprint, 2014, arXiv

The subleading order of two dimensional cover times

The epsilon-cover time of the two dimensional torus by Brownian motion is the time it takes for the process to come within distance epsilon>0 from any point. Its leading order in the small epsilon-regime has been established by Dembo, Peres, Rosen and Zeitouni [Ann. of Math., 160 (2004)]. In this work, the second order correction is identified. The approach relies on a multi-scale refinement of the second moment method, and draws on ideas from the study of the extremes of branching Brownian motion.

preprint, 2012, arXiv

Cover levels and random interlacements

This note investigates cover levels of finite sets in the random interlacements model introduced in [Ann. of Math. (2) 171 (2010) 2039-2087], that is, the least level such that the set is completely contained in the random interlacement at that level. It proves that as the cardinality of a set goes to infinity, the rescaled and recentered cover level tends in distribution to the Gumbel distribution with cumulative distribution function $\exp(-\exp(-z))$.
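For readers unfamiliar with the limit law named in this abstract, the following minimal sketch samples from the standard Gumbel distribution by inverting its cumulative distribution function $\exp(-\exp(-z))$ and compares the empirical distribution with the formula. It illustrates the distribution only, not the random interlacements model; the function names, seed, and sample size are assumptions made for the example.

```python
import math
import random


def gumbel_cdf(z: float) -> float:
    """CDF of the standard Gumbel distribution: exp(-exp(-z))."""
    return math.exp(-math.exp(-z))


def sample_gumbel(rng: random.Random) -> float:
    """Inverse-transform sampling: solve u = exp(-exp(-z)) for z."""
    u = rng.random()
    # Guard against u == 0.0, for which log(u) is undefined.
    while u == 0.0:
        u = rng.random()
    return -math.log(-math.log(u))


def empirical_cdf(samples, z):
    """Fraction of samples that are at most z."""
    return sum(1 for s in samples if s <= z) / len(samples)


rng = random.Random(0)  # fixed seed, an arbitrary choice for reproducibility
samples = [sample_gumbel(rng) for _ in range(20000)]
for z in (-1.0, 0.0, 1.0, 2.0):
    print(f"z={z:+.1f}  empirical={empirical_cdf(samples, z):.3f}  exact={gumbel_cdf(z):.3f}")
```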

preprint, 2012, arXiv

Gumbel fluctuations for cover times in the discrete torus

This work proves that the fluctuations of the cover time of simple random walk in the discrete torus of dimension at least three with large side-length are governed by the Gumbel extreme value distribution. This result was conjectured for example in the book by Aldous & Fill. We also derive some corollaries which qualitatively describe "how" covering happens. In addition, we develop a new and stronger coupling of the model of random interlacements, introduced by Sznitman, and random walk in the torus. This coupling is used to prove the cover time result and is also of independent interest.
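The object studied in this abstract, the cover time of simple random walk on the discrete torus, can be simulated directly. The sketch below is a minimal illustration under stated assumptions: the side length, number of runs, seed, and the function name `torus_cover_time` are all hypothetical choices, and such a small torus says nothing about the large side-length limit that the theorem concerns.

```python
import random


def torus_cover_time(side: int, dim: int, rng: random.Random) -> int:
    """Steps taken by simple random walk to visit every vertex of (Z/side Z)^dim."""
    position = [0] * dim
    visited = {tuple(position)}
    total = side ** dim
    steps = 0
    while len(visited) < total:
        axis = rng.randrange(dim)  # choose a coordinate direction uniformly
        position[axis] = (position[axis] + rng.choice((-1, 1))) % side
        visited.add(tuple(position))
        steps += 1
    return steps


rng = random.Random(1)  # fixed seed, an arbitrary choice for reproducibility
times = [torus_cover_time(side=6, dim=3, rng=rng) for _ in range(50)]
print("mean cover time:", sum(times) / len(times))
print("min / max:", min(times), max(times))
```

Collecting many such samples for increasing side lengths and plotting a histogram of the recentered values is one informal way to see fluctuations of the kind the abstract describes, though the correct centering and scaling are those given in the paper, not anything computed here.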

preprint, 2011, arXiv

Cover times in the discrete cylinder

This article proves that, in terms of local times, the rescaled and recentered cover times of finite subsets of the discrete cylinder by simple random walk converge in law to the Gumbel distribution, as the cardinality of the set goes to infinity. As applications we obtain several other results related to covering in the discrete cylinder. Our method is new and involves random interlacements, which were introduced by Sznitman in arXiv:0704.2560. To enable the proof we develop a new stronger coupling of simple random walk in the cylinder and random interlacements, which is also of independent interest.
