Researcher profile

David Belius

David Belius contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2022arXiv

Complexity of local maxima of given radial derivative for mixed $p$-spin Hamiltonians

We study the number of local maxima with given radial derivative of spherical mixed $p$-spin models and prove that the second moment matches the square of the first moment on exponential scale for arbitrary mixtures and any radial derivative. This is surprising, since for the number of local maxima with given radial derivative and given energy the corresponding result is only true for specific mixtures [Sub17; BSZ20]. We use standard Kac-Rice computations to derive formulas for the first and second moment at exponential scale, and then find a remarkable analytic argument that shows that the second moment formula is bounded by twice the first moment formula in this general setting. This also leads to a new proof of a central inequality used to prove concentration of the number critical points of pure $p$-spin models of given energy in [Sub17] and removes the need for the computer assisted argument used in that paper for $3 \leq p \leq 10$.

preprint2022arXiv

High temperature TAP upper bound for the free energy of mean field spin glasses

This work proves an upper bound for the free energy of the Sherrington-Kirkpatrick model and its generalizations in terms of the Thouless-Anderson-Palmer (TAP) energy. The result applies to models with spherical or Ising spins and any mixed $p$-spin Hamiltonian with external field or with a non-linear spike term. The bound is expected to be tight to leading order at high temperature, and is non-trivial in the presence of an external field. For the proof a geometric microcanonical method is employed, in which one covers the spin space with sets, each of which is centered at a magnetization vector $m$ and whose contribution to the partition function is bounded in terms of the TAP energy at $m$.

preprint2022arXiv

Learning Multiscale Convolutional Dictionaries for Image Reconstruction

Convolutional neural networks (CNNs) have been tremendously successful in solving imaging inverse problems. To understand their success, an effective strategy is to construct simpler and mathematically more tractable convolutional sparse coding (CSC) models that share essential ingredients with CNNs. Existing CSC methods, however, underperform leading CNNs in challenging inverse problems. We hypothesize that the performance gap may be attributed in part to how they process images at different spatial scales: While many CNNs use multiscale feature representations, existing CSC models mostly rely on single-scale dictionaries. To close the performance gap, we thus propose a multiscale convolutional dictionary structure. The proposed dictionary structure is derived from the U-Net, arguably the most versatile and widely used CNN for image-to-image learning problems. We show that incorporating the proposed multiscale dictionary in an otherwise standard CSC framework yields performance competitive with state-of-the-art CNNs across a range of challenging inverse problems including CT and MRI reconstruction. Our work thus demonstrates the effectiveness and scalability of the multiscale CSC approach in solving challenging inverse problems.

preprint2022arXiv

Phase diagram for the tap energy of the $p$-spin spherical mean field spin glass model

We solve the Thouless-Anderson-Palmer (TAP) variational principle associated to the spherical pure $p$-spin mean field spin glass Hamiltonian and present a detailed phase diagram. In the high temperature phase the maximum of variational principle is the annealed free energy of the model. In the low temperature phase the maximum, for which we give a formula, is strictly smaller. The high temperature phase consists of three subphases. (1) In the first phase $m=0$ is the unique relevant TAP maximizer. (2) In the second phase there are exponentially many TAP maximizers, but $m=0$ remains dominant. (3) In the third phase, after the so called dynamic phase transition, $m=0$ is no longer a relevant TAP maximizer, and exponentially many non-zero relevant TAP solutions add up to give the annealed free energy. Finally in the low temperature phase a subexponential number of TAP maximizers of near-maximal TAP energy dominate.

preprint2021arXiv

Triviality of the geometry of mixed $p$-spin spherical Hamiltonians with external field

We study isotropic Gaussian random fields on the high-dimensional sphere with an added deterministic linear term, also known as mixed p-spin Hamiltonians with external field. We prove that if the external field is sufficiently strong, then the resulting function has trivial geometry, that is only two critical points. This contrasts with the situation of no or weak external field where these functions typically have an exponential number of critical points. We give an explicit threshold $h_c$ for the magnitude of the external fieldnecessary for trivialization and conjecture $h_c$ to be sharp. The Kac-Rice formula is our main tool. Our work extends [Fyo15], which identified the trivial regime for the special case of pure p-spin Hamiltonians with random external field.

preprint2020arXiv

On the Empirical Neural Tangent Kernel of Standard Finite-Width Convolutional Neural Network Architectures

The Neural Tangent Kernel (NTK) is an important milestone in the ongoing effort to build a theory for deep learning. Its prediction that sufficiently wide neural networks behave as kernel methods, or equivalently as random feature models, has been confirmed empirically for certain wide architectures. It remains an open question how well NTK theory models standard neural network architectures of widths common in practice, trained on complex datasets such as ImageNet. We study this question empirically for two well-known convolutional neural network architectures, namely AlexNet and LeNet, and find that their behavior deviates significantly from their finite-width NTK counterparts. For wider versions of these networks, where the number of channels and widths of fully-connected layers are increased, the deviation decreases.