Source author record

Sho Yaida

Sho Yaida appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

hep-th cond-mat.dis-nn cond-mat.stat-mech cond-mat.str-el cond-mat.soft gr-qc Machine Learning Artificial Intelligence astro-ph hep-ph

Catalog footprint

What is connected

17works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2021arXiv

The Principles of Deep Learning Theory

This book develops an effective theory approach to understanding deep neural networks of practical relevance. Beginning from a first-principles component-level picture of networks, we explain how to determine an accurate description of the output of trained networks by solving layer-to-layer iteration equations and nonlinear learning dynamics. A main result is that the predictions of networks are described by nearly-Gaussian distributions, with the depth-to-width aspect ratio of the network controlling the deviations from the infinite-width Gaussian description. We explain how these effectively-deep networks learn nontrivial representations from training and more broadly analyze the mechanism of representation learning for nonlinear models. From a nearly-kernel-methods perspective, we find that the dependence of such models' predictions on the underlying learning algorithm can be expressed in a simple and universal way. To obtain these results, we develop the notion of representation group flow (RG flow) to characterize the propagation of signals through the network. By tuning networks to criticality, we give a practical solution to the exploding and vanishing gradient problem. We further explain how RG flow leads to near-universal behavior and lets us categorize networks built from different activation functions into universality classes. Altogether, we show that the depth-to-width ratio governs the effective model complexity of the ensemble of trained networks. By using information-theoretic techniques, we estimate the optimal aspect ratio at which we expect the network to be practically most useful and show how residual connections can be used to push this scale to arbitrary depths. With these tools, we can learn in detail about the inductive bias of architectures, hyperparameters, and optimizers.

preprint2020arXiv

Non-Gaussian processes and neural networks at finite widths

Gaussian processes are ubiquitous in nature and engineering. A case in point is a class of neural networks in the infinite-width limit, whose priors correspond to Gaussian processes. Here we perturbatively extend this correspondence to finite-width neural networks, yielding non-Gaussian processes as priors. The methodology developed herein allows us to track the flow of preactivation distributions by progressively integrating out random variables from lower to higher layers, reminiscent of renormalization-group flow. We further develop a perturbative procedure to perform Bayesian inference with weakly non-Gaussian priors.

preprint2016arXiv

Efficient measurement of point-to-set correlations and overlap fluctuations in glass-forming liquids

Cavity point-to-set correlations are real-space tools to detect the roughening of the free-energy landscape that accompanies the dynamical slowdown of glass-forming liquids. Measuring these correlations in model glass formers remains, however, a major computational challenge. Here, we develop a general parallel-tempering method that provides orders-of-magnitude improvement for sampling and equilibrating configurations within cavities. We apply this improved scheme to the canonical Kob-Andersen binary Lennard-Jones model for temperatures down to the mode-coupling theory crossover. Most significant improvements are noted for small cavities, which have thus far been the most difficult to study. This methodological advance also enables us to study a broader range of physical observables associated with thermodynamic fluctuations. We measure the probability distribution of overlap fluctuations in cavities, which displays a non-trivial temperature evolution. The corresponding overlap susceptibility is found to provide a robust quantitative estimate of the point-to-set length scale requiring no fitting. By resolving spatial fluctuations of the overlap in the cavity, we also obtain quantitative information about the geometry of overlap fluctuations. We can thus examine in detail how the penetration length as well as its fluctuations evolve with temperature and cavity size.

preprint2016arXiv

Linking dynamical heterogeneity to static amorphous order

Glass-forming liquids grow dramatically sluggish upon cooling. This slowdown has long been thought to be accompanied by a growing correlation length. Characteristic dynamical and static length scales, however, have been observed to grow at different rates, which perplexes the relationship between the two and with the slowdown. Here, we show the existence of a direct link between dynamical sluggishness and static point-to-set correlations, holding at the local level as we probe different environments within a liquid. This link, which is stronger and more general than that observed with locally preferred structures, suggests the existence of an intimate relationship between structure and dynamics in a broader range of glass-forming liquids than previously thought.

preprint2016arXiv

Point-to-set lengths, local structure, and glassiness

The growing sluggishness of glass-forming liquids is thought to be accompanied by growing structural order. The nature of such order, however, remains hotly debated. A decade ago, point-to-set (PTS) correlation lengths were proposed as measures of amorphous order in glass formers, but recent results raise doubts as to their generality. Here, we extend the definition of PTS correlations to agnostically capture any type of growing order in liquids, be it local or amorphous. This advance enables the formulation of a clear distinction between slowing down due to conventional critical ordering and that due to glassiness, and provides a unified framework to assess the relative importance of specific local order and generic amorphous order in glass formation.

preprint2015arXiv

Glassy slowdown and replica-symmetry-breaking instantons

Glass-forming liquids exhibit a dramatic dynamical slowdown as the temperature is lowered. This can be attributed to relaxation proceeding via large structural rearrangements whose characteristic size increases as the system cools. These cooperative rearrangements are well modeled by instantons in a replica effective field theory, with the size of the dominant instanton encoding the liquid's cavity point-to-set correlation length. Varying the parameters of the effective theory corresponds to varying the statistics of the underlying free-energy landscape. We demonstrate that, for a wide range of parameters, replica-symmetry-breaking instantons dominate. The detailed structure of the dominant instanton provides a rich window into point-to-set correlations and glassy dynamics.

preprint2014arXiv

Critical Exponents for Supercooled Liquids

We compute critical exponents governing universal features of supercooled liquids through the effective theory of an overlap field. The correlation length diverges with the Ising exponent; the size of dynamically heterogeneous patches grows more rapidly; and the relaxation time obeys a generalized Vogel-Fulcher-Tammann relation.

preprint2014arXiv

Effective Field Theory for Supercooled Liquids

Starting from a microscopic model of liquids, we construct an effective theory of an overlap field through duplication of the system and coarse-graining. We then propose a recipe to extract a relaxation time and two characteristic length scales of a supercooled liquid from this effective field theory. Appealing to the Ginzburg-Landau-Wilson paradigm near the putative critical point, we further conclude that this effective field theory resides within the Ising universality class.

preprint2013arXiv

Point-to-set correlations and instantons

For a generic many-body system, we define a soft point-to-set correlation function. We then show that this function accepts a representation in terms of an effective overlap field theory. In particular, instantons in this effective field theory encode point-to-set correlations for supercooled liquids.

preprint2012arXiv

Disordered Holographic Systems II: Marginal Relevance of Imperfection

We continue our study of quenched disorder in holographic systems, focusing on the effects of mild electric disorder. By studying the renormalization group evolution of the disorder distribution at subleading order in perturbations away from the clean fixed point, we show that electric disorder is marginally relevant in (2+1)-dimensional holographic conformal field theories.

preprint2012arXiv

Instanton Calculus of Lifshitz Tails

For noninteracting particles moving in a Gaussian random potential, there exists a disagreement in the literature on the asymptotic expression for the density of states in the tail of the band. We resolve this discrepancy. Further we illuminate the physical facet of instantons appearing in replica and supersymmetric derivations with another derivation employing a Lagrange multiplier field.

preprint2012arXiv

Lifshitz Tails of Scale-Invariant Theories with Electric Impurities

We study scale-invariant systems in the presence of Gaussian quenched electric disorder, focusing on the tails of the energy spectra induced by disorder. For relevant disorder we derive asymptotic expressions for the densities of unit-charged states in the tails, positing the existence of saddle points in appropriate disorder integrals. The resultant scalings are dictated by spatial dimensions and dynamical exponents of the systems.

preprint2011arXiv

Adventures in Holographic Dimer Models

We abstract the essential features of holographic dimer models, and develop several new applications of these models. First, semi-holographically coupling free band fermions to holographic dimers, we uncover novel phase transitions between conventional Fermi liquids and non-Fermi liquids, accompanied by a change in the structure of the Fermi surface. Second, we make dimer vibrations propagate through the whole crystal by way of double trace deformations, obtaining nontrivial band structure. In a simple toy model, the topology of the band structure experiences an interesting reorganization as we vary the strength of the double trace deformations. Finally, we develop tools that would allow one to build, in a bottom-up fashion, a holographic avatar of the Hubbard model.

preprint2011arXiv

Disordered Holographic Systems I: Functional Renormalization

We study quenched disorder in strongly correlated systems via holography, focusing on the thermodynamic effects of mild electric disorder. Disorder is introduced through a random potential which is assumed to self-average on macroscopic scales. Studying the flow of this distribution with energy scale leads us to develop a holographic functional renormalization scheme. We test this scheme by computing thermodynamic quantities and confirming that the Harris criterion for relevance, irrelevance or marginality of quenched disorder holds.

preprint2010arXiv

Holographic Lattices, Dimers, and Glasses

We holographically engineer a periodic lattice of localized fermionic impurities within a plasma medium by putting an array of probe D5-branes in the background produced by N D3-branes. Thermodynamic quantities are computed in the large N limit via the holographic dictionary. We then dope the lattice by replacing some of the D5-branes by anti-D5-branes. In the large N limit, we determine the critical temperature below which the system dimerizes with bond ordering. Finally, we argue that for the special case of a square lattice our system is glassy at large but finite N, with the low temperature physics dominated by a huge collection of metastable dimerized configurations without long-range order, connected only through tunneling events.

preprint2008arXiv

Viscosity Bound Violation in Higher Derivative Gravity

Motivated by the vast string landscape, we consider the shear viscosity to entropy density ratio in conformal field theories dual to Einstein gravity with curvature square corrections. After field redefinitions these theories reduce to Gauss-Bonnet gravity, which has special properties that allow us to compute the shear viscosity nonperturbatively in the Gauss-Bonnet coupling. By tuning of the coupling, the value of the shear viscosity to entropy density ratio can be adjusted to any positive value from infinity down to zero, thus violating the conjectured viscosity bound. At linear order in the coupling, we also check consistency of four different methods to calculate the shear viscosity, and we find that all of them agree. We search for possible pathologies associated with this class of theories violating the viscosity bound.

preprint2005arXiv

Energy Conditions and Junction Conditions

We consider the familiar junction conditions described by Israel for thin timelike walls in Einstein-Hilbert gravity. One such condition requires the induced metric to be continuous across the wall. Now, there are many spacetimes with sources confined to a thin wall for which this condition is violated and the Israel formalism does not apply. However, we explore the conjecture that the induced metric is in fact continuous for any thin wall which models spacetimes containing only positive energy matter. Thus, the usual junction conditions would hold for all positive energy spacetimes. This conjecture is proven in various special cases, including the case of static spacetimes with spherical or planar symmetry as well as settings without symmetry which may be sufficiently well approximated by smooth spacetimes with well-behaved null geodesic congruences.

Sho Yaida

What is connected

Connect this record

See the researcher in context

Building this map preview

17 published item(s)

The Principles of Deep Learning Theory

Non-Gaussian processes and neural networks at finite widths

Efficient measurement of point-to-set correlations and overlap fluctuations in glass-forming liquids

Linking dynamical heterogeneity to static amorphous order

Point-to-set lengths, local structure, and glassiness

Glassy slowdown and replica-symmetry-breaking instantons

Critical Exponents for Supercooled Liquids

Effective Field Theory for Supercooled Liquids

Point-to-set correlations and instantons

Disordered Holographic Systems II: Marginal Relevance of Imperfection

Instanton Calculus of Lifshitz Tails

Lifshitz Tails of Scale-Invariant Theories with Electric Impurities

Adventures in Holographic Dimer Models

Disordered Holographic Systems I: Functional Renormalization

Holographic Lattices, Dimers, and Glasses

Viscosity Bound Violation in Higher Derivative Gravity

Energy Conditions and Junction Conditions