Source author record

Daniel A. Roberts

Daniel A. Roberts appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

hep-th, quant-ph, Artificial Intelligence, astro-ph.CO, cond-mat.quant-gas, cond-mat.stat-mech, cond-mat.str-el, gr-qc, hep-ph, Machine Learning

Catalog footprint

What is connected

10 works

10 topics

4 close collaborators

preprint, 2021, arXiv

The Principles of Deep Learning Theory

This book develops an effective theory approach to understanding deep neural networks of practical relevance. Beginning from a first-principles component-level picture of networks, we explain how to determine an accurate description of the output of trained networks by solving layer-to-layer iteration equations and nonlinear learning dynamics. A main result is that the predictions of networks are described by nearly-Gaussian distributions, with the depth-to-width aspect ratio of the network controlling the deviations from the infinite-width Gaussian description. We explain how these effectively-deep networks learn nontrivial representations from training and more broadly analyze the mechanism of representation learning for nonlinear models. From a nearly-kernel-methods perspective, we find that the dependence of such models' predictions on the underlying learning algorithm can be expressed in a simple and universal way. To obtain these results, we develop the notion of representation group flow (RG flow) to characterize the propagation of signals through the network. By tuning networks to criticality, we give a practical solution to the exploding and vanishing gradient problem. We further explain how RG flow leads to near-universal behavior and lets us categorize networks built from different activation functions into universality classes. Altogether, we show that the depth-to-width ratio governs the effective model complexity of the ensemble of trained networks. By using information-theoretic techniques, we estimate the optimal aspect ratio at which we expect the network to be practically most useful and show how residual connections can be used to push this scale to arbitrary depths. With these tools, we can learn in detail about the inductive bias of architectures, hyperparameters, and optimizers.
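The criticality statement in this abstract can be illustrated with a minimal sketch. It assumes a fully connected ReLU network at infinite width, with weight variance `c_w / width` and bias variance `c_b`; these symbols and the function name `relu_kernel_flow` are illustrative and are not taken from the abstract. Under those assumptions the variance of a single input's preactivations obeys K_{l+1} = c_b + c_w * K_l / 2, because a zero-mean Gaussian with variance K has E[relu(z)^2] = K / 2.

```python
def relu_kernel_flow(k0, c_w, c_b=0.0, depth=50):
    """Iterate the assumed infinite-width ReLU recursion K_{l+1} = c_b + c_w * K_l / 2.

    k0 is the preactivation variance in the first layer; the returned list
    holds the variance at every layer, starting with k0.
    """
    kernels = [k0]
    for _ in range(depth):
        # E[relu(z)^2] = K / 2 for z ~ N(0, K), so each layer rescales K by c_w / 2.
        kernels.append(c_b + c_w * kernels[-1] / 2.0)
    return kernels


if __name__ == "__main__":
    for c_w in (1.5, 2.0, 2.5):
        final = relu_kernel_flow(1.0, c_w)[-1]
        print(f"c_w = {c_w}: variance after 50 layers = {final:.3e}")
```

In this toy recursion the signal decays for `c_w < 2`, grows for `c_w > 2`, and is preserved exactly at `c_w = 2` with `c_b = 0`. That is the sense in which tuning to criticality avoids exploding and vanishing signals; the finite-width corrections discussed in the abstract are not modeled here.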

preprint, 2016, arXiv

Chaos in quantum channels

We study chaos and scrambling in unitary channels by considering their entanglement properties as states. Using out-of-time-order correlation functions to diagnose chaos, we characterize the ability of a channel to process quantum information. We show that the generic decay of such correlators implies that any input subsystem must have near vanishing mutual information with almost all partitions of the output. Additionally, we propose the negativity of the tripartite information of the channel as a general diagnostic of scrambling. This measures the delocalization of information and is closely related to the decay of out-of-time-order correlators. We back up our results with numerics in two non-integrable models and analytic results in a perfect tensor network model of chaotic time evolution. These results show that the butterfly effect in quantum systems implies the information-theoretic definition of scrambling.

preprint, 2016, arXiv

Complexity Equals Action

We conjecture that the quantum complexity of a holographic state is dual to the action of a certain spacetime region that we call a Wheeler-DeWitt patch. We illustrate and test the conjecture in the context of neutral, charged, and rotating black holes in AdS, as well as black holes perturbed with static shells and with shock waves. This conjecture evolved from a previous conjecture that complexity is dual to spatial volume, but appears to be a major improvement over the original. In light of our results, we discuss the hypothesis that black holes are the fastest computers in nature.

preprint, 2016, arXiv

Complexity, action, and black holes

Our earlier paper "Complexity Equals Action" conjectured that the quantum computational complexity of a holographic state is given by the classical action of a region in the bulk (the "Wheeler-DeWitt" patch). We provide calculations for the results quoted in that paper, explain how it fits into a broader (tensor) network of ideas, and elaborate on the hypothesis that black holes are the fastest computers in nature.

preprint, 2016, arXiv

Lieb-Robinson and the butterfly effect

As experiments are increasingly able to probe the quantum dynamics of systems with many degrees of freedom, it is interesting to probe fundamental bounds on the dynamics of quantum information. We elaborate on the relationship between one such bound---the Lieb-Robinson bound---and the butterfly effect in strongly-coupled quantum systems. The butterfly effect implies the ballistic growth of local operators in time, which can be quantified with the "butterfly" velocity $v_B$. Similarly, the Lieb-Robinson velocity places a state independent ballistic upper bound on the size of time evolved operators in non-relativistic lattice models. Here, we argue that $v_B$ is a state-dependent effective Lieb-Robinson velocity. We study the butterfly velocity in a wide variety of quantum field theories using holography and compare with free particle computations to understand the role of strong coupling. We find that, depending on the way length and time scale, $v_B$ acquires a temperature dependence and decreases towards the IR. We also comment on experimental prospects and on the relationship between the butterfly velocity and signaling.

preprint, 2016, arXiv

Localized shocks

We study products of precursors of spatially local operators, $W_{x_{n}}(t_{n}) ... W_{x_1}(t_1)$, where $W_x(t) = e^{-iHt} W_x e^{iHt}$. Using chaotic spin-chain numerics and gauge/gravity duality, we show that a single precursor fills a spatial region that grows linearly in $t$. In a lattice system, products of such operators can be represented using tensor networks. In gauge/gravity duality, they are related to Einstein-Rosen bridges supported by localized shock waves. We find a geometrical correspondence between these two descriptions, generalizing earlier work in the spatially homogeneous case.

preprint, 2016, arXiv

Two-dimensional conformal field theory and the butterfly effect

We study chaotic dynamics in two-dimensional conformal field theory through out-of-time order thermal correlators of the form $\langle W(t)VW(t)V\rangle$. We reproduce bulk calculations similar to those of [1], by studying the large $c$ Virasoro identity block. The contribution of this block to the above correlation function begins to decrease exponentially after a delay of $\sim t_* - \frac{\beta}{2\pi}\log \beta^2 E_w E_v$, where $t_*$ is the scrambling time $\frac{\beta}{2\pi}\log c$, and $E_w,E_v$ are the energy scales of the $W,V$ operators.
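As a small worked step, the two expressions quoted in this abstract can be combined into a single logarithm. This is a one-line rearrangement, not a formula taken from the paper, and it assumes the logarithm in the delay acts on the whole product $\beta^2 E_w E_v$:

$$
t_* - \frac{\beta}{2\pi}\log\left(\beta^2 E_w E_v\right)
= \frac{\beta}{2\pi}\log c - \frac{\beta}{2\pi}\log\left(\beta^2 E_w E_v\right)
= \frac{\beta}{2\pi}\log\frac{c}{\beta^2 E_w E_v}.
$$

Read this way, the delay is long when $c$ is much larger than $\beta^2 E_w E_v$.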

preprint, 2015, arXiv

Hawking-Page transition in holographic massive gravity

We study the Hawking-Page transition in a holographic model of field theories with momentum dissipation. We find that the deconfinement temperature strictly decreases as momentum dissipation is increased. For sufficiently strong momentum dissipation, the critical temperature goes to zero, indicating a zero-temperature deconfinement transition in the dual field theory.

preprint, 2015, arXiv

The goldstone and goldstino of supersymmetric inflation

We construct the minimal effective field theory (EFT) of supersymmetric inflation, whose field content is a real scalar, the goldstone for time-translation breaking, and a Weyl fermion, the goldstino for supersymmetry (SUSY) breaking. The inflating background can be viewed as a single SUSY-breaking sector, and the degrees of freedom can be efficiently parameterized using constrained superfields. Our EFT is comprised of a chiral superfield $X_{NL}$ containing the goldstino and satisfying $X_{NL}^2 = 0$, and a real superfield $B_{NL}$ containing both the goldstino and the goldstone, satisfying $X_{NL} B_{NL} = B_{NL}^3 = 0$. We match results from our EFT formalism to existing results for SUSY broken by a fluid background, showing that the goldstino propagates with subluminal velocities. The same effect can also be derived from the unitary gauge gravitino action after embedding our EFT in supergravity. If the gravitino mass is comparable to the Hubble scale during inflation, we identify a new parameter in the EFT related to a time-dependent phase of the gravitino mass parameter. We briefly comment on the leading contributions of goldstino loops to inflationary observables.

preprint, 2013, arXiv

On memory in exponentially expanding spaces

We examine the degree to which fluctuating dynamics on exponentially expanding spaces remember initial conditions. In de Sitter space, the global late-time configuration of a free scalar field always contains information about early fluctuations. By contrast, fluctuations near the boundary of Euclidean Anti-de Sitter may or may not remember conditions in the center, with a transition at $\Delta = d/2$. We connect these results to literature about statistical mechanics on trees and make contact with the observation by Anninos and Denef that the configuration space of a massless dS field exhibits ultrametricity. We extend their analysis to massive fields, finding that preference for isosceles triangles persists as long as $\Delta_- < d/4$.
