Source author record

James Stokes

James Stokes appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Topics: hep-th, Machine Learning, quant-ph, cond-mat.str-el, gr-qc, hep-lat, astro-ph.CO, cond-mat.dis-nn, Artificial Intelligence, Computer Science and Game Theory, cond-mat.stat-mech, hep-ph, math-ph, math.MP, math.NA, Numerical Analysis, physics.chem-ph, physics.comp-ph, q-fin.MF

Catalog footprint

What is connected

20 works

19 topics

4 close collaborators


preprint, 2022, arXiv

Fermionic Wave Functions from Neural-Network Constrained Hidden States

We introduce a systematically improvable family of variational wave functions for the simulation of strongly correlated fermionic systems. This family consists of Slater determinants in an augmented Hilbert space involving "hidden" additional fermionic degrees of freedom. These determinants are projected onto the physical Hilbert space through a constraint which is optimized, together with the single-particle orbitals, using a neural network parametrization. This construction draws inspiration from the success of hidden particle representations but overcomes the limitations associated with the mean-field treatment of the constraint often used in this context. Our construction provides an extremely expressive family of wave functions, which is proven to be universal. We apply this construction to the ground state properties of the Hubbard model on the square lattice, achieving levels of accuracy which are competitive with state-of-the-art variational methods.

preprint, 2022, arXiv

Gauge equivariant neural networks for quantum lattice gauge theories

Gauge symmetries play a key role in physics, appearing in areas such as quantum field theories of the fundamental particles and emergent degrees of freedom in quantum materials. Motivated by the desire to efficiently simulate many-body quantum systems with exact local gauge invariance, gauge equivariant neural-network quantum states are introduced, which exactly satisfy the local Hilbert space constraints necessary for the description of quantum lattice gauge theory with $Z_d$ gauge group on different geometries. Focusing on the special case of $Z_2$ gauge group on a periodically identified square lattice, the equivariant architecture is analytically shown to contain the loop-gas solution as a special case. Gauge equivariant neural-network quantum states are used in combination with variational quantum Monte Carlo to obtain compact descriptions of the ground state wavefunction for the $Z_2$ theory away from the exactly solvable limit, and to demonstrate the confining/deconfining phase transition of the Wilson loop order parameter.

preprint, 2022, arXiv

Numerical and geometrical aspects of flow-based variational quantum Monte Carlo

This article aims to summarize recent and ongoing efforts to simulate continuous-variable quantum systems using flow-based variational quantum Monte Carlo techniques, focusing for pedagogical purposes on the example of bosons in the field amplitude (quadrature) basis. Particular emphasis is placed on the variational real- and imaginary-time evolution problems, carefully reviewing the stochastic estimation of the time-dependent variational principles and their relationship with information geometry. Some practical instructions are provided to guide the implementation of a PyTorch code. The review is intended to be accessible to researchers interested in machine learning and quantum information science.

preprint, 2022, arXiv

Quantum-inspired variational algorithms for partial differential equations: Application to financial derivative pricing

Variational quantum Monte Carlo (VMC) combined with neural-network quantum states offers a novel angle of attack on the curse-of-dimensionality encountered in a particular class of partial differential equations (PDEs); namely, the real- and imaginary time-dependent Schrödinger equation. In this paper, we present a simple generalization of VMC applicable to arbitrary time-dependent PDEs, showcasing the technique in the multi-asset Black-Scholes PDE for pricing European options contingent on many correlated underlying assets.

preprint, 2022, arXiv

Scalable neural quantum states architecture for quantum chemistry

Variational optimization of neural-network representations of quantum states has been successfully applied to solve interacting fermionic problems. Despite rapid developments, significant scalability challenges arise when considering molecules of large scale, which correspond to non-locally interacting quantum spin Hamiltonians consisting of sums of thousands or even millions of Pauli operators. In this work, we introduce scalable parallelization strategies to improve neural-network-based variational quantum Monte Carlo calculations for ab-initio quantum chemistry applications. We establish GPU-supported local energy parallelism to compute the optimization objective for Hamiltonians of potentially complex molecules. Using autoregressive sampling techniques, we demonstrate systematic improvement in wall-clock timings required to achieve CCSD baseline target energies. The performance is further enhanced by accommodating the structure of resultant spin Hamiltonians into the autoregressive sampling ordering. The algorithm achieves promising performance in comparison with the classical approximate methods and exhibits both running time and scalability advantages over existing neural-network based methods.

preprint, 2020, arXiv

Holographic 2-Point Functions in the Pseudo-Conformal Universe

We holographically calculate two-point functions in the pseudo-conformal universe, an early universe alternative to inflation. The pseudo-conformal universe can be modeled as a defect conformal field theory, where the reheating surface is a codimension-1 spacelike defect which breaks the conformal algebra to a de Sitter subalgebra. The dual spacetime geometries are domain walls with de-Sitter symmetry in an asymptotically anti-de Sitter spacetime. We compute 2-point functions of scalars and stress tensors by solving the linearized equations for scalar and tensor fluctuations about these backgrounds.

preprint, 2020, arXiv

Quantum Natural Gradient

A quantum generalization of Natural Gradient Descent is presented as part of a general-purpose optimization framework for variational quantum circuits. The optimization dynamics is interpreted as moving in the steepest descent direction with respect to the Quantum Information Geometry, corresponding to the real part of the Quantum Geometric Tensor (QGT), also known as the Fubini-Study metric tensor. An efficient algorithm is presented for computing a block-diagonal approximation to the Fubini-Study metric tensor for parametrized quantum circuits, which may be of independent interest.

preprint, 2019, arXiv

Fisher-Rao Metric, Geometry, and Complexity of Neural Networks

We study the relationship between geometry and capacity measures for deep neural networks from an invariance viewpoint. We introduce a new notion of capacity, the Fisher-Rao norm, that possesses desirable invariance properties and is motivated by Information Geometry. We discover an analytical characterization of the new capacity measure, through which we establish norm-comparison inequalities and further show that the new measure serves as an umbrella for several existing norm-based complexity measures. We discuss upper bounds on the generalization error induced by the proposed measure. Extensive numerical experiments on CIFAR-10 support our theoretical findings. Our theoretical analysis rests on a key structural lemma about partial derivatives of multi-layer rectifier networks.

preprint, 2019, arXiv

Interaction Matters: A Note on Non-asymptotic Local Convergence of Generative Adversarial Networks

Motivated by the pursuit of a systematic computational and algorithmic understanding of Generative Adversarial Networks (GANs), we present a simple yet unified non-asymptotic local convergence theory for smooth two-player games, which subsumes several discrete-time gradient-based saddle point dynamics. The analysis reveals the surprising nature of the off-diagonal interaction term as both a blessing and a curse. On the one hand, this interaction term explains the origin of the slow-down effect in the convergence of Simultaneous Gradient Ascent (SGA) to stable Nash equilibria. On the other hand, for the unstable equilibria, exponential convergence can be proved thanks to the interaction term, for four modified dynamics proposed to stabilize GAN training: Optimistic Mirror Descent (OMD), Consensus Optimization (CO), Implicit Updates (IU) and Predictive Method (PM). The analysis uncovers the intimate connections among these stabilizing techniques, and provides detailed characterization on the choice of learning rate. As a by-product, we present a new analysis for OMD proposed in Daskalakis, Ilyas, Syrgkanis, and Zeng [2017] with improved rates.

preprint, 2019, arXiv

Probabilistic Modeling with Matrix Product States

Inspired by the possibility that generative models based on quantum circuits can provide a useful inductive bias for sequence modeling tasks, we propose an efficient training algorithm for a subset of classically simulable quantum circuit models. The gradient-free algorithm, presented as a sequence of exactly solvable effective models, is a modification of the density matrix renormalization group procedure adapted for learning a probability distribution. The conclusion that circuit-based models offer a useful inductive bias for classical datasets is supported by experimental results on the parity learning problem.

preprint, 2015, arXiv

Holographic CFTs on maximally symmetric spaces: correlators, integral transforms and applications

We study one and two point functions of conformal field theories on spaces of maximal symmetry with and without boundaries and investigate their spectral representations. Integral transforms are found, relating the spectral decomposition to renormalized position space correlators. Several applications are presented, including the holographic boundary CFTs as well as spacelike boundary CFTs, which provide realizations of the pseudo-conformal universe.

preprint, 2015, arXiv

Nonlinear Sigma Models with Compact Hyperbolic Target Spaces

We explore the phase structure of nonlinear sigma models with target spaces corresponding to compact quotients of hyperbolic space, focusing on the case of a hyperbolic genus-2 Riemann surface. The continuum theory of these models can be approximated by a lattice spin system which we simulate using Monte Carlo methods. The target space possesses interesting geometric and topological properties which are reflected in novel features of the sigma model. In particular, we observe a topological phase transition at a critical temperature, above which vortices proliferate, reminiscent of the Kosterlitz-Thouless phase transition in the $O(2)$ model. Unlike in the $O(2)$ case, there are many different types of vortices, suggesting a possible analogy to the Hagedorn treatment of statistical mechanics of a proliferating number of hadron species. Below the critical temperature the spins cluster around six special points in the target space known as Weierstrass points. The diversity of compact hyperbolic manifolds suggests that our model is only the simplest example of a broad class of statistical mechanical models whose main features can be understood essentially in geometric terms.

preprint, 2015, arXiv

The curious case of large-N expansions on a (pseudo)sphere

We elucidate the large-N dynamics of one-dimensional sigma models with spherical and hyperbolic target spaces and find a duality between the Lagrange multiplier and the angular momentum. In the hyperbolic model we propose a new class of operators based on the irreducible representations of hyperbolic space. We also uncover unexpected zero modes which lead to the double scaling of the 1/N expansion and explore these modes using Gelfand-Dikiy equations.

preprint, 2014, arXiv

Holography for a Non-Inflationary Early Universe

We construct a gravitational dual of the pseudo-conformal universe, a proposed alternative to inflation in which a conformal field theory in nearly flat space develops a time-dependent vacuum expectation value. Constructing this dual amounts to finding five-dimensional domain-wall spacetimes with anti-de Sitter asymptotics, for which the wall has the symmetries of four-dimensional de Sitter space. This holographically realizes the characteristic symmetry breaking pattern O(2,4) to O(1,4) of the pseudo-conformal universe. We present an explicit example with a massless scalar field, using holographic renormalization to obtain general expressions for the renormalized scalar and stress-tensor one-point functions. We discuss the relationship between these solutions and those of four-dimensional holographic defect conformal field theories which break O(2,4) to O(2,3).

preprint, 2013, arXiv

Cosmological perturbations of massive gravity coupled to DBI Galileons

Certain scalar fields with higher derivative interactions and novel classical and quantum mechanical properties, the Galileons, can be naturally covariantized by coupling to nonlinear massive gravity in such a way that their symmetries and number of degrees of freedom are unchanged. We study the propagating degrees of freedom in these models around cosmologically interesting backgrounds. We identify the conditions necessary for such a theory to remain ghost free, and consider when tachyonic instabilities can be avoided. We show that on the self-accelerating branch of solutions, the kinetic terms for the vector and scalar modes of the massive graviton vanish, as in the case of pure massive gravity.

preprint, 2013, arXiv

Cosmologies of extended massive gravity

We study the background cosmology of two extensions of dRGT massive gravity. The first is variable mass massive gravity, where the fixed graviton mass of dRGT is replaced by the expectation value of a scalar field. We ask whether self-inflation can be driven by the self-accelerated branch of this theory, and we find that, while such solutions can exist for a short period, they cannot be sustained for a cosmologically useful time. Furthermore, we demonstrate that there generally exist future curvature singularities of the "big brake" form in cosmological solutions to these theories. The second extension is the covariant coupling of galileons to massive gravity. We find that, as in pure dRGT gravity, flat FRW solutions do not exist. Open FRW solutions do exist: they consist of a branch of self-accelerating solutions that are identical to those of dRGT, and a new second branch of solutions which do not appear in dRGT.

preprint, 2013, arXiv

Massive gravity coupled to DBI Galileons is ghost free

It is possible to couple Dirac-Born-Infeld (DBI) scalars possessing generalized Galilean internal shift symmetries (Galileons) to nonlinear massive gravity in four dimensions, in such a manner that the interactions maintain the Galilean symmetry. Such a construction is of interest because it is not possible to couple such fields to massless General Relativity in the same way. We show that this theory has the primary constraint necessary to eliminate the Boulware-Deser ghost, thus preserving the attractive properties of both the Galileons and ghost-free massive gravity.

preprint, 2012, arXiv

Heterotic Kink Solitons and their Worldvolume Action

We present a formalism for computing the higher-order corrections to the worldvolume action of a co-dimension one kink soliton embedded in five-dimensional heterotic M-theory. The geometry of heterotic M-theory, as well as the effective theory which describes a five-brane wrapping a holomorphic curve by a topological kink in a scalar field, is reviewed. Using this formalism, the explicit worldvolume action is computed to second order in two expansion parameters, one describing the "warp" of the heterotic geometry and the second the fluctuation length of the soliton hypersurface. The result is expressed in terms of the trace of the extrinsic curvature and the intrinsic curvature scalar.

preprint, 2012, arXiv

The Worldvolume Action of Kink Solitons in AdS Spacetime

A formalism is presented for computing the higher-order corrections to the worldvolume action of co-dimension one solitons. By modifying its potential, an explicit "kink" solution of a real scalar field in AdS spacetime is found. The formalism is then applied to explicitly compute the kink worldvolume action to quadratic order in two expansion parameters, associated with the hypersurface fluctuation length and the radius of AdS spacetime, respectively. Two alternative methods are given for doing this. The results are expressed in terms of the trace of the extrinsic curvature and the intrinsic scalar curvature. In addition to conformal Galileon interactions, we find a non-Galileon term which is never sub-dominant. This method can be extended to any conformally flat bulk spacetime.

preprint, 2010, arXiv

Fermion Masses in Emergent Electroweak Symmetry Breaking

We consider the generation of fermion masses in an emergent model of electroweak symmetry breaking with composite $W,Z$ gauge bosons. A universal bulk fermion profile in a warped extra dimension is used for all fermion flavors. Electroweak symmetry is broken at the UV (or Planck) scale where boundary mass terms are added to generate the fermion flavor structure. This leads to flavor-dependent nonuniversality in the gauge couplings. The effects are suppressed for the light fermion generations but are enhanced for the top quark where the $Zt{\bar t}$ and $Wt{\bar b}$ couplings can deviate at the $10$-$20\%$ level in the minimal setup. By the AdS/CFT correspondence our model implies that electroweak symmetry is not a fundamental gauge symmetry. Instead the Standard Model with massive fermions and $W,Z$ gauge bosons is an effective chiral Lagrangian for some underlying confining strong dynamics at the TeV scale, where mass is generated without a Higgs mechanism.
