Source author record

Vinay V. Ramasesh

Vinay V. Ramasesh appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Machine Learning, quant-ph, Computation and Language, Computer Vision, cond-mat.mes-hall, cond-mat.quant-gas, cond-mat.supr-con

Catalog footprint

What is connected

5 works

7 topics

4 close collaborators

preprint, 2022, arXiv

The geometry of integration in text classification RNNs

Despite the widespread application of recurrent neural networks (RNNs) across a variety of tasks, a unified understanding of how RNNs solve these tasks remains elusive. In particular, it is unclear what dynamical patterns arise in trained RNNs, and how those patterns depend on the training dataset or task. This work addresses these questions in the context of a specific natural language processing task: text classification. Using tools from dynamical systems analysis, we study recurrent networks trained on a battery of both natural and synthetic text classification tasks. We find the dynamics of these trained RNNs to be both interpretable and low-dimensional. Specifically, across architectures and datasets, RNNs accumulate evidence for each class as they process the text, using a low-dimensional attractor manifold as the underlying mechanism. Moreover, the dimensionality and geometry of the attractor manifold are determined by the structure of the training dataset; in particular, we describe how simple word-count statistics computed on the training dataset can be used to predict these properties. Our observations span multiple architectures and datasets, reflecting a common mechanism RNNs employ to perform text classification. To the degree that integration of evidence towards a decision is a common computational primitive, this work lays the foundation for using dynamical systems techniques to study the inner workings of RNNs.

preprint, 2020, arXiv

Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics

A central challenge in developing versatile machine learning systems is catastrophic forgetting: a model trained on tasks in sequence will suffer significant performance drops on earlier tasks. Despite the ubiquity of catastrophic forgetting, there is limited understanding of the underlying process and its causes. In this paper, we address this important knowledge gap, investigating how forgetting affects representations in neural network models. Through representational analysis techniques, we find that deeper layers are disproportionately the source of forgetting. Supporting this, a study of methods to mitigate forgetting illustrates that they act to stabilize deeper layers. These insights enable the development of an analytic argument and empirical picture relating the degree of forgetting to representational similarity between tasks. Consistent with this picture, we observe maximal forgetting occurs for task sequences with intermediate similarity. We perform empirical studies on the standard split CIFAR-10 setup and also introduce a novel CIFAR-100 based task approximating realistic input distribution shift.

preprint, 2016, arXiv

Dynamics of simultaneously measured non-commuting observables

In quantum mechanics, measurements cause wavefunction collapse that yields precise outcomes, whereas for non-commuting observables such as position and momentum Heisenberg's uncertainty principle limits the intrinsic precision of a state. Although theoretical work has demonstrated the possibility to perform simultaneous non-commuting measurements and has revealed the limits on measurement outcomes, only recently has the dynamics of the quantum state been discussed. To realize this unexplored regime, we simultaneously apply two continuous quantum non-demolition probes of non-commuting observables to a superconducting qubit. We implement multiple readout channels by coupling the qubit to multiple modes of a cavity. To control the measurement observables, we implement a 'single quadrature' measurement by driving the qubit and applying cavity sidebands with a relative phase that sets the observable. Here, we show that the uncertainty principle governs the dynamics of the wavefunction by enforcing a lower bound on the measurement-induced disturbance. Consequently, as we transition from measuring identical to measuring non-commuting observables, the dynamics make a smooth transition from standard wavefunction collapse to persistent diffusion. Although the evolution of the state differs from that of a conventional measurement, information about both observables is extracted by keeping track of the time ordering of the measurement record, enabling quantum state tomography without alternating measurements. Our work creates novel capabilities for quantum control, including rapid state purification, adaptive measurement, measurement-based state steering and continuous quantum error correction. As physical systems often interact continuously with their environment via non-commuting degrees of freedom, our work offers a way to study how notions of contemporary quantum foundations arise in such settings.

preprint, 2015, arXiv

A Quantum Gas Microscope for Fermionic Atoms

Strongly interacting fermions define the properties of complex matter at all densities, from atomic nuclei to modern solid state materials and neutron stars. Ultracold atomic Fermi gases have emerged as a pristine platform for the study of many-fermion systems. Here we realize a quantum gas microscope for fermionic $^{40}$K atoms trapped in an optical lattice, which allows one to probe strongly correlated fermions at the single atom level. We combine 3D Raman sideband cooling with high-resolution optics to simultaneously cool and image individual atoms with single lattice site resolution at a detection fidelity above $95\%$. The imaging process leaves each atom predominantly in the 3D ground state of its lattice site, inviting the implementation of a Maxwell's demon to assemble low-entropy many-body states. Single site resolved imaging of fermions enables the direct observation of magnetic order, time resolved measurements of the spread of particle correlations, and the detection of many-fermion entanglement.

preprint, 2015, arXiv

Cooling and Autonomous Feedback in a Bose-Hubbard chain with Attractive Interactions

We engineer a quantum bath that enables entropy and energy exchange with a one-dimensional Bose-Hubbard lattice with attractive on-site interactions. We implement this in an array of three superconducting transmon qubits coupled to a single cavity mode; the transmons represent lattice sites and their excitation quanta embody bosonic particles. Our cooling protocol preserves particle number, realizing a canonical ensemble, and also affords the efficient preparation of dark states which, due to symmetry, cannot be prepared via coherent drives on the cavity. Furthermore, by applying continuous microwave radiation, we also realize autonomous feedback to indefinitely stabilize particular eigenstates of the array.
