Researcher profile

Vasudev Shyam

Vasudev Shyam contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

Do Sparse Autoencoders Capture Concept Manifolds?

Sparse autoencoders (SAEs) are widely used to extract interpretable features from neural network representations, often under the implicit assumption that concepts correspond to independent linear directions. However, a growing body of evidence suggests that many concepts are instead organized along low-dimensional manifolds encoding continuous geometric relationships. This raises three basic questions: what does it mean for an SAE to capture a manifold, when do existing SAE architectures do so, and how? We develop a theoretical framework that answers these questions and show that SAEs can capture manifolds in two fundamentally different ways: globally, by allocating a compact group of atoms whose linear span contains the entire manifold, or locally, by distributing it across features that each selectively tile a restricted region of the underlying geometry. Empirically, we find that SAEs suboptimally recover continuous structures, mixing the global subspace and local tiling solutions in a fragmented regime we call dilution. This explains why manifold structure is rarely visible at the level of individual concepts and motivates post-hoc unsupervised discovery methods that search for coherent groups of atoms rather than isolated directions. More broadly, our results suggest that future representation learning methods should treat geometric objects, not just individual directions, as the basic units of interpretability.

preprint2026arXiv

Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior

Neural representations carry rich geometric structure; but does that structure causally shape behavior? To address this question, we intervene along paths through activation space defined by different geometries, and measure the behavioral trajectories they induce. In particular, we test whether interventions that respect the geometry of activation space will yield behaviors close to those the model exhibits naturally. Concretely, we first fit an activation manifold $M_h$ to representations and a behavior manifold $M_y$ to output probability distributions. We then test the link $M_h \leftrightarrow M_y$ via interventions: we find that steering along $M_h$, which we term manifold steering, yields behavioral trajectories that follow $M_y$, while linear steering -- which assumes a Euclidean geometry -- cuts through off-manifold regions and hence produces unnatural outputs. Moreover, optimizing interventions in activation space to produce paths along $M_y$ recovers activation trajectories that trace the curvature of $M_h$. We demonstrate this bidirectional relationship between the geometry of representation and behavior across tasks and modalities. In language models, we use reasoning tasks with cyclic and sequential geometries as well as in-context learning tasks with more complex graph geometries. In a video world model, we use a task with geometry corresponding to physical dynamics. Overall, our work shows that geometry in neural representation is not merely incidental, but is in fact the proper object for enabling principled control via intervention on internals. This recasts the core problem of steering from finding the right direction to finding the right geometry.

preprint2022arXiv

de Sitter Microstates from $T\bar T+Λ_2$ and the Hawking-Page Transition

We obtain microstates accounting for the Gibbons-Hawking entropy in $dS_3$, along with a subleading logarithmic correction, from the solvable $T\bar T+Λ_2$ deformation of a seed CFT with sparse light spectrum. The microstates arise as the dressed CFT states near dimension $Δ=c/6$, associated with the Hawking-Page transition; they dominate the real spectrum of the deformed theory. We exhibit an analogue of the Hawking-Page transition in de Sitter. Appropriate generalizations of the $T\bar T+Λ_2$ deformation are required to treat model-dependent local bulk physics (subleading at large central charge) and higher dimensions. These results add considerably to the already strong motivation for the continued pursuit of such generalizations along with a more complete characterization of $T\bar T$ type theories, building from existing results in these directions.

preprint2022arXiv

Non-local Field Theory from Matrix Models

We show that a class of matrix theories can be understood as an extension of quantum field theory which has non-local interactions. This reformulation is based on the Wigner-Weyl transformation, and the interactions take the form of Moyal product on a doubled geometry. We recover local dynamics on the spacetime as a low-energy limit. This framework opens up the possibility for studying novel high-energy phenomena, including the unification of gauge and geometric symmetries in a gauge theory.

preprint2020arXiv

$T\bar{T}$ deformed YM$_{2}$ on general backgrounds from an integral transformation

We consider the $T\bar{T}$ deformation of two dimensional Yang--Mills theory on general curved backgrounds. We compute the deformed partition function through an integral transformation over frame fields weighted by a Gaussian kernel. We show that this partition function satisfies a flow equation which has been derived previously in the literature, which now holds on general backgrounds. We connect ambiguities associated to first derivative terms in the flow equation to the normalization of the functional integral over frame fields. We then compute the entanglement entropy for a general state in the theory. The connection to the string theoretic description of the theory is also investigated.

preprint2019arXiv

Quantum corrections to finite radius holography and holographic entanglement entropy

We calculate quantum corrections to holographic entanglement entropy in the proposed duality between $T\bar{T}$-deformed holographic 2D CFTs and gravity in AdS$_{3}$ with a finite cutoff. We first establish the dictionary between the two theories by mapping the flow equation of the deformed CFT to the bulk Wheeler-DeWitt equation. The latter reduces to an ordinary differential equation for the sphere partition function, which we solve to find the entanglement entropy for an entangling surface consisting of two antipodal points on a sphere. The entanglement entropy in the inverse central charge expansion yields the expectation value of the bulk length operator plus the entropy of length fluctuations, in accordance with the Ryu--Takayanagi formula and its generalization due to Faulkner, Lewkowycz, and Maldacena. Special attention is paid to the conformal mode problem and its resolution by a choice of contour for the gravitational path integral.