Researcher profile

K V Subrahmanyam

K V Subrahmanyam contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2022arXiv

Geometric Complexity Theory -- Lie Algebraic Methods for Projective Limits of Stable Points

Let $G$ be a connected reductive group acting on a complex vector space $V$ and projective space ${\mathbb P}V$. Let $x\in V$ and ${\cal H}\subseteq {\cal G}$ be the Lie algebra of its stabilizer. Our objective is to understand points $[y]$, and their stabilizers which occur in the vicinity of $[x]$. We construct an explicit ${\cal G}$-action on a suitable neighbourhood of $x$, which we call the local model at $x$. We show that Lie algebras of stabilizers of points in the vicinity of $x$ are parameterized by subspaces of ${\cal H}$. When ${\cal H}$ is reductive these are Lie subalgebras of ${\cal H}$. If the orbit of $x$ is closed this also follows from Luna's theorem. Our construction involves a map connected to the local curvature form at $x$. We apply the local model to forms, when the form $g$ is obtained from the form $f$ as the leading term of a one parameter family acting on $f$. We show that there is a flattening ${\cal K}_0$ of ${\cal K}$, the stabilizer of $f$ which sits as a subalgebra of ${\cal H}$, the stabilizer $g$. We specialize to the case of forms $f$ whose $SL(X)$-orbits are affine, and the orbit of $g$ is of co-dimension $1$. We show that (i) either ${\cal H}$ has a very simple structure, or (ii) conjugates of the elements of ${\cal K}$ also stabilize $g$ and the tangent of exit. Next, we apply this to the adjoint action. We show that for a general matrix $X$, the signatures of nilpotent matrices in its projective orbit closure (under conjugation) are determined by the multiplicity data of the spectrum of $X$. Finally, we formulate the path problem of finding paths with specific properties from $y$ to its limit points $x$ as an optimization problem using local differential geometry. Our study is motivated by Geometric Complexity Theory proposed by the second author and Ketan Mulmuley.

preprint2020arXiv

How do SGD hyperparameters in natural training affect adversarial robustness?

Learning rate, batch size and momentum are three important hyperparameters in the SGD algorithm. It is known from the work of Jastrzebski et al. arXiv:1711.04623 that large batch size training of neural networks yields models which do not generalize well. Yao et al. arXiv:1802.08241 observe that large batch training yields models that have poor adversarial robustness. In the same paper, the authors train models with different batch sizes and compute the eigenvalues of the Hessian of loss function. They observe that as the batch size increases, the dominant eigenvalues of the Hessian become larger. They also show that both adversarial training and small-batch training leads to a drop in the dominant eigenvalues of the Hessian or lowering its spectrum. They combine adversarial training and second order information to come up with a new large-batch training algorithm and obtain robust models with good generalization. In this paper, we empirically observe the effect of the SGD hyperparameters on the accuracy and adversarial robustness of networks trained with unperturbed samples. Jastrzebski et al. considered training models with a fixed learning rate to batch size ratio. They observed that higher the ratio, better is the generalization. We observe that networks trained with constant learning rate to batch size ratio, as proposed in Jastrzebski et al., yield models which generalize well and also have almost constant adversarial robustness, independent of the batch size. We observe that momentum is more effective with varying batch sizes and a fixed learning rate than with constant learning rate to batch size ratio based SGD training.

preprint2020arXiv

On Universalized Adversarial and Invariant Perturbations

Convolutional neural networks or standard CNNs (StdCNNs) are translation-equivariant models that achieve translation invariance when trained on data augmented with sufficient translations. Recent work on equivariant models for a given group of transformations (e.g., rotations) has lead to group-equivariant convolutional neural networks (GCNNs). GCNNs trained on data augmented with sufficient rotations achieve rotation invariance. Recent work by authors arXiv:2002.11318 studies a trade-off between invariance and robustness to adversarial attacks. In another related work arXiv:2005.08632, given any model and any input-dependent attack that satisfies a certain spectral property, the authors propose a universalization technique called SVD-Universal to produce a universal adversarial perturbation by looking at very few test examples. In this paper, we study the effectiveness of SVD-Universal on GCNNs as they gain rotation invariance through higher degree of training augmentation. We empirically observe that as GCNNs gain rotation invariance through training augmented with larger rotations, the fooling rate of SVD-Universal gets better. To understand this phenomenon, we introduce universal invariant directions and study their relation to the universal adversarial direction produced by SVD-Universal.