Source author record

K V Subrahmanyam

K V Subrahmanyam appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Computer Vision, Machine Learning, math.RT, Computational Complexity, Cryptography and Security

Catalog footprint

What is connected

4 works

5 topics

4 close collaborators

Preprint, 2022, arXiv

Geometric Complexity Theory -- Lie Algebraic Methods for Projective Limits of Stable Points

Let $G$ be a connected reductive group acting on a complex vector space $V$ and projective space ${\mathbb P}V$. Let $x\in V$ and ${\cal H}\subseteq {\cal G}$ be the Lie algebra of its stabilizer. Our objective is to understand points $[y]$, and their stabilizers, which occur in the vicinity of $[x]$. We construct an explicit ${\cal G}$-action on a suitable neighbourhood of $x$, which we call the local model at $x$. We show that Lie algebras of stabilizers of points in the vicinity of $x$ are parameterized by subspaces of ${\cal H}$. When ${\cal H}$ is reductive these are Lie subalgebras of ${\cal H}$. If the orbit of $x$ is closed this also follows from Luna's theorem. Our construction involves a map connected to the local curvature form at $x$. We apply the local model to forms, when the form $g$ is obtained from the form $f$ as the leading term of a one-parameter family acting on $f$. We show that there is a flattening ${\cal K}_0$ of ${\cal K}$, the stabilizer of $f$, which sits as a subalgebra of ${\cal H}$, the stabilizer of $g$. We specialize to the case of forms $f$ whose $SL(X)$-orbits are affine, and the orbit of $g$ is of codimension $1$. We show that (i) either ${\cal H}$ has a very simple structure, or (ii) conjugates of the elements of ${\cal K}$ also stabilize $g$ and the tangent of exit. Next, we apply this to the adjoint action. We show that for a general matrix $X$, the signatures of nilpotent matrices in its projective orbit closure (under conjugation) are determined by the multiplicity data of the spectrum of $X$. Finally, we formulate the path problem of finding paths with specific properties from $y$ to its limit points $x$ as an optimization problem using local differential geometry. Our study is motivated by Geometric Complexity Theory proposed by the second author and Ketan Mulmuley.
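
As a small illustration of how a nilpotent matrix can arise in a projective orbit closure under conjugation, consider the following $2\times 2$ computation. It is our own example and is not taken from the paper; the matrices $X$, $g_s$ and $E_{12}$ are chosen only for illustration.

$$
X = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad
g_s = \begin{pmatrix} 1 & s \\ 0 & 1 \end{pmatrix}, \qquad
g_s X g_s^{-1} = \begin{pmatrix} 1 & -2s \\ 0 & -1 \end{pmatrix}.
$$

Rescaling by $-1/(2s)$, which does not change the point of projective space, gives

$$
\left[ g_s X g_s^{-1} \right]
= \left[ \begin{pmatrix} -\tfrac{1}{2s} & 1 \\ 0 & \tfrac{1}{2s} \end{pmatrix} \right]
\;\longrightarrow\;
\left[ \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix} \right] = [E_{12}]
\quad \text{as } s \to \infty .
$$

So $[E_{12}]$ is a limit of points in the projective orbit of $X$, although no conjugate of $X$ itself is nilpotent.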

Preprint, 2020, arXiv

How do SGD hyperparameters in natural training affect adversarial robustness?

Learning rate, batch size and momentum are three important hyperparameters in the SGD algorithm. It is known from the work of Jastrzebski et al. (arXiv:1711.04623) that large-batch training of neural networks yields models which do not generalize well. Yao et al. (arXiv:1802.08241) observe that large-batch training yields models that have poor adversarial robustness. In the same paper, the authors train models with different batch sizes and compute the eigenvalues of the Hessian of the loss function. They observe that as the batch size increases, the dominant eigenvalues of the Hessian become larger. They also show that both adversarial training and small-batch training lead to a drop in the dominant eigenvalues of the Hessian, that is, a lowering of its spectrum. They combine adversarial training and second-order information to come up with a new large-batch training algorithm and obtain robust models with good generalization. In this paper, we empirically observe the effect of the SGD hyperparameters on the accuracy and adversarial robustness of networks trained with unperturbed samples. Jastrzebski et al. considered training models with a fixed learning rate to batch size ratio. They observed that the higher the ratio, the better the generalization. We observe that networks trained with a constant learning rate to batch size ratio, as proposed in Jastrzebski et al., yield models which generalize well and also have almost constant adversarial robustness, independent of the batch size. We observe that momentum is more effective with varying batch sizes and a fixed learning rate than with constant learning rate to batch size ratio based SGD training.
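
The constant learning rate to batch size ratio mentioned above can be sketched in a few lines. This is only a minimal illustration, not the authors' training code: the helper names `lr_for_batch_size` and `sgd_momentum_step`, the reference values 0.1 and 128, and the heavy-ball form of the momentum update are all assumptions made for the example.

```python
def lr_for_batch_size(ratio: float, batch_size: int) -> float:
    """Return the learning rate that keeps lr / batch_size equal to `ratio`."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return ratio * batch_size


def sgd_momentum_step(w, v, grad, lr, momentum=0.9):
    """One heavy-ball SGD update on plain Python lists (assumed update form)."""
    v_new = [momentum * vi - lr * gi for vi, gi in zip(v, grad)]
    w_new = [wi + vi for wi, vi in zip(w, v_new)]
    return w_new, v_new


# Hypothetical reference point: learning rate 0.1 at batch size 128.
ratio = 0.1 / 128

# Scale the learning rate linearly so the ratio stays fixed across batch sizes.
schedule = {b: lr_for_batch_size(ratio, b) for b in (64, 128, 256, 512)}

for batch_size, lr in schedule.items():
    print(f"batch_size={batch_size:4d}  lr={lr:.4f}  lr/batch={lr / batch_size:.6f}")
```

Under this scheme, doubling the batch size doubles the learning rate. The alternative setting compared in the abstract keeps the learning rate fixed while the batch size varies.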

Preprint, 2020, arXiv

On Universalized Adversarial and Invariant Perturbations

Convolutional neural networks or standard CNNs (StdCNNs) are translation-equivariant models that achieve translation invariance when trained on data augmented with sufficient translations. Recent work on equivariant models for a given group of transformations (e.g., rotations) has led to group-equivariant convolutional neural networks (GCNNs). GCNNs trained on data augmented with sufficient rotations achieve rotation invariance. Recent work by the authors (arXiv:2002.11318) studies a trade-off between invariance and robustness to adversarial attacks. In another related work (arXiv:2005.08632), given any model and any input-dependent attack that satisfies a certain spectral property, the authors propose a universalization technique called SVD-Universal to produce a universal adversarial perturbation by looking at very few test examples. In this paper, we study the effectiveness of SVD-Universal on GCNNs as they gain rotation invariance through a higher degree of training augmentation. We empirically observe that as GCNNs gain rotation invariance through training augmented with larger rotations, the fooling rate of SVD-Universal improves. To understand this phenomenon, we introduce universal invariant directions and study their relation to the universal adversarial direction produced by SVD-Universal.
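
The abstract does not spell out how SVD-Universal builds its perturbation. The sketch below assumes, as the name suggests, that the universal direction is the top right singular vector of the matrix whose rows are the normalized per-example attack directions, and that the fooling rate is the fraction of inputs whose predicted label changes. The function names and the power-iteration routine are illustrative choices, not the authors' implementation.

```python
import math


def normalize(v):
    """Scale a vector to unit Euclidean norm (zero vectors are returned as-is)."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n > 0 else list(v)


def top_right_singular_vector(rows, iters=200):
    """Power iteration on A^T A, where the matrix A has the given rows."""
    dim = len(rows[0])
    # Deterministic, arbitrary start; it must not be orthogonal to the answer.
    u = normalize([1.0 + 0.01 * j for j in range(dim)])
    for _ in range(iters):
        proj = [sum(r[j] * u[j] for j in range(dim)) for r in rows]  # A u
        nxt = [sum(proj[i] * rows[i][j] for i in range(len(rows)))
               for j in range(dim)]                                  # A^T (A u)
        if all(abs(x) < 1e-15 for x in nxt):
            break
        u = normalize(nxt)
    return u


def universal_direction(perturbations):
    """Assumed SVD-Universal step: top singular direction of unit-norm attack vectors."""
    return top_right_singular_vector([normalize(p) for p in perturbations])


def fooling_rate(clean_labels, perturbed_labels):
    """Fraction of inputs whose predicted label changes under the perturbation."""
    changed = sum(c != p for c, p in zip(clean_labels, perturbed_labels))
    return changed / len(clean_labels)


# Toy per-example attack vectors that all lie along the direction (0.6, 0.8, 0).
perturbations = [[3.0, 4.0, 0.0], [-6.0, -8.0, 0.0], [1.5, 2.0, 0.0]]
u = universal_direction(perturbations)
print([round(x, 3) for x in u])
```

In practice the attack vectors would come from an input-dependent attack on a handful of test examples, and the resulting direction would be scaled to the allowed perturbation norm before being added to every input.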

Preprint, 2016, arXiv

Relating tensor structures on representations of general linear and symmetric groups

For polynomial representations of $GL_n$ of a fixed degree, H. Krause defined a new internal tensor product using the language of strict polynomial functors. We show that over an arbitrary commutative base ring $k$, the Schur functor carries this internal tensor product to the usual Kronecker tensor product of symmetric group representations. This is true even at the level of derived categories. The new tensor product is a substantial enrichment of the Kronecker tensor product. For example, in modular representation theory it brings in homological phenomena not visible on the symmetric group side. We calculate the internal tensor product over any $k$ in several interesting cases involving classical functors and the Weyl functors. We show an application to the Kronecker problem in characteristic zero when one partition has two rows or is a hook.