Researcher profile

Amit Vikram Singh

Amit Vikram Singh contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2022arXiv

Explicitising The Implicit Intrepretability of Deep Neural Networks Via Duality

Recent work by Lakshminarayanan and Singh [2020] provided a dual view for fully connected deep neural networks (DNNs) with rectified linear units (ReLU). It was shown that (i) the information in the gates is analytically characterised by a kernel called the neural path kernel (NPK) and (ii) most critical information is learnt in the gates, in that, given the learnt gates, the weights can be retrained from scratch without significant loss in performance. Using the dual view, in this paper, we rethink the conventional interpretations of DNNs thereby explicitsing the implicit interpretability of DNNs. Towards this, we first show new theoretical properties namely rotational invariance and ensemble structure of the NPK in the presence of convolutional layers and skip connections respectively. Our theory leads to two surprising empirical results that challenge conventional wisdom: (i) the weights can be trained even with a constant 1 input, (ii) the gating masks can be shuffled, without any significant loss in performance. These results motivate a novel class of networks which we call deep linearly gated networks (DLGNs). DLGNs using the phenomenon of dual lifting pave way to more direct and simpler interpretation of DNNs as opposed to conventional interpretations. We show via extensive experiments on CIFAR-10 and CIFAR-100 that these DLGNs lead to much better interpretability-accuracy tradeoff.

preprint2020arXiv

Deep Gated Networks: A framework to understand training and generalisation in deep learning

Understanding the role of (stochastic) gradient descent (SGD) in the training and generalisation of deep neural networks (DNNs) with ReLU activation has been the object study in the recent past. In this paper, we make use of deep gated networks (DGNs) as a framework to obtain insights about DNNs with ReLU activation. In DGNs, a single neuronal unit has two components namely the pre-activation input (equal to the inner product the weights of the layer and the previous layer outputs), and a gating value which belongs to $[0,1]$ and the output of the neuronal unit is equal to the multiplication of pre-activation input and the gating value. The standard DNN with ReLU activation, is a special case of the DGNs, wherein the gating value is $1/0$ based on whether or not the pre-activation input is positive or negative. We theoretically analyse and experiment with several variants of DGNs, each variant suited to understand a particular aspect of either training or generalisation in DNNs with ReLU activation. Our theory throws light on two questions namely i) why increasing depth till a point helps in training and ii) why increasing depth beyond a point hurts training? We also present experimental evidence to show that gate adaptation, i.e., the change of gating value through the course of training is key for generalisation.

preprint2012arXiv

Studies On Falling Ball Viscometry

A new method of accurate calculation of the coefficient of viscosity of a test liquid from experimentally measured terminal velocity of a ball falling in the test liquid contained in a narrow tube is described. The calculation requires the value of a multiplicative correction factor to the apparent coefficient of viscosity calculated by substitution of terminal velocity of the falling ball in Stokes formula. This correction factor, the so-called viscosity ratio, a measure of deviation from Stokes limit, arises from non-vanishing values of the Reynolds number and the ball/tube radius ratio. The method, valid over a very wide range of Reynolds number, is based on the recognition of a relationship between two measures of wall effect, the more widely investigated velocity ratio, defined as the ratio of terminal velocity in a confined medium to that in a boundless medium and viscosity ratio. The calculation uses two recently published correlation formulae based on extensive experimental results on terminal velocity of a falling ball. The first formula relates velocity ratio to Reynolds number and ball-tube radius ratio. The second formula gives an expression of the ratio of the drag force actually sensed by the ball falling in an infinite medium to that in the Stokes limit as a function of Reynolds number alone. It is shown that appropriate use of this correction factor extends the utility of the technique of falling ball viscometry beyond the very low Reynolds number 'creepy flow' regime, to which its application is presently restricted. Issues related to accuracy are examined by use of our own measurements of the terminal velocity of a falling ball in a narrow tube and that of published literature reports, on liquids of known viscosity coefficient.