Researcher profile

Ajay Gupta

Ajay Gupta contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2022arXiv

Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks

Quantization, knowledge distillation, and magnitude pruning are among the most popular methods for neural network compression in NLP. Independently, these methods reduce model size and can accelerate inference, but their relative benefit and combinatorial interactions have not been rigorously studied. For each of the eight possible subsets of these techniques, we compare accuracy vs. model size tradeoffs across six BERT architecture sizes and eight GLUE tasks. We find that quantization and distillation consistently provide greater benefit than pruning. Surprisingly, except for the pair of pruning and quantization, using multiple methods together rarely yields diminishing returns. Instead, we observe complementary and super-multiplicative reductions to model size. Our work quantitatively demonstrates that combining compression methods can synergistically reduce model size, and that practitioners should prioritize (1) quantization, (2) knowledge distillation, and (3) pruning to maximize accuracy vs. model size tradeoffs.

preprint2022arXiv

CovidMis20: COVID-19 Misinformation Detection System on Twitter Tweets using Deep Learning Models

Online news and information sources are convenient and accessible ways to learn about current issues. For instance, more than 300 million people engage with posts on Twitter globally, which provides the possibility to disseminate misleading information. There are numerous cases where violent crimes have been committed due to fake news. This research presents the CovidMis20 dataset (COVID-19 Misinformation 2020 dataset), which consists of 1,375,592 tweets collected from February to July 2020. CovidMis20 can be automatically updated to fetch the latest news and is publicly available at: https://github.com/everythingguy/CovidMis20. This research was conducted using Bi-LSTM deep learning and an ensemble CNN+Bi-GRU for fake news detection. The results showed that, with testing accuracy of 92.23% and 90.56%, respectively, the ensemble CNN+Bi-GRU model consistently provided higher accuracy than the Bi-LSTM model.

preprint2019arXiv

Evolution of magnetic anisotropy in cobalt film on nanopatterned silicon substrate studied in situ using MOKE

Evolution of magnetization behaviour of cobalt film on nano patterned silicon substrate, with film thickness, has been studied. In situ magneto-optical Kerr effect measurements during film deposition allowed us to study genuine thickness dependence of magnetization behaviour, all other parameters like surface topology, deposition conditions remaining invariant. The film exhibits uniaxial magnetic anisotropy, with its magnitude decreasing with increasing film thickness. Analysis shows that anisotropy has contributions from both, i) exchange energy which is volume dependent and, ii) stray dipolar fields at the surface/interface. This suggests that local magnetization follows only partially the topology of the rippled surface. As expected from energy considerations, for small film thickness, the local magnetization closely follows the surface contour of the ripples making the volume term as the dominant contribution. With increasing film thickness, the local magnetization gradually deviates from the local slope and approaches towards a uniform magnetization along the macroscopic film plane making the surface term as the dominant contribution. Significant deviation from the anisotropy energy expected on the basis of theoretical considerations can be attributed to several factors like, deviation of surface topology from an ideal sinusoidal wave, breaks of continuity along the ripple direction, defects like pattern dislocations, and possible decrease in surface modulation depth with increasing film thickness.