Researcher profile

Ashutosh Kumar

Ashutosh Kumar contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

Generative or Discriminative? Revisiting Text Classification in the Era of Transformers

The comparison between discriminative and generative classifiers has intrigued researchers since Efron's seminal analysis of logistic regression versus discriminant analysis. While early theoretical work established that generative classifiers exhibit lower sample complexity but higher asymptotic error in simple linear settings, these trade-offs remain unexplored in the transformer era. We present the first comprehensive evaluation of modern generative and discriminative architectures - Auto-regressive modeling, Masked Language Modeling, Discrete Diffusion, and Encoders for text classification. Our study reveals that the classical 'two regimes' phenomenon manifests distinctly across different architectures and training paradigms. Beyond accuracy, we analyze sample efficiency, calibration, noise robustness, and ordinality across diverse scenarios. Our findings offer practical guidance for selecting the most suitable modeling approach based on real-world constraints such as latency and data limitations.

preprint2023arXiv

Quantum simulation of molecular response properties

Accurate modeling of the response of molecular systems to an external electromagnetic field is challenging on classical computers, especially in the regime of strong electronic correlation. In this paper, we develop a quantum linear response (qLR) theory to calculate molecular response properties on near-term quantum computers. Inspired by the recently developed variants of the quantum counterpart of equation of motion (qEOM) theory, the qLR formalism employs "killer condition" satisfying excitation operator manifolds that offers a number of theoretical advantages along with reduced quantum resource requirements. We also used the qEOM framework in this work to calculate state-specific response properties. Further, through noise-less quantum simulations, we show that response properties calculated using the qLR approach are more accurate than the ones obtained from the classical coupled-cluster based linear response models due to the improved quality of the ground-state wavefunction obtained using the ADAPT-VQE algorithm.

preprint2022arXiv

Accurate quantum simulation of molecular ground and excited states with a transcorrelated Hamiltonian

NISQ era devices suffer from a number of challenges like limited qubit connectivity, short coherence times and sizable gate error rates. Thus, quantum algorithms are desired that require shallow circuit depths and low qubit counts to take advantage of these devices. We attempt to realize this with the help of classical quantum chemical theories of canonical transformation and explicit correlation. In this work, compact ab initio Hamiltonians are generated classically through an approximate similarity transformation of the Hamiltonian with a) an explicitly correlated two-body unitary operator with generalized pair excitations that remove the Coulombic electron-electron singularities from the Hamiltonian and b) a unitary one-body operator to efficiently capture the orbital relaxation effects required for accurate description of the excited states. The resulting transcorelated Hamiltonians are able to describe both ground and excited states of molecular systems in a balanced manner. Using the fermionic-ADAPT-VQE method based on the unitary coupled cluster with singles and doubles (UCCSD) ansatz and only a minimal basis set (ANO-RCC-MB), we demonstrate that the transcorrelated Hamiltonians can produce ground state energies comparable to the much larger cc-pVTZ basis. This leads to a potential reduction in the number of required CNOT gates by more than three orders of magnitude for the chemical species studied in this work. Furthermore, using the qEOM formalism in conjunction with the transcorrelated Hamiltonian, we reduce the errors in excitation energies by an order of magnitude. The transcorrelated Hamiltonians developed here are Hermitian and contain only one- and two-body interaction terms and thus can be easily combined with any quantum algorithm for accurate electronic structure simulations.

preprint2022arXiv

Striking a Balance: Alleviating Inconsistency in Pre-trained Models for Symmetric Classification Tasks

While fine-tuning pre-trained models for downstream classification is the conventional paradigm in NLP, often task-specific nuances may not get captured in the resultant models. Specifically, for tasks that take two inputs and require the output to be invariant of the order of the inputs, inconsistency is often observed in the predicted labels or confidence scores. We highlight this model shortcoming and apply a consistency loss function to alleviate inconsistency in symmetric classification. Our results show an improved consistency in predictions for three paraphrase detection datasets without a significant drop in the accuracy scores. We examine the classification performance of six datasets (both symmetric and non-symmetric) to showcase the strengths and limitations of our approach.

preprint2021arXiv

Quantum simulation of electronic structure with a transcorrelated Hamiltonian: improved accuracy with a smaller footprint on the quantum computer

Quantum simulations of electronic structure with a transformed Hamiltonian that includes some electron correlation effects are demonstrated. The transcorrelated Hamiltonian used in this work is efficiently constructed classically, at polynomial cost, by an approximate similarity transformation with an explicitly correlated two-body unitary operator. This Hamiltonian is Hermitian, includes no more than two-particle interactions, and is free of electron-electron singularities. We investigate the effect of such a transformed Hamiltonian on the accuracy and computational cost of quantum simulations by focusing on a widely used solver for the Schrodinger equation, namely the variational quantum eigensolver method, based on the unitary coupled cluster with singles and doubles (q-UCCSD) Ansatz. Nevertheless, the formalism presented here translates straightforwardly to other quantum algorithms for chemistry. Our results demonstrate that a transcorrelated Hamiltonian, paired with extremely compact bases, produces explicitly correlated energies comparable to those from much larger bases. For the chemical species studied here, explicitly correlated energies based on an underlying 6-31G basis had cc-pVTZ quality. The use of the very compact transcorrelated Hamiltonian reduces the number of CNOT gates required to achieve cc-pVTZ quality by up to two orders of magnitude, and the number of qubits by a factor of three.

preprint2020arXiv

Syntax-guided Controlled Generation of Paraphrases

Given a sentence (e.g., "I like mangoes") and a constraint (e.g., sentiment flip), the goal of controlled text generation is to produce a sentence that adapts the input sentence to meet the requirements of the constraint (e.g., "I hate mangoes"). Going beyond such simple constraints, recent works have started exploring the incorporation of complex syntactic-guidance as constraints in the task of controlled paraphrase generation. In these methods, syntactic-guidance is sourced from a separate exemplar sentence. However, these prior works have only utilized limited syntactic information available in the parse tree of the exemplar sentence. We address this limitation in the paper and propose Syntax Guided Controlled Paraphraser (SGCP), an end-to-end framework for syntactic paraphrase generation. We find that SGCP can generate syntax conforming sentences while not compromising on relevance. We perform extensive automated and human evaluations over multiple real-world English language datasets to demonstrate the efficacy of SGCP over state-of-the-art baselines. To drive future research, we have made SGCP's source code available

preprint2020arXiv

Transients generate memory and break hyperbolicity in stochastic enzymatic networks

The hyperbolic dependence of catalytic rate on substrate concentration is a classical result in enzyme kinetics, quantified by the celebrated Michaelis-Menten equation. The ubiquity of this relation in diverse chemical and biological contexts has recently been rationalized by a graph-theoretic analysis of deterministic reaction networks. Experiments, however, have revealed that "molecular noise" - intrinsic stochasticity at the molecular scale - leads to significant deviations from classical results and to unexpected effects like "molecular memory", i.e., the breakdown of statistical independence between turnover events. Here we show, through a new method of analysis, that memory and non-hyperbolicity have a common source in an initial, and observably long, transient peculiar to stochastic reaction networks of multiple enzymes. Networks of single enzymes do not admit such transients. The transient yields, asymptotically, to a steady-state in which memory vanishes and hyperbolicity is recovered. We propose new statistical measures, defined in terms of turnover times, to distinguish between the transient and steady states and apply these to experimental data from a landmark experiment that first observed molecular memory in a single enzyme with multiple binding sites. Our study shows that catalysis at the molecular level with more than one enzyme always contains a non-classical regime and provides insight on how the classical limit is attained.