Source author record

Neha Gupta

Neha Gupta appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

nucl-th Machine Learning physics.soc-ph Social and Information Networks Computation and Language Cryptography and Security cs.CY Information Retrieval math.CT math.PR Other Computer Science Populations and Evolution Systems and Control

Catalog footprint

What is connected

16works

13topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Ministral 3

We introduce the Ministral 3 series, a family of parameter-efficient dense language models designed for compute and memory constrained applications, available in three model sizes: 3B, 8B, and 14B parameters. For each model size, we release three variants: a pretrained base model for general-purpose use, an instruction finetuned, and a reasoning model for complex problem-solving. In addition, we present our recipe to derive the Ministral 3 models through Cascade Distillation, an iterative pruning and continued training with distillation technique. Each model comes with image understanding capabilities, all under the Apache 2.0 license.

preprint2022arXiv

Ensembling over Classifiers: a Bias-Variance Perspective

Ensembles are a straightforward, remarkably effective method for improving the accuracy,calibration, and robustness of models on classification tasks; yet, the reasons that underlie their success remain an active area of research. We build upon the extension to the bias-variance decomposition by Pfau (2013) in order to gain crucial insights into the behavior of ensembles of classifiers. Introducing a dual reparameterization of the bias-variance tradeoff, we first derive generalized laws of total expectation and variance for nonsymmetric losses typical of classification tasks. Comparing conditional and bootstrap bias/variance estimates, we then show that conditional estimates necessarily incur an irreducible error. Next, we show that ensembling in dual space reduces the variance and leaves the bias unchanged, whereas standard ensembling can arbitrarily affect the bias. Empirically, standard ensembling reducesthe bias, leading us to hypothesize that ensembles of classifiers may perform well in part because of this unexpected reduction.We conclude by an empirical analysis of recent deep learning methods that ensemble over hyperparameters, revealing that these techniques indeed favor bias reduction. This suggests that, contrary to classical wisdom, targeting bias reduction may be a promising direction for classifier ensembles.

preprint2022arXiv

Understanding the bias-variance tradeoff of Bregman divergences

This paper builds upon the work of Pfau (2013), which generalized the bias variance tradeoff to any Bregman divergence loss function. Pfau (2013) showed that for Bregman divergences, the bias and variances are defined with respect to a central label, defined as the mean of the label variable, and a central prediction, of a more complex form. We show that, similarly to the label, the central prediction can be interpreted as the mean of a random variable, where the mean operates in a dual space defined by the loss function itself. Viewing the bias-variance tradeoff through operations taken in dual space, we subsequently derive several results of interest. In particular, (a) the variance terms satisfy a generalized law of total variance; (b) if a source of randomness cannot be controlled, its contribution to the bias and variance has a closed form; (c) there exist natural ensembling operations in the label and prediction spaces which reduce the variance and do not affect the bias.

preprint2020arXiv

Active Local Learning

In this work we consider active local learning: given a query point $x$, and active access to an unlabeled training set $S$, output the prediction $h(x)$ of a near-optimal $h \in H$ using significantly fewer labels than would be needed to actually learn $h$ fully. In particular, the number of label queries should be independent of the complexity of $H$, and the function $h$ should be well-defined, independent of $x$. This immediately also implies an algorithm for distance estimation: estimating the value $opt(H)$ from many fewer labels than needed to actually learn a near-optimal $h \in H$, by running local learning on a few random query points and computing the average error. For the hypothesis class consisting of functions supported on the interval $[0,1]$ with Lipschitz constant bounded by $L$, we present an algorithm that makes $O(({1 / ε^6}) \log(1/ε))$ label queries from an unlabeled pool of $O(({L / ε^4})\log(1/ε))$ samples. It estimates the distance to the best hypothesis in the class to an additive error of $ε$ for an arbitrary underlying distribution. We further generalize our algorithm to more than one dimensions. We emphasize that the number of labels used is independent of the complexity of the hypothesis class which depends on $L$. Furthermore, we give an algorithm to locally estimate the values of a near-optimal function at a few query points of interest with number of labels independent of $L$. We also consider the related problem of approximating the minimum error that can be achieved by the Nadaraya-Watson estimator under a linear diagonal transformation with eigenvalues coming from a small range. For a $d$-dimensional pointset of size $N$, our algorithm achieves an additive approximation of $ε$, makes $\tilde{O}({d}/{ε^2})$ queries and runs in $\tilde{O}({d^2}/{ε^{d+4}}+{dN}/{ε^2})$ time.

preprint2020arXiv

Implicit regularization for deep neural networks driven by an Ornstein-Uhlenbeck like process

We consider networks, trained via stochastic gradient descent to minimize $\ell_2$ loss, with the training labels perturbed by independent noise at each iteration. We characterize the behavior of the training dynamics near any parameter vector that achieves zero training error, in terms of an implicit regularization term corresponding to the sum over the data points, of the squared $\ell_2$ norm of the gradient of the model with respect to the parameter vector, evaluated at each data point. This holds for networks of any connectivity, width, depth, and choice of activation function. We interpret this implicit regularization term for three simple settings: matrix sensing, two layer ReLU networks trained on one-dimensional data, and two layer networks with sigmoid activations trained on a single datapoint. For these settings, we show why this new and general implicit regularization effect drives the networks towards "simple" models.

preprint2020arXiv

Projections for COVID-19 spread in India and its worst affected five states using the Modified SEIRD and LSTM models

The last leg of the year 2019 gave rise to a virus named COVID-19 (Corona Virus Disease 2019). Since the beginning of this infection in India, the government implemented several policies and restrictions to curtail its spread among the population. As the time passed, these restrictions were relaxed and people were advised to follow precautionary measures by themselves. These timely decisions taken by the Indian government helped in decelerating the spread of COVID-19 to a large extent. Despite these decisions, the pandemic continues to spread and hence, there is an urgent need to plan and control the spread of this disease. This is possible by finding the future predictions about the spread. Scientists across the globe are working towards estimating the future growth of COVID-19. This paper proposes a Modified SEIRD (Susceptible-Exposed-Infected-Recovered-Deceased) model for projecting COVID-19 infections in India and its five states having the highest number of total cases. In this model, exposed compartment contains individuals which may be asymptomatic but infectious. Deep Learning based Long Short-Term Memory (LSTM) model has also been used in this paper to perform short-term projections. The projections obtained from the proposed Modified SEIRD model have also been compared with the projections made by LSTM for next 30 days. The epidemiological data up to 15th August 2020 has been used for carrying out predictions in this paper. These predictions will help in arranging adequate medical infrastructure and providing proper preventive measures to handle the current pandemic. The effect of different lockdowns imposed by the Indian government has also been used in modelling and analysis in the proposed Modified SEIRD model. The results presented in this paper will act as a beacon for future policy-making to control the COVID-19 spread in India.

preprint2016arXiv

Induced monoidal structure from the functor

Let $\mathcal{B}$ be a subcategory of a given category $\mathcal{D}$. Let $\mathcal{B}$ has monoidal structure. In this article, we discuss when can one extend the monoidal structure of $\mathcal{B}$ to $\mathcal{D}$ such that $\mathcal{B}$ becomes a sub monoidal category of monoidal category $\mathcal{D}$. Examples are discussed, and in particular, in an example of loop space, we elaborated all results discussed in this article.

preprint2014arXiv

bit.ly/malicious: Deep Dive into Short URL based e-Crime Detection

Existence of spam URLs over emails and Online Social Media (OSM) has become a massive e-crime. To counter the dissemination of long complex URLs in emails and character limit imposed on various OSM (like Twitter), the concept of URL shortening has gained a lot of traction. URL shorteners take as input a long URL and output a short URL with the same landing page (as in the long URL) in return. With their immense popularity over time, URL shorteners have become a prime target for the attackers giving them an advantage to conceal malicious content. Bitly, a leading service among all shortening services is being exploited heavily to carry out phishing attacks, work-from-home scams, pornographic content propagation, etc. This imposes additional performance pressure on Bitly and other URL shorteners to be able to detect and take a timely action against the illegitimate content. In this study, we analyzed a dataset of 763,160 short URLs marked suspicious by Bitly in the month of October 2013. Our results reveal that Bitly is not using its claimed spam detection services very effectively. We also show how a suspicious Bitly account goes unnoticed despite of a prolonged recurrent illegitimate activity. Bitly displays a warning page on identification of suspicious links, but we observed this approach to be weak in controlling the overall propagation of spam. We also identified some short URL based features and coupled them with two domain specific features to classify a Bitly URL as malicious or benign and achieved an accuracy of 86.41%. The feature set identified can be generalized to other URL shortening services as well. To the best of our knowledge, this is the first large scale study to highlight the issues with the implementation of Bitly's spam detection policies and proposing suitable countermeasures.

preprint2014arXiv

Exploration of gaps in Bitly's spam detection and relevant counter measures

Existence of spam URLs over emails and Online Social Media (OSM) has become a growing phenomenon. To counter the dissemination issues associated with long complex URLs in emails and character limit imposed on various OSM (like Twitter), the concept of URL shortening gained a lot of traction. URL shorteners take as input a long URL and give a short URL with the same landing page in return. With its immense popularity over time, it has become a prime target for the attackers giving them an advantage to conceal malicious content. Bitly, a leading service in this domain is being exploited heavily to carry out phishing attacks, work from home scams, pornographic content propagation, etc. This imposes additional performance pressure on Bitly and other URL shorteners to be able to detect and take a timely action against the illegitimate content. In this study, we analyzed a dataset marked as suspicious by Bitly in the month of October 2013 to highlight some ground issues in their spam detection mechanism. In addition, we identified some short URL based features and coupled them with two domain specific features to classify a Bitly URL as malicious / benign and achieved a maximum accuracy of 86.41%. To the best of our knowledge, this is the first large scale study to highlight the issues with Bitly's spam detection policies and proposing a suitable countermeasure.

preprint2013arXiv

Antikaons and higher order couplings in relativistic-mean field study of neutron stars

We investigate the role of higher order couplings, along with the condensation of antikaons ($K^-$ and $\bar K^0$), on the properties of neutron star (NS). We employ extended versions of the relativistic mean-field model, in which kaon-nucleon and nucleon-nucleon interactions are taken on the same footing. We find that the onset of condensation of $K^-$ and $\bar K^0$ highly depends not only on the strength of optical potential but also on the new couplings. The presence of antikaons leads to a softer equation of state and makes the neutron star core symmetric and lepton-deficient. We show that these effects strongly influence the mass-radius relation as well as the composition of neutron star. We also show that the recently observed 1.97$\pm$.04 solar mass NS can be explained in three ways: (i) a stiffer EoS with both antikaons, (ii) a relatively soft EoS with $K^-$ and (iii) a softer EoS without antikaons.

preprint2013arXiv

Antikaons in neutron star studied with recent versions of relativistic mean-field models

We study the impact of additional couplings in the relativistic mean field (RMF) models, in conjunction with antikaon condensation, on various neutron star properties. We analyze different properties such as in-medium antikaon and nucleon effective masses, antikaon energies, chemical potentials and the mass-radius relations of neutron star (NS). We calculate the NS properties with the RMF (NL3), E-RMF (G1, G2) and FSU2.1 models, which are quite successful in explaining several finite nuclear properties. Our results show that the onset of kaon condensation in NS strongly depends on the parameters of the Lagrangian, especially the additional couplings which play a significant role at higher densities where antikaons dominate the behavior of equation of state.

preprint2013arXiv

Pasta phases in neutron star studied with extended relativistic mean field models

To explain several properties of finite nuclei, infinite matter, and neutron stars in a unified way within the relativistic mean field models, it is important to extend them either with higher order couplings or with density-dependent couplings. These extensions are known to have strong impact in the high-density regime. Here we explore their role on the equation of state at densities lower than the saturation density of finite nuclei which govern the phase transitions associated with pasta structures in the crust of neutron stars.

preprint2013arXiv

The Pin-Bang Theory: Discovering The Pinterest World

Pinterest is an image-based online social network, which was launched in the year 2010 and has gained a lot of traction, ever since. Within 3 years, Pinterest has attained 48.7 million unique users. This stupendous growth makes it interesting to study Pinterest, and gives rise to multiple questions about it's users, and content. We characterized Pinterest on the basis of large scale crawls of 3.3 million user profiles, and 58.8 million pins. In particular, we explored various attributes of users, pins, boards, pin sources, and user locations, in detail and performed topical analysis of user generated textual content. The characterization revealed most prominent topics among users and pins, top image sources, and geographical distribution of users on Pinterest. We then investigated this social network from a privacy and security standpoint, and found traces of malware in the form of pin sources. Instances of Personally Identifiable Information (PII) leakage were also discovered in the form of phone numbers, BBM (Blackberry Messenger) pins, and email addresses. Further, our analysis demonstrated how Pinterest is a potential venue for copyright infringement, by showing that almost half of the images shared on Pinterest go uncredited. To the best of our knowledge, this is the first attempt to characterize Pinterest at such a large scale.

preprint2013arXiv

The size of most massive neutron stars may reveal its exotic cores

The recent high precision observation of the most massive pulsar J1614-2230 with $(1.97\pm0.04)$ solar mass ($M_\odot$) was reported with a suggestion that many nuclear models which consider exotic particles in the core could be ruled out. However, many recent calculations could explain this star with various exotic particles, rendering the precise mass measurements insufficient to conclude on exotic cores. We examine the sensitivity of the radius of such a star to the details of its core. With our calculations and analysis, here we show that, for the most massive neutron star, with a precise observation of its radius it is possible to ascertain the presence of exotic cores.

preprint2012arXiv

Low Power Low Voltage Bulk Driven Balanced OTA

The last few decades, a great deal of attention has been paid to low-voltage (LV) low-power (LP) integrated circuits design since the power consumption has become a critical issue. Among many techniques used for the design of LV LP analog circuits, the Bulk-driven principle offers a promising route towards this design for many aspects mainly the simplicity and using the conventional MOS technology to implement these designs. This paper is devoted to the Bulk-driven (BD) principle and utilizing this principle to design LV LP building block of Operational Transconductance Amplifier (OTA) in standard CMOS processes and supply voltage 0.9V. The simulation results have been carried out by the Spice simulator using the 130nm CMOS technology from TSMC.

preprint2012arXiv

Role of higher order couplings in the presence of kaons in relativistic mean field description of neutron stars

We discuss the role of higher order couplings in conjunction with kaon condensation using recent versions of relativistic mean field models.We focus on an interaction (G2) in which all the parameters are obtained by fitting the finite nuclear data and successfully applied to reproduce a variety of nuclear properties. Our results show that the higher order couplings play a significant role at higher densities where kaons dominate the behavior of the equation of state. We compare our results with other interactions (NLl, NL3, G1, and FSUGold) and show that the new couplings bring down the mass of a neutron star (NS), which is further reduced in the presence of kaons to yield results consistent with the present observational constraints. We show that the composition of the NS varies with the parameter sets.

Neha Gupta

What is connected

Connect this record

See the researcher in context

Building this map preview

16 published item(s)

Ministral 3

Ensembling over Classifiers: a Bias-Variance Perspective

Understanding the bias-variance tradeoff of Bregman divergences

Active Local Learning

Implicit regularization for deep neural networks driven by an Ornstein-Uhlenbeck like process

Projections for COVID-19 spread in India and its worst affected five states using the Modified SEIRD and LSTM models

Induced monoidal structure from the functor

bit.ly/malicious: Deep Dive into Short URL based e-Crime Detection

Exploration of gaps in Bitly's spam detection and relevant counter measures

Antikaons and higher order couplings in relativistic-mean field study of neutron stars

Antikaons in neutron star studied with recent versions of relativistic mean-field models

Pasta phases in neutron star studied with extended relativistic mean field models

The Pin-Bang Theory: Discovering The Pinterest World

The size of most massive neutron stars may reveal its exotic cores

Low Power Low Voltage Bulk Driven Balanced OTA

Role of higher order couplings in the presence of kaons in relativistic mean field description of neutron stars