Researcher profile

Akash Gupta

Akash Gupta contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2022arXiv

A-ACT: Action Anticipation through Cycle Transformations

While action anticipation has garnered a lot of research interest recently, most of the works focus on anticipating future action directly through observed visual cues only. In this work, we take a step back to analyze how the human capability to anticipate the future can be transferred to machine learning algorithms. To incorporate this ability in intelligent systems a question worth pondering upon is how exactly do we anticipate? Is it by anticipating future actions from past experiences? Or is it by simulating possible scenarios based on cues from the present? A recent study on human psychology explains that, in anticipating an occurrence, the human brain counts on both systems. In this work, we study the impact of each system for the task of action anticipation and introduce a paradigm to integrate them in a learning framework. We believe that intelligent systems designed by leveraging the psychological anticipation models will do a more nuanced job at the task of human action prediction. Furthermore, we introduce cyclic transformation in the temporal dimension in feature and semantic label space to instill the human ability of reasoning of past actions based on the predicted future. Experiments on Epic-Kitchen, Breakfast, and 50Salads dataset demonstrate that the action anticipation model learned using a combination of the two systems along with the cycle transformation performs favorably against various state-of-the-art approaches.

preprint2022arXiv

Properties of the radius valley around low mass stars: Predictions from the core-powered mass-loss mechanism

In recent years, analyzing the bimodality in the size distribution of small planets, i.e., the `radius valley', has given us unprecedented insight into the planet formation process. Here we explore the properties of the radius valley for low mass stars, assuming that the core-powered mass-loss is the dominant process shaping the small exoplanet population. We show that the slope of radius valley in the planet size-orbital period space, to first-order, does not vary with stellar mass and has a negative slope of $\text{d log}R_p/\text{d log}P \simeq -0.11$ even for stars as small as 0.1 $M_\odot$, as observed in latest studies. Furthermore, we find that the slope of the radius valley in the planet size-stellar mass space is $\text{d log}R_p/\text{d log}M_\ast \simeq (3 ζ- 2)/36$ where $ζ$ is given by the stellar mass-luminosity relation $L_\ast \propto M_\ast^ζ$. Because $ζ$ is $\gtrsim$ 2 and increases with stellar mass, we predict that the radius valley has a positive slope in the planet size-stellar mass space across FGKM dwarfs. This slope, however, decreases (increases) in magnitude towards lower (higher) mass stars, due to the variation of $ζ$ with stellar mass. While around 1.0 $M_\odot$ stars the slope is $\text{d log}R_p/\text{d log}M_\ast \sim 0.37$, it is as low as $\sim 0.13$ around 0.1 $M_\odot$ stars. In addition, we find that the radius valley is narrower and less empty around lower mass stars. Finally, we show that predictions for the radius valley for core-powered mass-loss and photoevaporation become increasingly distinct for lower mass stars.

preprint2022arXiv

UltraMNIST Classification: A Benchmark to Train CNNs for Very Large Images

Convolutional neural network (CNN) approaches available in the current literature are designed to work primarily with low-resolution images. When applied on very large images, challenges related to GPU memory, smaller receptive field than needed for semantic correspondence and the need to incorporate multi-scale features arise. The resolution of input images can be reduced, however, with significant loss of critical information. Based on the outlined issues, we introduce a novel research problem of training CNN models for very large images, and present 'UltraMNIST dataset', a simple yet representative benchmark dataset for this task. UltraMNIST has been designed using the popular MNIST digits with additional levels of complexity added to replicate well the challenges of real-world problems. We present two variants of the problem: 'UltraMNIST classification' and 'Budget-aware UltraMNIST classification'. The standard UltraMNIST classification benchmark is intended to facilitate the development of novel CNN training methods that make the effective use of the best available GPU resources. The budget-aware variant is intended to promote development of methods that work under constrained GPU memory. For the development of competitive solutions, we present several baseline models for the standard benchmark and its budget-aware variant. We study the effect of reducing resolution on the performance and present results for baseline models involving pretrained backbones from among the popular state-of-the-art models. Finally, with the presented benchmark dataset and the baselines, we hope to pave the ground for a new generation of CNN methods suitable for handling large images in an efficient and resource-light manner.

preprint2020arXiv

Adversarial Knowledge Transfer from Unlabeled Data

While machine learning approaches to visual recognition offer great promise, most of the existing methods rely heavily on the availability of large quantities of labeled training data. However, in the vast majority of real-world settings, manually collecting such large labeled datasets is infeasible due to the cost of labeling data or the paucity of data in a given domain. In this paper, we present a novel Adversarial Knowledge Transfer (AKT) framework for transferring knowledge from internet-scale unlabeled data to improve the performance of a classifier on a given visual recognition task. The proposed adversarial learning framework aligns the feature space of the unlabeled source data with the labeled target data such that the target classifier can be used to predict pseudo labels on the source data. An important novel aspect of our method is that the unlabeled source data can be of different classes from those of the labeled target data, and there is no need to define a separate pretext task, unlike some existing approaches. Extensive experiments well demonstrate that models learned using our approach hold a lot of promise across a variety of visual recognition tasks on multiple standard datasets.

preprint2020arXiv

ALANET: Adaptive Latent Attention Network forJoint Video Deblurring and Interpolation

Existing works address the problem of generating high frame-rate sharp videos by separately learning the frame deblurring and frame interpolation modules. Most of these approaches have a strong prior assumption that all the input frames are blurry whereas in a real-world setting, the quality of frames varies. Moreover, such approaches are trained to perform either of the two tasks - deblurring or interpolation - in isolation, while many practical situations call for both. Different from these works, we address a more realistic problem of high frame-rate sharp video synthesis with no prior assumption that input is always blurry. We introduce a novel architecture, Adaptive Latent Attention Network (ALANET), which synthesizes sharp high frame-rate videos with no prior knowledge of input frames being blurry or not, thereby performing the task of both deblurring and interpolation. We hypothesize that information from the latent representation of the consecutive frames can be utilized to generate optimized representations for both frame deblurring and frame interpolation. Specifically, we employ combination of self-attention and cross-attention module between consecutive frames in the latent space to generate optimized representation for each frame. The optimized representation learnt using these attention modules help the model to generate and interpolate sharp frames. Extensive experiments on standard datasets demonstrate that our method performs favorably against various state-of-the-art approaches, even though we tackle a much more difficult problem.

preprint2020arXiv

Non-Adversarial Video Synthesis with Learned Priors

Most of the existing works in video synthesis focus on generating videos using adversarial learning. Despite their success, these methods often require input reference frame or fail to generate diverse videos from the given data distribution, with little to no uniformity in the quality of videos that can be generated. Different from these methods, we focus on the problem of generating videos from latent noise vectors, without any reference input frames. To this end, we develop a novel approach that jointly optimizes the input latent space, the weights of a recurrent neural network and a generator through non-adversarial learning. Optimizing for the input latent space along with the network weights allows us to generate videos in a controlled environment, i.e., we can faithfully generate all videos the model has seen during the learning process as well as new unseen videos. Extensive experiments on three challenging and diverse datasets well demonstrate that our approach generates superior quality videos compared to the existing state-of-the-art methods.

preprint2020arXiv

Optimal Sepsis Patient Treatment using Human-in-the-loop Artificial Intelligence

Sepsis is one of the leading causes of death in Intensive Care Units (ICU). The strategy for treating sepsis involves the infusion of intravenous (IV) fluids and administration of antibiotics. Determining the optimal quantity of IV fluids is a challenging problem due to the complexity of a patient's physiology. In this study, we develop a data-driven optimization solution that derives the optimal quantity of IV fluids for individual patients. The proposed method minimizes the probability of severe outcomes by controlling the prescribed quantity of IV fluids and utilizes human-in-the-loop artificial intelligence. We demonstrate the performance of our model on 1122 ICU patients with sepsis diagnosis extracted from the MIMIC-III dataset. The results show that, on average, our model can reduce mortality by 22%. This study has the potential to help physicians synthesize optimal, patient-specific treatment strategies.

preprint2020arXiv

SAFER: Development and Evaluation of an IoT Device Risk Assessment Framework in a Multinational Organization

Users of Internet of Things (IoT) devices are often unaware of their security risks and cannot sufficiently factor security considerations into their device selection. This puts networks, infrastructure and users at risk. We developed and evaluated SAFER, an IoT device risk assessment framework designed to improve users' ability to assess the security of connected devices. We deployed SAFER in a large multinational organization that permits use of private devices. To evaluate the framework, we conducted a mixed-method study with 20 employees. Our findings suggest that SAFER increases users' awareness of security issues. It provides valuable advice and impacts device selection. Based on our findings, we discuss implications for the design of device risk assessment tools, with particular regard to the relationship between risk communication and user perceptions of device complexity.

preprint2020arXiv

Signatures of the Core-Powered Mass-Loss Mechanism in the Exoplanet Population: Dependence on Stellar Properties and Observational Predictions

Recent studies have shown that atmospheric mass-loss powered by the cooling luminosity of a planet's core can explain the observed radius valley separating super-Earths and sub-Neptunes, even without photoevaporation. In this work, we investigate the dependence of this core-powered mass-loss mechanism on stellar mass ($M_\ast$), metallicity ($Z_\ast$) and age ($τ_\ast$). Without making any changes to the underlying planet population, we find that the core-powered mass-loss model yields a shift in the radius valley to larger planet sizes around more massive stars with a slope given by $\text{d log}R_p/\text{d log}M_\ast \simeq 0.35$, in agreement with observations. To first order, this slope is driven by the dependence of core-powered mass-loss on the bolometric luminosity of the host star and is given by $\text{d log}R_p/\text{d log}M_\ast \simeq (3α-2)/36 \simeq 0.33$, where $(L_\ast/L_\odot) = (M_\ast/M_\odot)^α$ is the stellar mass-luminosity relation and $α\simeq 4.6$ for the CKS dataset. We therefore find, in contrast to photoevaporation models, no evidence for a linear correlation between planet and stellar mass, but can't rule it out either. In addition, we show that the location of the radius valley is, to first order, independent of stellar age and metallicity. Since core-powered mass-loss proceeds over Gyr timescales, the abundance of super-Earths relative to sub-Neptunes increases with age but decreases with stellar metallicity. Finally, due the dependence of the envelope's cooling timescale on metallicity, we find that the radii of sub-Neptunes increase with metallicity and decrease with age with slopes given by $\text{d log}R_p/\text{d log}Z_\ast \simeq 0.1$ and $\text{d log}R_p/\text{d log}τ_\ast \simeq -0.1$, respectively. We conclude with a series of observational tests that can differentiate between core-powered mass-loss and photoevaporation models.

preprint2020arXiv

The evolutionary track of H/He envelope in the observed population of sub-Neptunes and Super-Earths planets

The observational detection of a localized reduction in the small planet occurrence rate, sometimes termed a &#39;gap&#39;, is an exciting discovery because of the implications for planet evolutionary history. This gap appears to define a transition region in which sub-Neptune planets are believed to have lost their H/He envelope, potentially by photoevaporation or core powered mass loss, and have thus been transformed into bare cores terrestrial planets. Here we investigate the transition between sub-Neptunes and super-Earths using a real sample of observed small close-in planets and applying envelope evolution models of the H/He envelope together with the mass-radius diagram and a photoevaporation model. We find that photoevaporation can explain the H/He envelope loss of most super-Earths in 100Myr, although an additional loss mechanism appears necessary in some planets. We explore the possibility that these planets families have different core mass and find a continuum in the primordial population of the strongly irradiated super-Earths and the sub-Neptunes. Our analysis also shows that close-orbiting sub-Neptunes with R < 3.5 R_oplus typically lose ~ 30% of their primordial envelope.