Researcher profile

Auroop R. Ganguly

Auroop R. Ganguly contributes to research discovery and scholarly infrastructure.

Researcher, Affiliation not imported, Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging, Verification L1, Unclaimed author
8 works
0 followers
8 topics
4 close collaborators

Published work

8 published items

preprint, 2023, arXiv

CDA: Contrastive-adversarial Domain Adaptation

Recent advances in domain adaptation reveal that adversarial learning on deep neural networks can learn domain invariant features to reduce the shift between source and target domains. While such adversarial approaches achieve domain-level alignment, they ignore the class (label) shift. When class-conditional data distributions are significantly different between the source and target domain, it can generate ambiguous features near class boundaries that are more likely to be misclassified. In this work, we propose a two-stage model for domain adaptation called Contrastive-adversarial Domain Adaptation (CDA). While the adversarial component facilitates domain-level alignment, two-stage contrastive learning exploits class information to achieve higher intra-class compactness across domains resulting in well-separated decision boundaries. Furthermore, the proposed contrastive framework is designed as a plug-and-play module that can be easily embedded with existing adversarial methods for domain adaptation. We conduct experiments on two widely used benchmark datasets for domain adaptation, namely, Office-31 and Digits-5, and demonstrate that CDA achieves state-of-the-art results on both datasets.

preprint, 2022, arXiv

A framework for deep learning emulation of numerical models with a case study in satellite remote sensing

Numerical models based on physics represent the state-of-the-art in earth system modeling and comprise our best tools for generating insights and predictions. Despite rapid growth in computational power, the perceived need for higher model resolutions overwhelms the latest-generation computers, reducing the ability of modelers to generate simulations for understanding parameter sensitivities and characterizing variability and uncertainty. Thus, surrogate models are often developed to capture the essential attributes of the full-blown numerical models. Recent successes of machine learning methods, especially deep learning, across many disciplines offer the possibility that complex nonlinear connectionist representations may be able to capture the underlying complex structures and nonlinear processes in earth systems. A difficult test for deep learning-based emulation, which refers to function approximation of numerical models, is to understand whether they can be comparable to traditional forms of surrogate models in terms of computational efficiency while simultaneously reproducing model results in a credible manner. A deep learning emulation that passes this test may be expected to perform even better than simple models with respect to capturing complex processes and spatiotemporal dependencies. Here we examine, with a case study in satellite-based remote sensing, the hypothesis that deep learning approaches can credibly represent the simulations from a surrogate model with comparable computational efficiency. Our results are encouraging in that the deep learning emulation reproduces the results with acceptable accuracy and often even faster performance. We discuss the broader implications of our results in light of the pace of improvements in high-performance implementations of deep learning as well as the growing desire for higher-resolution simulations in the earth sciences.

preprint, 2022, arXiv

Deep Transfer Learning on Satellite Imagery Improves Air Quality Estimates in Developing Nations

Urban air pollution is a public health challenge in low- and middle-income countries (LMICs). However, LMICs lack adequate air quality (AQ) monitoring infrastructure. A persistent challenge has been our inability to estimate AQ accurately in LMIC cities, which hinders emergency preparedness and risk mitigation. Deep learning-based models that map satellite imagery to AQ can be built for high-income countries (HICs) with adequate ground data. Here we demonstrate that a scalable approach that adapts deep transfer learning on satellite imagery for AQ can extract meaningful estimates and insights in LMIC cities based on spatiotemporal patterns learned in HIC cities. The approach is demonstrated for Accra in Ghana, Africa, with AQ patterns learned from two US cities, specifically Los Angeles and New York.

preprint, 2020, arXiv

Machine Learning for Robust Identification of Complex Nonlinear Dynamical Systems: Applications to Earth Systems Modeling

Systems exhibiting nonlinear dynamics, including but not limited to chaos, are ubiquitous across Earth Sciences such as Meteorology, Hydrology, Climate and Ecology, as well as Biology such as neural and cardiac processes. However, System Identification remains a challenge. In climate and earth systems models, while governing equations follow from first principles and understanding of key processes has steadily improved, the largest uncertainties are often caused by parameterizations such as cloud physics, which in turn have witnessed limited improvements over the last several decades. Climate scientists have pointed to Machine Learning enhanced parameter estimation as a possible solution, with proof-of-concept methodological adaptations being examined on idealized systems. While climate science has been highlighted as a "Big Data" challenge owing to the volume and complexity of archived model-simulations and observations from remote and in-situ sensors, the parameter estimation process is often a relatively "small data" problem. A crucial question for data scientists in this context is the relevance of state-of-the-art data-driven approaches including those based on deep neural networks or kernel-based processes. Here we consider a chaotic system - two-level Lorenz-96 - used as a benchmark model in the climate science literature, adopt a methodology based on Gaussian Processes for parameter estimation and compare the gains in predictive understanding with a suite of Deep Learning and strawman Linear Regression methods. Our results show that adaptations of kernel-based Gaussian Processes can outperform other approaches under small data constraints along with uncertainty quantification, and need to be considered as a viable approach in climate science and earth system modeling.

preprint, 2016, arXiv

Characterizing climate predictability and model response variability from multiple initial condition and multi-model ensembles

Climate models are thought to solve boundary value problems unlike numerical weather prediction, which is an initial value problem. However, climate internal variability (CIV) is thought to be relatively important at near-term (0-30 year) prediction horizons, especially at higher resolutions. The recent availability of significant numbers of multi-model (MME) and multi-initial condition (MICE) ensembles allows for the first time a direct sensitivity analysis of CIV versus model response variability (MRV). Understanding the relative agreement and variability of MME and MICE ensembles for multiple regions, resolutions, and projection horizons is critical for focusing model improvements, diagnostics, and prognosis, as well as impacts, adaptation, and vulnerability studies. Here we find that CIV (MICE agreement) is lower (higher) than MRV (MME agreement) across all spatial resolutions and projection time horizons for both temperature and precipitation. However, CIV dominates MRV over higher latitudes generally and in specific regions. Furthermore, CIV is considerably larger than MRV for precipitation compared to temperature across all horizontal and projection scales and seasons. Precipitation exhibits larger uncertainties, sharper decay of MICE agreement compared to MME, and relatively greater dominance of CIV over MRV at higher latitudes. The findings are crucial for climate predictability and adaptation strategies at stakeholder-relevant scales.

preprint, 2015, arXiv

Network science based quantification of resilience demonstrated on the Indian Railways Network

The structure, interdependence, and fragility of systems ranging from power grids and transportation to ecology, climate, biology and even human communities and the Internet, have been examined through network science. While the response to perturbations has been quantified, recovery strategies for perturbed networks have usually been discussed either conceptually or through anecdotal case studies. Here we develop a network science-based quantitative methods framework for measuring, comparing and interpreting hazard responses as well as recovery strategies. The framework, motivated by the recently proposed temporal resilience paradigm, is demonstrated with the Indian Railways Network. The methods are demonstrated through the resilience of the network to natural or human-induced hazards and electric grid failure. Multiple metrics are used to generate various recovery strategies, which are simply sequences in which system components should be recovered after a disruption. Case studies based on two historical events, specifically the 2004 Indian Ocean tsunami and the 2012 North Indian blackout, and a simulated cyber-physical attack scenario, provide a means for interpreting the relative performance of various recovery strategies. Quantitative evaluation of recovery strategies suggests that faster and more resource-effective restoration is possible through network centrality measures, even though the specific strategy may be different for sub-networks or for partial recovery.

preprint, 2015, arXiv

Space-time Trends in U.S. Meteorological Droughts

Understanding droughts in a climate context remains a major challenge. Over the United States, different choices of observations and metrics have often produced diametrically opposite insights. This paper focuses on understanding and characterizing meteorological droughts from station measurements of precipitation. The Standardized Precipitation Index is computed and analyzed to obtain drought severity, duration and frequency. Average drought severity trends are found to be uncertain and data-dependent. Furthermore, the mean and spatial variance do not show any discernible non-stationary behavior. However, the spatial coverage of extreme meteorological droughts in the United States exhibits an increasing trend over nearly all of the last century. Furthermore, the coverage over the last half decade exceeds that of the Dust Bowl era. Previous literature suggests that climate extremes do not necessarily follow the trends or uncertainties exhibited by the averages. While this possibility has been suggested for droughts, this paper for the first time clearly delineates and differentiates the trends in the mean, variability and extremes of meteorological droughts in the United States, and uncovers the trends in the spatial coverage of extremes. Multiple data sets, as well as years exhibiting large, and possibly anomalous, droughts are carefully examined to characterize trends and uncertainties. Nonlinear dependence among meteorological drought attributes necessitates the use of copula-based tools from probability theory. Severity-duration-frequency curves are generated to demonstrate how these insights may be translated to design and policy.

preprint, 2015, arXiv

Water Stress on U.S. Power Production at Decadal Time Horizons

Thermoelectric power production at risk, owing to current and projected water scarcity and rising stream temperatures, is assessed for the contiguous United States at decadal scales. Regional water scarcity is driven by climate variability and change, as well as by multi-sector water demand. While a planning horizon of zero to about thirty years is occasionally prescribed by stakeholders, the challenges to risk assessment at these scales include the difficulty in delineating decadal climate trends from intrinsic natural or multiple model variability. Current generation global climate or earth system models are not credible at the spatial resolutions of power plants, especially for surface water quantity and stream temperatures, which further exacerbates the assessment challenge. Population changes, which are difficult to project, cannot serve as adequate proxies for changes in the water demand across sectors. The hypothesis that robust assessments of power production at risk are possible, despite the uncertainties, has been examined as a proof of concept. An approach is presented for delineating water scarcity and temperature from climate models, observations and population storylines, as well as for assessing power production at risk by examining geospatial correlations of power plant locations within regions where the usable water supply for energy production happens to be scarcer and warmer. Our analyses showed that in the near term, more than 200 counties are likely to be exposed to water scarcity in the next three decades. Further, we noticed that stream gauges in more than five counties in the 2030s and ten counties in the 2040s showed a significant increase in water temperature, which exceeded the power plant effluent temperature threshold set by the EPA. Power plants in South Carolina, Louisiana, and Texas are likely to be vulnerable owing to climate-driven water stresses.