Researcher profile

Baskar Ganapathysubramanian

Baskar Ganapathysubramanian contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2026arXiv

ADKO: Agentic Decentralized Knowledge Optimization

We present Agentic Decentralized Knowledge Optimization (ADKO), a framework for collaborative black-box optimization across autonomous agents that achieves sample efficiency, privacy preservation, heterogeneous-objective handling, and communication efficiency. Each agent maintains a private Gaussian Process (GP) surrogate trained on local data and communicates only through knowledge tokens-compact, lossy summaries containing directional signals, advantage scores, and optional language-model (LM) insights-without sharing raw data or model parameters. ADKO unifies GP-Upper Confidence Bound (GP-UCB), parallel Bayesian optimization, decentralized learning, and LM-guided discovery. We provide the first formal analysis of dual information loss: token compression, quantified via mutual-information-based fidelity, and LM approximation error, decomposed into bias and stochastic noise. Our main result shows cumulative regret decomposes into GP error, LM bias, LM noise, and compression loss, with necessary and sufficient conditions for sublinear regret. We also propose fidelity-aware token pruning to preserve high-information tokens under memory budget. Experiments on neural architecture search and scientific discovery validate the theory and show consistent improvements over strong baselines.

preprint2023arXiv

Direct numerical simulation of electrokinetic transport phenomena in fluids: variational multi-scale stabilization and octree-based mesh refinement

Computational modeling of charged species transport has enabled the analysis, design, and optimization of a diverse array of electrochemical and electrokinetic devices. These systems are represented by the Poisson-Nernst-Planck (PNP) equations coupled with the Navier-Stokes (NS) equation. Direct numerical simulation (DNS) to accurately capture the spatio-temporal variation of ion concentration and current flux remains challenging due to the (a) small critical dimension of the diffuse charge layer (DCL), (b) stiff coupling due to fast charge relaxation times, large advective effects, and steep gradients close to boundaries, and (c) complex geometries exhibited by electrochemical devices. In the current study, we address these challenges by presenting a direct numerical simulation framework that incorporates (a) a variational multiscale (VMS) treatment, (b) a block-iterative strategy in conjunction with semi-implicit (for NS) and implicit (for PNP) time integrators, and (c) octree based adaptive mesh refinement. The VMS formulation provides numerical stabilization critical for capturing the electro-convective flows often observed in engineered devices. The block-iterative strategy decouples the difficulty of non-linear coupling between the NS and PNP equations and allows the use of tailored numerical schemes separately for NS and PNP equations. The carefully designed second-order, hybrid implicit methods circumvent the harsh timestep requirements of explicit time steppers, thus enabling simulations over longer time horizons. Finally, the octree-based meshing allows efficient and targeted spatial resolution of the DCL. These features are incorporated into a massively parallel computational framework, enabling the simulation of realistic engineering electrochemical devices. The numerical framework is illustrated using several challenging canonical examples.

preprint2022arXiv

A fully-coupled framework for solving Cahn-Hilliard Navier-Stokes equations: Second-order, energy-stable numerical methods on adaptive octree based meshes

We present a fully-coupled, implicit-in-time framework for solving a thermodynamically-consistent Cahn-Hilliard Navier-Stokes system that models two-phase flows. In this work, we extend the block iterative method presented in Khanwale et al. [\textit{Simulating two-phase flows with thermodynamically consistent energy stable Cahn-Hilliard Navier-Stokes equations on parallel adaptive octree based meshes}, J. Comput. Phys. (2020)], to a fully-coupled, provably second-order accurate scheme in time, while maintaining energy-stability. The new method requires fewer matrix assemblies in each Newton iteration resulting in faster solution time. The method is based on a fully-implicit Crank-Nicolson scheme in time and a pressure stabilization for an equal order Galerkin formulation. That is, we use a conforming continuous Galerkin (cG) finite element method in space equipped with a residual-based variational multiscale (RBVMS) procedure to stabilize the pressure. We deploy this approach on a massively parallel numerical implementation using parallel octree-based adaptive meshes. We present comprehensive numerical experiments showing detailed comparisons with results from the literature for canonical cases, including the single bubble rise, Rayleigh-Taylor instability, and lid-driven cavity flow problems. We analyze in detail the scaling of our numerical implementation.

preprint2022arXiv

A projection-based, semi-implicit time-stepping approach for the Cahn-Hilliard Navier-Stokes equations on adaptive octree meshes

The Cahn-Hilliard Navier-Stokes (CHNS) system provides a computationally tractable model that can be used to effectively capture interfacial dynamics in two-phase fluid flows. In this work, we present a semi-implicit, projection-based finite element framework for solving the CHNS system. We use a projection-based semi-implicit time discretization for the Navier-Stokes equation and a fully-implicit time discretization for the Cahn-Hilliard equation. We use a conforming continuous Galerkin (cG) finite element method in space equipped with a residual-based variational multiscale (RBVMS) formulation. Pressure is decoupled using a projection step, which results in two linear positive semi-definite systems for velocity and pressure, instead of the saddle point system of a pressure-stabilized method. All the linear systems are solved using an efficient and scalable algebraic multigrid (AMG) method. We deploy this approach on a massively parallel numerical implementation using parallel octree-based adaptive meshes. The overall approach allows the use of relatively large time steps with much faster time-to-solve than similar fully-implicit methods. We present comprehensive numerical experiments showing detailed comparisons with results from the literature for canonical cases, including the single bubble rise and Rayleigh-Taylor instability.

preprint2022arXiv

Stochastic Conservative Contextual Linear Bandits

Many physical systems have underlying safety considerations that require that the strategy deployed ensures the satisfaction of a set of constraints. Further, often we have only partial information on the state of the system. We study the problem of safe real-time decision making under uncertainty. In this paper, we formulate a conservative stochastic contextual bandit formulation for real-time decision making when an adversary chooses a distribution on the set of possible contexts and the learner is subject to certain safety/performance constraints. The learner observes only the context distribution and the exact context is unknown, and the goal is to develop an algorithm that selects a sequence of optimal actions to maximize the cumulative reward without violating the safety constraints at any time step. By leveraging the UCB algorithm for this setting, we propose a conservative linear UCB algorithm for stochastic bandits with context distribution. We prove an upper bound on the regret of the algorithm and show that it can be decomposed into three terms: (i) an upper bound for the regret of the standard linear UCB algorithm, (ii) a constant term (independent of time horizon) that accounts for the loss of being conservative in order to satisfy the safety constraint, and (ii) a constant term (independent of time horizon) that accounts for the loss for the contexts being unknown and only the distribution being known. To validate the performance of our approach we perform extensive simulations on synthetic data and on real-world maize data collected through the Genomes to Fields (G2F) initiative.

preprint2021arXiv

Case study of SARS-CoV-2 transmission risk assessment in indoor environments using cloud computing resources

Complex flow simulations are conventionally performed on HPC clusters. However, the limited availability of HPC resources and steep learning curve of executing on traditional supercomputer infrastructure has drawn attention towards deploying flow simulation software on the cloud. We showcase how a complex computational framework -- that can evaluate COVID-19 transmission risk in various indoor classroom scenarios -- can be abstracted and deployed on cloud services. The availability of such cloud-based personalized planning tools can enable educational institutions, medical institutions, public sector workers (courthouses, police stations, airports, etc.), and other entities to comprehensively evaluate various in-person interaction scenarios for transmission risk. We deploy the simulation framework on the Azure cloud framework, utilizing the Dendro-kT mesh generation tool and PETSc solvers. The cloud abstraction is provided by RocketML cloud infrastructure. We compare the performance of the cloud machines with state-of-the-art HPC machine TACC Frontera. Our results suggest that cloud-based HPC resources are a viable strategy for a diverse array of end-users to rapidly and efficiently deploy simulation software.

preprint2021arXiv

Modeling electrochemical systems with weakly imposed Dirichlet boundary conditions

Finite element modeling of charged species transport has enabled analysis, design, and optimization of a diverse array of electrochemical and electrokinetic devices. These systems are represented by the Poisson-Nernst-Planck equations coupled with the Navier-Stokes equation, with a key quantity of interest being the current at the system boundaries. Accurately computing the current flux is challenging due to the small critical dimension of the boundary layers (small Debye layer) that require fine mesh resolution at the boundaries. We resolve this challenge by using the Dirichlet-to-Neumanntransformation to weakly impose the Dirichlet conditions for the Poisson-Nernst-Planck equations. The results obtained with weakly imposed Dirichlet boundary conditions showed excellent agreement with those obtained when conventional boundary conditions with highly resolved mesh we reemployed. Furthermore, the calculated current flux showed faster mesh convergence using weakly imposed conditions compared to the conventionally imposed Dirichlet boundary conditions. We illustrate the approach on canonical 3D problems that otherwise would have been computationally intractable to solve accurately. This approach substantially reduces the computational cost of model-ing electrochemical systems.

preprint2020arXiv

Deep Generative Models that Solve PDEs: Distributed Computing for Training Large Data-Free Models

Recent progress in scientific machine learning (SciML) has opened up the possibility of training novel neural network architectures that solve complex partial differential equations (PDEs). Several (nearly data free) approaches have been recently reported that successfully solve PDEs, with examples including deep feed forward networks, generative networks, and deep encoder-decoder networks. However, practical adoption of these approaches is limited by the difficulty in training these models, especially to make predictions at large output resolutions ($\geq 1024 \times 1024$). Here we report on a software framework for data parallel distributed deep learning that resolves the twin challenges of training these large SciML models - training in reasonable time as well as distributing the storage requirements. Our framework provides several out of the box functionality including (a) loss integrity independent of number of processes, (b) synchronized batch normalization, and (c) distributed higher-order optimization methods. We show excellent scalability of this framework on both cloud as well as HPC clusters, and report on the interplay between bandwidth, network topology and bare metal vs cloud. We deploy this approach to train generative models of sizes hitherto not possible, showing that neural PDE solvers can be viably trained for practical applications. We also demonstrate that distributed higher-order optimization methods are $2-3\times$ faster than stochastic gradient-based methods and provide minimal convergence drift with higher batch-size.

preprint2020arXiv

How useful is Active Learning for Image-based Plant Phenotyping?

Deep learning models have been successfully deployed for a diverse array of image-based plant phenotyping applications including disease detection and classification. However, successful deployment of supervised deep learning models requires large amount of labeled data, which is a significant challenge in plant science (and most biological) domains due to the inherent complexity. Specifically, data annotation is costly, laborious, time consuming and needs domain expertise for phenotyping tasks, especially for diseases. To overcome this challenge, active learning algorithms have been proposed that reduce the amount of labeling needed by deep learning models to achieve good predictive performance. Active learning methods adaptively select samples to annotate using an acquisition function to achieve maximum (classification) performance under a fixed labeling budget. We report the performance of four different active learning methods, (1) Deep Bayesian Active Learning (DBAL), (2) Entropy, (3) Least Confidence, and (4) Coreset, with conventional random sampling-based annotation for two different image-based classification datasets. The first image dataset consists of soybean [Glycine max L. (Merr.)] leaves belonging to eight different soybean stresses and a healthy class, and the second consists of nine different weed species from the field. For a fixed labeling budget, we observed that the classification performance of deep learning models with active learning-based acquisition strategies is better than random sampling-based acquisition for both datasets. The integration of active learning strategies for data annotation can help mitigate labelling challenges in the plant sciences applications particularly where deep domain knowledge is required.

preprint2020arXiv

Simulating two-phase flows with thermodynamically consistent energy stable Cahn-Hilliard Navier-Stokes equations on parallel adaptive octree based meshes

We report on simulations of two-phase flows with deforming interfaces at various density contrasts by solving thermodynamically consistent Cahn-Hilliard Navier-Stokes equations. An (essentially) unconditionally energy-stable Crank-Nicolson-type time integration scheme is used. Detailed proofs of energy stability of the semi-discrete scheme and for the existence of solutions of the advective-diffusive Cahn-Hilliard operator are provided. In space we discretize with a conforming continuous Galerkin finite element method in conjunction with a residual-based variational multi-scale (VMS) approach in order to provide pressure stabilization. We deploy this approach on a massively parallel numerical implementation using fast octree-based adaptive meshes. A detailed scaling analysis of the solver is presented. Numerical experiments showing convergence and validation with experimental results from the literature are presented for a large range of density ratios.

preprint2020arXiv

Usefulness of interpretability methods to explain deep learning based plant stress phenotyping

Deep learning techniques have been successfully deployed for automating plant stress identification and quantification. In recent years, there is a growing push towards training models that are interpretable -i.e. that justify their classification decisions by visually highlighting image features that were crucial for classification decisions. The expectation is that trained network models utilize image features that mimic visual cues used by plant pathologists. In this work, we compare some of the most popular interpretability methods: Saliency Maps, SmoothGrad, Guided Backpropogation, Deep Taylor Decomposition, Integrated Gradients, Layer-wise Relevance Propagation and Gradient times Input, for interpreting the deep learning model. We train a DenseNet-121 network for the classification of eight different soybean stresses (biotic and abiotic). Using a dataset consisting of 16,573 RGB images of healthy and stressed soybean leaflets captured under controlled conditions, we obtained an overall classification accuracy of 95.05 \%. For a diverse subset of the test data, we compared the important features with those identified by a human expert. We observed that most interpretability methods identify the infected regions of the leaf as important features for some -- but not all -- of the correctly classified images. For some images, the output of the interpretability methods indicated that spurious feature correlations may have been used to correctly classify them. Although the output explanation maps of these interpretability methods may be different from each other for a given image, we advocate the use of these interpretability methods as `hypothesis generation' mechanisms that can drive scientific insight.

preprint2019arXiv

Shape-design for stabilizing micro-particles in inertial microfluidic flows

Design of microparticles which stabilize at the centerline of a channel flow when part of a dilute suspension is examined numerically for moderate Reynolds numbers ($10 \le Re \le 80$). Stability metrics for particles with arbitrary shapes are formulated based on linear-stability theory. Particle shape is parametrized by a compact, Non-Uniform Rational B-Spline (NURBS)-based representation. Shape-design is posed as an optimization problem and solved using adaptive Bayesian optimization. We focus on designing particles for maximal stability at the channel-centerline robust to perturbations. Our results indicate that centerline-focusing particles are families of characteristic "fish"/"bottle"/"dumbbell"-like shapes, exhibiting fore-aft asymmetry. A parametric exploration is then performed to identify stable particle-designs at different k's (particle chord-to-channel width ratio) and Re's ($0.1 \le k \le 0.4, 10 \le Re \le 80$). Particles at high-k's and Re's are highly stabilized when compared to those at low-k's and Re's. A comparison of the modified dumbbell designs from the current framework also shows better performance to perturbations in Fluid-Structure Interaction (FSI) when compared to the rod-disk model reported previously (Uspal & Doyle 2014) for low-Re Hele-Shaw flow. We identify basins of attraction around the centerline, which span larger release-angle-ranges and lateral locations (tending to the channel width) for narrower channels, which effectively standardizes the notion of global focusing in such configurations using the current stability-paradigm. The present framework is illustrated for 2D cases and is potentially generalizable to stability in 3D flow-fields. The current formulation is also agnostic to Re and particle/channel geometry which indicates substantial potential for integration with imaging flow-cytometry tools and microfluidic biosensing-assays.

preprint2018arXiv

Interpretable deep learning for guided structure-property explorations in photovoltaics

The performance of an organic photovoltaic device is intricately connected to its active layer morphology. This connection between the active layer and device performance is very expensive to evaluate, either experimentally or computationally. Hence, designing morphologies to achieve higher performances is non-trivial and often intractable. To solve this, we first introduce a deep convolutional neural network (CNN) architecture that can serve as a fast and robust surrogate for the complex structure-property map. Several tests were performed to gain trust in this trained model. Then, we utilize this fast framework to perform robust microstructural design to enhance device performance.