Researcher profile

Ayan Mukhopadhyay

Ayan Mukhopadhyay contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

Grid-Aware Charging and Operational Optimization for Mixed-Fleet Public Transit

The rapid growth of urban populations and the increasing need for sustainable transportation solutions have prompted a shift towards electric buses in public transit systems. However, the effective management of mixed fleets consisting of both electric and diesel buses poses significant operational challenges. One major challenge is coping with dynamic electricity pricing, where charging costs vary throughout the day. Transit agencies must optimize charging assignments in response to such dynamism while accounting for secondary considerations such as seating constraints. This paper presents a comprehensive mixed-integer linear programming (MILP) model to address these challenges by jointly optimizing charging schedules and trip assignments for mixed (electric and diesel bus) fleets while considering factors such as dynamic electricity pricing, vehicle capacity, and route constraints. We address the potential computational intractability of the MILP formulation, which can arise even with relatively small fleets, by employing a hierarchical approach tailored to the fleet composition. By using real-world data from the city of Chattanooga, Tennessee, USA, we show that our approach can result in significant savings in the operating costs of the mixed transit fleets.

preprint2026arXiv

Hybrid thermalization in the large $N$ limit

Semi-holography provides a formulation of dynamics in gauge theories involving both weakly self-interacting (perturbative) and strongly self-interacting (non-perturbative) degrees of freedom. These two subsectors interact via their effective metrics and sources, while the full local energy-momentum tensor is conserved in the physical background metric. In the large $N$ limit, the subsectors have their individual entropy currents, and so the full system can reach a pseudo-equilibrium state in which each subsector has a different physical temperature. We first complete the proof that the global thermal equilibrium state, where both subsectors have the \textit{same} physical temperature, can be defined in consistency with the principles of thermodynamics and statistical mechanics. Particularly, we show that the global equilibrium state is the unique state with maximum entropy in the microcanonical ensemble. Furthermore, we show that in the large $N$ limit, a \textit{typical} non-equilibrium state of the full isolated system relaxes to the global equilibrium state when the average energy density is large compared to the scale set by the inter-system coupling. We discuss quantum statistical perspectives.

preprint2026arXiv

Online Decision-Making Under Uncertainty for Vehicle-to-Building Systems

Vehicle-to-building (V2B) systems integrate physical infrastructures, such as smart buildings and electric vehicles (EVs) connected to chargers at the building, with digital control mechanisms to manage energy use. By utilizing EVs as flexible energy reservoirs, buildings can dynamically charge and discharge them to optimize energy use and cut costs under time-variable pricing and demand charge policies. This setup leads to the V2B optimization problem, where buildings coordinate EV charging and discharging to minimize total electricity costs while meeting users' charging requirements. However, the V2B optimization problem is challenging because of: (1) fluctuating electricity pricing, which includes both energy charges ($/kWh) and demand charges ($/kW); (2) long planning horizons (typically over 30 days); (3) heterogeneous chargers with varying charging rates, controllability, and directionality (i.e., unidirectional or bidirectional); and (4) user-specific battery levels at departure to ensure user requirements are met. In contrast to existing approaches that often model this setting as a single-shot combinatorial optimization problem, we highlight critical limitations in prior work and instead model the V2B optimization problem as a Markov decision process (MDP), i.e., a stochastic control process. Solving the resulting MDP is challenging due to the large state and action spaces. To address the challenges of the large state space, we leverage online search, and we counter the action space by using domain-specific heuristics to prune unpromising actions. We validate our approach in collaboration with Nissan Advanced Technology Center - Silicon Valley. Using data from their EV testbed, we show that the proposed framework significantly outperforms state-of-the-art methods.

preprint2026arXiv

Toward Template-Free Explainability for Monte Carlo Tree Search

Probabilistic search algorithms, such as Monte Carlo Tree Search (MCTS), have proven very effective in solving sequential decision-making tasks under uncertainty. However, interpreting asymmetric search trees that incorporate bandit-based tree traversal and simulation-based value estimation is difficult for end users based solely on raw tree statistics. While prior work requires hand-crafted formal logic constraints that must be updated when the problem changes, we present a framework that enables large language models (LLMs) to generate evidence-grounded explanations of MCTS decisions from recorded search traces in an end-to-end manner. Our framework maps natural-language questions to a structured set of intent categories, determines whether the existing tree contains sufficient evidence, triggers targeted expansion when needed, and generates explanations using tree statistics such as visit counts, value estimates, and risk information. Experimental results provide the first evidence that LLMs can serve as end-to-end explainers for probabilistic search, without requiring intermediate formal representations.

preprint2022arXiv

ADVISER: AI-Driven Vaccination Intervention Optimiser for Increasing Vaccine Uptake in Nigeria

More than 5 million children under five years die from largely preventable or treatable medical conditions every year, with an overwhelmingly large proportion of deaths occurring in under-developed countries with low vaccination uptake. One of the United Nations' sustainable development goals (SDG 3) aims to end preventable deaths of newborns and children under five years of age. We focus on Nigeria, where the rate of infant mortality is appalling. We collaborate with HelpMum, a large non-profit organization in Nigeria to design and optimize the allocation of heterogeneous health interventions under uncertainty to increase vaccination uptake, the first such collaboration in Nigeria. Our framework, ADVISER: AI-Driven Vaccination Intervention Optimiser, is based on an integer linear program that seeks to maximize the cumulative probability of successful vaccination. Our optimization formulation is intractable in practice. We present a heuristic approach that enables us to solve the problem for real-world use-cases. We also present theoretical bounds for the heuristic method. Finally, we show that the proposed approach outperforms baseline methods in terms of vaccination uptake through experimental evaluation. HelpMum is currently planning a pilot program based on our approach to be deployed in the largest city of Nigeria, which would be the first deployment of an AI-driven vaccination uptake program in the country and hopefully, pave the way for other data-driven programs to improve health outcomes in Nigeria.

preprint2022arXiv

An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services

Many transit agencies operating paratransit and microtransit services have to respond to trip requests that arrive in real-time, which entails solving hard combinatorial and sequential decision-making problems under uncertainty. To avoid decisions that lead to significant inefficiency in the long term, vehicles should be allocated to requests by optimizing a non-myopic utility function or by batching requests together and optimizing a myopic utility function. While the former approach is typically offline, the latter can be performed online. We point out two major issues with such approaches when applied to paratransit services in practice. First, it is difficult to batch paratransit requests together as they are temporally sparse. Second, the environment in which transit agencies operate changes dynamically (e.g., traffic conditions), causing estimates that are learned offline to become stale. To address these challenges, we propose a fully online approach to solve the dynamic vehicle routing problem (DVRP) with time windows and stochastic trip requests that is robust to changing environmental dynamics by construction. We focus on scenarios where requests are relatively sparse - our problem is motivated by applications to paratransit services. We formulate DVRP as a Markov decision process and use Monte Carlo tree search to evaluate actions for any given state. Accounting for stochastic requests while optimizing a non-myopic utility function is computationally challenging; indeed, the action space for such a problem is intractably large in practice. To tackle the large action space, we leverage the structure of the problem to design heuristics that can sample promising actions for the tree search. Our experiments using real-world data from our partner agency show that the proposed approach outperforms existing state-of-the-art approaches both in terms of performance and robustness.

preprint2022arXiv

Decision Making in Non-Stationary Environments with Policy-Augmented Monte Carlo Tree Search

Decision-making under uncertainty (DMU) is present in many important problems. An open challenge is DMU in non-stationary environments, where the dynamics of the environment can change over time. Reinforcement Learning (RL), a popular approach for DMU problems, learns a policy by interacting with a model of the environment offline. Unfortunately, if the environment changes the policy can become stale and take sub-optimal actions, and relearning the policy for the updated environment takes time and computational effort. An alternative is online planning approaches such as Monte Carlo Tree Search (MCTS), which perform their computation at decision time. Given the current environment, MCTS plans using high-fidelity models to determine promising action trajectories. These models can be updated as soon as environmental changes are detected to immediately incorporate them into decision making. However, MCTS's convergence can be slow for domains with large state-action spaces. In this paper, we present a novel hybrid decision-making approach that combines the strengths of RL and planning while mitigating their weaknesses. Our approach, called Policy Augmented MCTS (PA-MCTS), integrates a policy's actin-value estimates into MCTS, using the estimates to seed the action trajectories favored by the search. We hypothesize that PA-MCTS will converge more quickly than standard MCTS while making better decisions than the policy can make on its own when faced with nonstationary environments. We test our hypothesis by comparing PA-MCTS with pure MCTS and an RL agent applied to the classical CartPole environment. We find that PC-MCTS can achieve higher cumulative rewards than the policy in isolation under several environmental shifts while converging in significantly fewer iterations than pure MCTS.

preprint2022arXiv

Designing Decision Support Systems for Emergency Response: Challenges and Opportunities

Designing effective emergency response management (ERM) systems to respond to incidents such as road accidents is a major problem faced by communities. In addition to responding to frequent incidents each day (about 240 million emergency medical services calls and over 5 million road accidents in the US each year), these systems also support response during natural hazards. Recently, there has been a consistent interest in building decision support and optimization tools that can help emergency responders provide more efficient and effective response. This includes a number of principled subsystems that implement early incident detection, incident likelihood forecasting and strategic resource allocation and dispatch policies. In this paper, we highlight the key challenges and provide an overview of the approach developed by our team in collaboration with our community partners.

preprint2022arXiv

Generative Anomaly Detection for Time Series Datasets

Traffic congestion anomaly detection is of paramount importance in intelligent traffic systems. The goals of transportation agencies are two-fold: to monitor the general traffic conditions in the area of interest and to locate road segments under abnormal congestion states. Modeling congestion patterns can achieve these goals for citywide roadways, which amounts to learning the distribution of multivariate time series (MTS). However, existing works are either not scalable or unable to capture the spatial-temporal information in MTS simultaneously. To this end, we propose a principled and comprehensive framework consisting of a data-driven generative approach that can perform tractable density estimation for detecting traffic anomalies. Our approach first clusters segments in the feature space and then uses conditional normalizing flow to identify anomalous temporal snapshots at the cluster level in an unsupervised setting. Then, we identify anomalies at the segment level by using a kernel density estimator on the anomalous cluster. Extensive experiments on synthetic datasets show that our approach significantly outperforms several state-of-the-art congestion anomaly detection and diagnosis methods in terms of Recall and F1-Score. We also use the generative model to sample labeled data, which can train classifiers in a supervised setting, alleviating the lack of labeled data for anomaly detection in sparse settings.

preprint2022arXiv

Holographic spacetime, black holes and quantum error correcting codes: A review

This article reviews the progress in our understanding of the reconstruction of the bulk spacetime in the holographic correspondence from the dual field theory including an account of how these developments have led to the reproduction of the Page curve of the Hawking radiation from black holes. We review quantum error correction and relevant recovery maps with toy examples based on tensor networks, and discuss how it provides the desired framework for bulk reconstruction in which apparent inconsistencies with properties of the operator algebra in the dual field theory are naturally resolved. The importance of understanding the modular flow in the dual field theory has been emphasized. We discuss how the state-dependence of reconstruction of black hole microstates can be formulated in the framework of quantum error correction with inputs from extremal surfaces along with a quantification of the complexity of encoding of bulk operators. Finally, we motivate and discuss a class of tractable microstate models of black holes which can illuminate how the black hole complementarity principle can emerge operationally without encountering information paradoxes, and provide new insights into generation of desirable features of encoding into the Hawking radiation.

preprint2022arXiv

Offline Vehicle Routing Problem with Online Bookings: A Novel Problem Formulation with Applications to Paratransit

Vehicle routing problems (VRPs) can be divided into two major categories: offline VRPs, which consider a given set of trip requests to be served, and online VRPs, which consider requests as they arrive in real-time. Based on discussions with public transit agencies, we identify a real-world problem that is not addressed by existing formulations: booking trips with flexible pickup windows (e.g., 3 hours) in advance (e.g., the day before) and confirming tight pickup windows (e.g., 30 minutes) at the time of booking. Such a service model is often required in paratransit service settings, where passengers typically book trips for the next day over the phone. To address this gap between offline and online problems, we introduce a novel formulation, the offline vehicle routing problem with online bookings. This problem is very challenging computationally since it faces the complexity of considering large sets of requests -- similar to offline VRPs -- but must abide by strict constraints on running time -- similar to online VRPs. To solve this problem, we propose a novel computational approach, which combines an anytime algorithm with a learning-based policy for real-time decisions. Based on a paratransit dataset obtained from the public transit agency of Chattanooga, TN, we demonstrate that our novel formulation and computational approach lead to significantly better outcomes in this setting than existing algorithms.

preprint2022arXiv

Quantum thermodynamics of holographic quenches and bounds on the growth of entanglement from the QNEC

The quantum null energy condition (QNEC) is a lower bound on the energy-momentum tensor in terms of the variation of the entanglement entropy of a sub-region along a null direction. To gain insights into quantum thermodynamics of many-body systems, we study if the QNEC restricts irreversible entropy production in quenches driven by energy-momentum inflow from an infinite memoryless bath in two-dimensional holographic theories. We find that an increase in both entropy and temperature, as implied by the Clausius inequality of classical thermodynamics, are necessary but not sufficient to not violate QNEC in quenches leading to transitions between thermal states with momentum which are dual to Banados-Teitelboim-Zanelli geometries. For an arbitrary initial state, we can determine the lower and upper bounds on the increase of entropy (temperature) for a fixed increase in temperature (entropy). Our results provide explicit instances of quantum lower and upper bounds on irreversible entropy production whose existence has been established in literature. We also find monotonic behavior of the non-saturation of the QNEC with time after a quench, and analytically determine their asymptotic values. Our study shows that the entanglement entropy of an interval of length $l$ always thermalizes in time $l/2$ with an exponent $3/2$. Furthermore, we determine the coefficient of initial quadratic growth of entanglement analytically for any $l$, and show that the slope of the asymptotic ballistic growth of entanglement for a semi-infinite interval is twice the difference of the entropy densities of the final and initial states. We determine explicit upper and lower bounds on these rates of growth of entanglement.

preprint2021arXiv

Hierarchical Planning for Resource Allocation in Emergency Response Systems

A classical problem in city-scale cyber-physical systems (CPS) is resource allocation under uncertainty. Typically, such problems are modeled as Markov (or semi-Markov) decision processes. While online, offline, and decentralized approaches have been applied to such problems, they have difficulty scaling to large decision problems. We present a general approach to hierarchical planning that leverages structure in city-level CPS problems for resource allocation under uncertainty. We use the emergency response as a case study and show how a large resource allocation problem can be split into smaller problems. We then create a principled framework for solving the smaller problems and tackling the interaction between them. Finally, we use real-world data from Nashville, Tennessee, a major metropolitan area in the United States, to validate our approach. Our experiments show that the proposed approach outperforms state-of-the-art approaches used in the field of emergency response.

preprint2020arXiv

Hydrodynamic attractor of a hybrid viscous fluid in Bjorken flow

The nonequilibrium evolution in a boost-invariant Bjorken flow of a hybrid viscous fluid model containing two interacting components with different viscosities, such that they represent strongly and weakly self-coupled sectors, is shown to be characterized by a hydrodynamic attractor which has an early-time behavior that is reminiscent of the so-called bottom-up thermalization scenario in heavy-ion collisions. The hydrodynamization times for the two sectors can differ strongly, with details depending on the curve realized on the two-dimensional attractor surface, which might account for different scenarios for small and large systems in nuclear collisions. The total system behaves like a single viscous fluid with a dynamically determined effective shear viscosity.

preprint2020arXiv

On Algorithmic Decision Procedures in Emergency Response Systems in Smart and Connected Communities

Emergency Response Management (ERM) is a critical problem faced by communities across the globe. Despite this, it is common for ERM systems to follow myopic decision policies in the real world. Principled approaches to aid ERM decision-making under uncertainty have been explored but have failed to be accepted into real systems. We identify a key issue impeding their adoption --- algorithmic approaches to emergency response focus on reactive, post-incident dispatching actions, i.e. optimally dispatching a responder \textit{after} incidents occur. However, the critical nature of emergency response dictates that when an incident occurs, first responders always dispatch the closest available responder to the incident. We argue that the crucial period of planning for ERM systems is not post-incident, but between incidents. This is not a trivial planning problem --- a major challenge with dynamically balancing the spatial distribution of responders is the complexity of the problem. An orthogonal problem in ERM systems is planning under limited communication, which is particularly important in disaster scenarios that affect communication networks. We address both problems by proposing two partially decentralized multi-agent planning algorithms that utilize heuristics and exploit the structure of the dispatch problem. We evaluate our proposed approach using real-world data, and find that in several contexts, dynamic re-balancing the spatial distribution of emergency responders reduces both the average response time as well as its variance.

preprint2019arXiv

Time-dependent $NAdS_2$ holography with applications

We develop a method for obtaining exact time-dependent solutions in Jackiw-Teitelboim gravity coupled to non-conformal matter and study consequences for $NAdS_2$ holography. We study holographic quenches in which we find that the black hole mass increases. A semi-holographic model composed of an infrared $NAdS_2$ holographic sector representing the mutual strong interactions of trapped impurities confined at a spatial point is proposed. The holographic sector couples to the position of a displaced impurity acting as a self-consistent boundary source. This effective $0+1-$dimensional description has a total conserved energy. Irrespective of the initial velocity of the particle, the black hole mass initially increases, but after the horizon runs away to infinity in the physical patch, the mass vanishes in the long run. The total energy is completely transferred to the kinetic energy or the self-consistent confining potential energy of the impurity. For initial velocities below a critical value determined by the mutual coupling, the black hole mass changes sign in finite time. Above this critical velocity, the initial condition of the particle can be retrieved from the $SL(2,R)$ invariant exponent that governs the exponential growth of the bulk gravitational $SL(2,R)$ charges at late time.