Source author record

Pramod P. Khargonekar

Pramod P. Khargonekar appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Systems and Control eess.SY Machine Learning Computer Science and Game Theory math.OC Multiagent Systems Applications Computer Vision cs.CY Information Theory math.DS math.IT Neural and Evolutionary Computing Neurons and Cognition

Catalog footprint

What is connected

15works

14topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

Long-term Fairness For Real-time Decision Making: A Constrained Online Optimization Approach

Machine learning (ML) has demonstrated remarkable capabilities across many real-world systems, from predictive modeling to intelligent automation. However, the widespread integration of machine learning also makes it necessary to ensure machine learning-driven decision-making systems do not violate ethical principles and values of society in which they operate. As ML-driven decisions proliferate, particularly in cases involving sensitive attributes such as gender, race, and age, to name a few, the need for equity and impartiality has emerged as a fundamental concern. In situations demanding real-time decision-making, fairness objectives become more nuanced and complex: instantaneous fairness to ensure equity in every time slot, and long-term fairness to ensure fairness over a period of time. There is a growing awareness that real-world systems that operate over long periods and require fairness over different timelines. However, existing approaches mainly address dynamic costs with time-invariant fairness constraints, often disregarding the challenges posed by time-varying fairness constraints. To bridge this gap, this work introduces a framework for ensuring long-term fairness within dynamic decision-making systems characterized by time-varying fairness constraints. We formulate the decision problem with fairness constraints over a period as a constrained online optimization problem. A novel online algorithm, named LoTFair, is presented that solves the problem 'on the fly'. We prove that LoTFair can make overall fairness violations negligible while maintaining the performance over the long run.

preprint2023arXiv

Competing Bandits in Time Varying Matching Markets

We study the problem of online learning in two-sided non-stationary matching markets, where the objective is to converge to a stable match. In particular, we consider the setting where one side of the market, the arms, has fixed known set of preferences over the other side, the players. While this problem has been studied when the players have fixed but unknown preferences, in this work we study the problem of how to learn when the preferences of the players are time varying and unknown. Our contribution is a methodology that can handle any type of preference structure and variation scenario. We show that, with the proposed algorithm, each player receives a uniform sub-linear regret of {$\widetilde{\mathcal{O}}(L^{1/2}_TT^{1/2})$} up to the number of changes in the underlying preferences of the agents, $L_T$. Therefore, we show that the optimal rates for single-agent learning can be achieved in spite of the competition up to a difference of a constant factor. We also discuss extensions of this algorithm to the case where the number of changes need not be known a priori.

preprint2022arXiv

Meta-Learning Online Control for Linear Dynamical Systems

In this paper, we consider the problem of finding a meta-learning online control algorithm that can learn across the tasks when faced with a sequence of $N$ (similar) control tasks. Each task involves controlling a linear dynamical system for a finite horizon of $T$ time steps. The cost function and system noise at each time step are adversarial and unknown to the controller before taking the control action. Meta-learning is a broad approach where the goal is to prescribe an online policy for any new unseen task exploiting the information from other tasks and the similarity between the tasks. We propose a meta-learning online control algorithm for the control setting and characterize its performance by \textit{meta-regret}, the average cumulative regret across the tasks. We show that when the number of tasks are sufficiently large, our proposed approach achieves a meta-regret that is smaller by a factor $D/D^{*}$ compared to an independent-learning online control algorithm which does not perform learning across the tasks, where $D$ is a problem constant and $D^{*}$ is a scalar that decreases with increase in the similarity between tasks. Thus, when the sequence of tasks are similar the regret of the proposed meta-learning online control is significantly lower than that of the naive approaches without meta-learning. We also present experiment results to demonstrate the superior performance achieved by our meta-learning algorithm.

preprint2022arXiv

Online Learning Robust Control of Nonlinear Dynamical Systems

In this work we address the problem of the online robust control of nonlinear dynamical systems perturbed by disturbance. We study the problem of attenuation of the total cost over a duration $T$ in response to the disturbances. We consider the setting where the cost function (at a particular time) is a general continuous function and adversarial, the disturbance is adversarial and bounded at any point of time. Our goal is to design a controller that can learn and adapt to achieve a certain level of attenuation. We analyse two cases (i) when the system is known and (ii) when the system is unknown. We measure the performance of the controller by the deviation of the controller's cost for a sequence of cost functions with respect to an attenuation $γ$, $R^p_t$. We propose an online controller and present guarantees for the metric $R^p_t$ when the maximum possible attenuation is given by $\overlineγ$, which is a system constant. We show that when the controller has preview of the cost functions and the disturbances for a short duration of time and the system is known $R^p_T(γ) = O(1)$ when $γ\geq γ_c$, where $γ_c = \mathcal{O}(\overlineγ)$. We then show that when the system is unknown the proposed controller with a preview of the cost functions and the disturbances for a short horizon achieves $R^p_T(γ) = \mathcal{O}(N) + \mathcal{O}(1) + \mathcal{O}((T-N)g(N))$, when $γ\geq γ_c$, where $g(N)$ is the accuracy of a given nonlinear estimator and $N$ is the duration of the initial estimation period. We also characterize the lower bound on the required prediction horizon for these guarantees to hold in terms of the system constants.

preprint2022arXiv

Optimal Storage and Solar Capacity of a Residential Household under Net Metering and Time-of-Use Pricing

Incentive programs and ongoing reduction in costs are driving joint installation of solar PV panels and storage systems in residential households. There is a need for optimal investment decisions to reduce the electricity consumption costs of the households further. In this paper, we first develop analytical expression of storage investment decision and then of solar investment decision for a household which is under net metering billing mechanism with time of use pricing condition. Using real data of a residential household in Austin, TX, USA, we study how the investment decisions would provide benefit for a period of one year. Results show significant profit when using storage devices and solar panels optimally for the system. It is important to note that though our approach can help significantly to take investment decisions, the solution will still be sub-optimal for somebody who needs optimal investment jointly on both storage and solar systems.

preprint2021arXiv

Neuroscience-Inspired Algorithms for the Predictive Maintenance of Manufacturing Systems

If machine failures can be detected preemptively, then maintenance and repairs can be performed more efficiently, reducing production costs. Many machine learning techniques for performing early failure detection using vibration data have been proposed; however, these methods are often power and data-hungry, susceptible to noise, and require large amounts of data preprocessing. Also, training is usually only performed once before inference, so they do not learn and adapt as the machine ages. Thus, we propose a method of performing online, real-time anomaly detection for predictive maintenance using Hierarchical Temporal Memory (HTM). Inspired by the human neocortex, HTMs learn and adapt continuously and are robust to noise. Using the Numenta Anomaly Benchmark, we empirically demonstrate that our approach outperforms state-of-the-art algorithms at preemptively detecting real-world cases of bearing failures and simulated 3D printer failures. Our approach achieves an average score of 64.71, surpassing state-of-the-art deep-learning (49.38) and statistical (61.06) methods.

preprint2020arXiv

Improved Attention Models for Memory Augmented Neural Network Adaptive Controllers

We introduced a {\it working memory} augmented adaptive controller in our recent work. The controller uses attention to read from and write to the working memory. Attention allows the controller to read specific information that is relevant and update its working memory with information based on its relevance. The retrieved information is used to modify the final control input computed by the controller. We showed that this modification speeds up learning. In the above work, we used a soft-attention mechanism for the adaptive controller. Controllers that use soft attention or hard attention mechanisms are limited either because they can forget the information or fail to shift attention when the information they are reading becomes less relevant. We propose an attention mechanism that comprises of (i) a hard attention mechanism and additionally (ii) an attention reallocation mechanism. The attention reallocation enables the controller to reallocate attention to a different location when the relevance of the location it is reading from diminishes. The reallocation also ensures that the information stored in the memory before the shift in attention is retained which can be lost in both soft and hard attention mechanisms. We illustrate through detailed simulations of various scenarios for two link robot and three link robot arm systems we illustrate the effectiveness of the proposed attention mechanism.

preprint2020arXiv

Incentive Design in a Distributed Problem with Strategic Agents

In this paper, we consider a general distributed system with multiple agents who select and then implement actions in the system. The system has an operator with a centralized objective. The agents, on the other hand, are selfinterested and strategic in the sense that each agent optimizes its own individual objective. The operator aims to mitigate this misalignment by designing an incentive scheme for the agents. The problem is difficult due to the cost functions of the agents being coupled, the objective of the operator not being social welfare, and the operator having no direct control over actions being implemented by the agents. This problem has been studied in many fields, particularly in mechanism design and cost allocation. However, mechanism design typically assumes that the operator has knowledge of the cost functions of the agents and the actions being implemented by the operator. On the other hand, cost allocation classically assumes that agents do not anticipate the effect of their actions on the incentive that they obtain. We remove these assumptions and present an incentive rule for this setup by bridging the gap between mechanism design and classical cost allocation. We analyze whether the proposed design satisfies various desirable properties such as social optimality, budget balance, participation constraint, and so on. We also analyze which of these properties can be satisfied if the assumptions of cost functions of the agents being private and the agents being anticipatory are relaxed.

preprint2020arXiv

Memory Augmented Neural Network Adaptive Controller for Strict Feedback Nonlinear Systems

In this work, we consider the adaptive nonlinear control problem for strict feedback nonlinear systems, where the functions that determine the dynamics of the system are completely unknown. We assume that certain upper bounds for the functions $g_i$s of the system are known. The objective of the control design is to design an adaptive controller that can adapt to changes in the unknown functions that are even abrupt. We propose a novel backstepping memory augmented NN (MANN) adaptive control method for the control of strict feedback non-linear systems. Here, each NN, in the backstepping NN adaptive controller, is augmented with an external working memory. The NN can write relevant information to its working memory and later retrieve them to modify its output, thus providing it with the capability to leverage past learned information effectively and improve its speed of learning. We propose a specific design for this external memory interface and show that the proposed control design achieves bounded stability for the closed loop system. We also provide substantial numerical evidence showing that the proposed memory augmentation improves the speed of learning by a significant margin.

preprint2020arXiv

Online Algorithms for Dynamic Matching Markets in Power Distribution Systems

This paper proposes online algorithms for dynamic matching markets in power distribution systems, which at any real-time operation instance decides about matching -- or delaying the supply of -- flexible loads with available renewable generation with the objective of maximizing the social welfare of the exchange in the system. More specifically, two online matching algorithms are proposed for the following generation-load scenarios: (i) when the mean of renewable generation is greater than the mean of the flexible load, and (ii) when the condition (i) is reversed. With the intuition that the performance of such algorithms degrades with increasing randomness of the supply and demand, two properties are proposed for assessing the performance of the algorithms. First property is convergence to optimality (CO) as the underlying randomness of renewable generation and customer loads goes to zero. The second property is deviation from optimality, is measured as a function of the standard deviation of the underlying randomness of renewable generation and customer loads. The algorithm proposed for the first scenario is shown to satisfy CO and a deviation from optimal that varies linearly with the variation in the standard deviation. But the same algorithm is shown to not satisfy CO for the second scenario. We then show that the algorithm proposed for the second scenario satisfies CO and a deviation from optimal that varies linearly with the variation in standard deviation plus an offset.

preprint2020arXiv

Scene-Graph Augmented Data-Driven Risk Assessment of Autonomous Vehicle Decisions

Despite impressive advancements in Autonomous Driving Systems (ADS), navigation in complex road conditions remains a challenging problem. There is considerable evidence that evaluating the subjective risk level of various decisions can improve ADS' safety in both normal and complex driving scenarios. However, existing deep learning-based methods often fail to model the relationships between traffic participants and can suffer when faced with complex real-world scenarios. Besides, these methods lack transferability and explainability. To address these limitations, we propose a novel data-driven approach that uses scene-graphs as intermediate representations. Our approach includes a Multi-Relation Graph Convolution Network, a Long-Short Term Memory Network, and attention layers for modeling the subjective risk of driving maneuvers. To train our model, we formulate this task as a supervised scene classification problem. We consider a typical use case to demonstrate our model's capabilities: lane changes. We show that our approach achieves a higher classification accuracy than the state-of-the-art approach on both large (96.4% vs. 91.2%) and small (91.8% vs. 71.2%) synthesized datasets, also illustrating that our approach can learn effectively even from smaller datasets. We also show that our model trained on a synthesized dataset achieves an average accuracy of 87.8% when tested on a real-world dataset compared to the 70.3% accuracy achieved by the state-of-the-art model trained on the same synthesized dataset, showing that our approach can more effectively transfer knowledge. Finally, we demonstrate that the use of spatial and temporal attention layers improves our model's performance by 2.7% and 0.7% respectively, and increases its explainability.

preprint2014arXiv

A Global Identifiability Condition for Consensus Networks with Tree Graphs

In this paper we present a sufficient condition that guarantees identifiability of linear network dynamic systems exhibiting continuous-time weighted consensus protocols with acyclic structure. Each edge of the underlying network graph $\mathcal G$ of the system is defined by a constant parameter, referred to as the weight of the edge, while each node is defined by a scalar state whose dynamics evolve as the weighted linear combination of its difference with the states of its neighboring nodes. Following the classical definitions of identifiability and indistinguishability, we first derive a condition that ensure the identifiability of the edge weights of $\mathcal G$ in terms of the associated transfer function. Using this characterization, we propose a sensor placement algorithm that guarantees identifiability of the edge weights. We describe our results using several illustrative examples.

preprint2013arXiv

Computational Modeling of Channelrhodopsin-2 Photocurrent Characteristics in Relation to Neural Signaling

Channelrhodopsins-2 (ChR2) are a class of light sensitive proteins that offer the ability to use light stimulation to regulate neural activity with millisecond precision. In order to address the limitations in the efficacy of the wild-type ChR2 (ChRwt) to achieve this objective, new variants of ChR2 that exhibit fast mono-exponential photocurrent decay characteristics have been recently developed and validated. In this paper, we investigate whether the framework of transition rate model with 4 states, primarily developed to mimic the bi-exponential photocurrent decay kinetics of ChRwt, as opposed to the low complexity 3 state model, is warranted to mimic the mono-exponential photocurrent decay kinetics of the newly developed fast ChR2 variants: ChETA (Gunaydin et al., Nature Neurosci, 13:387-392, 2010) and ChRET/TC (Berndt et al., PNAS, 108:7595-7600, 2011). We begin by estimating the parameters for the 3-state and 4-state models from experimental data on the photocurrent kinetics of ChRwt, ChETA and ChRET/TC. We then incorporate these models into a fast-spiking interneuron model (Wang and Buzsaki., J Neurosci, 16:6402-6413,1996) and a hippocampal pyramidal cell model (Golomb et al., J Neurophysiol, 96:1912-1926, 2006) and investigate the extent to which the experimentally observed neural response to various optostimulation protocols can be captured by these models. We demonstrate that for all ChR2 variants investigated, the 4 state model implementation is better able to capture neural response consistent with experiments across wide range of optostimulation protocol. We conclude by analytically investigating the conditions under which the characteristic specific to the 3-state model, namely the mono-exponential photocurrent decay of the newly developed variants of ChR2, can occurs in the framework of the 4-state model.

preprint2013arXiv

Fast SVM training using approximate extreme points

Applications of non-linear kernel Support Vector Machines (SVMs) to large datasets is seriously hampered by its excessive training time. We propose a modification, called the approximate extreme points support vector machine (AESVM), that is aimed at overcoming this burden. Our approach relies on conducting the SVM optimization over a carefully selected subset, called the representative set, of the training dataset. We present analytical results that indicate the similarity of AESVM and SVM solutions. A linear time algorithm based on convex hulls and extreme points is used to compute the representative set in kernel space. Extensive computational experiments on nine datasets compared AESVM to LIBSVM \citep{LIBSVM}, CVM \citep{Tsang05}, BVM \citep{Tsang07}, LASVM \citep{Bordes05}, $\text{SVM}^{\text{perf}}$ \citep{Joachims09}, and the random features method \citep{rahimi07}. Our AESVM implementation was found to train much faster than the other methods, while its classification accuracy was similar to that of LIBSVM in all cases. In particular, for a seizure detection dataset, AESVM training was almost $10^3$ times faster than LIBSVM and LASVM and more than forty times faster than CVM and BVM. Additionally, AESVM also gave competitively fast classification times.

preprint2013arXiv

Signal Reconstruction via H-infinity Sampled-Data Control Theory: Beyond the Shannon Paradigm

This paper presents a new method for signal reconstruction by leveraging sampled-data control theory. We formulate the signal reconstruction problem in terms of an analog performance optimization problem using a stable discrete-time filter. The proposed H-infinity performance criterion naturally takes intersample behavior into account, reflecting the energy distributions of the signal. We present methods for computing optimal solutions which are guaranteed to be stable and causal. Detailed comparisons to alternative methods are provided. We discuss some applications in sound and image reconstruction.

Pramod P. Khargonekar

What is connected

Connect this record

See the researcher in context

Building this map preview

15 published item(s)

Long-term Fairness For Real-time Decision Making: A Constrained Online Optimization Approach

Competing Bandits in Time Varying Matching Markets

Meta-Learning Online Control for Linear Dynamical Systems

Online Learning Robust Control of Nonlinear Dynamical Systems

Optimal Storage and Solar Capacity of a Residential Household under Net Metering and Time-of-Use Pricing

Neuroscience-Inspired Algorithms for the Predictive Maintenance of Manufacturing Systems

Improved Attention Models for Memory Augmented Neural Network Adaptive Controllers

Incentive Design in a Distributed Problem with Strategic Agents

Memory Augmented Neural Network Adaptive Controller for Strict Feedback Nonlinear Systems

Online Algorithms for Dynamic Matching Markets in Power Distribution Systems

Scene-Graph Augmented Data-Driven Risk Assessment of Autonomous Vehicle Decisions

A Global Identifiability Condition for Consensus Networks with Tree Graphs

Computational Modeling of Channelrhodopsin-2 Photocurrent Characteristics in Relation to Neural Signaling

Fast SVM training using approximate extreme points

Signal Reconstruction via H-infinity Sampled-Data Control Theory: Beyond the Shannon Paradigm