Source author record

Saverio Bolognani

Saverio Bolognani appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC Systems and Control eess.SY Computer Science and Game Theory Artificial Intelligence cs.CY Machine Learning Multiagent Systems quant-ph Robotics

Catalog footprint

What is connected

19works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Towards Model-Free Learning in Dynamic Population Games: An Application to Karma Economies

Dynamic Population Games (DPGs) provide a tractable framework for modeling strategic interactions in large populations of self-interested agents, and have been successfully applied to the design of Karma economies, a class of fair non-monetary resource allocation mechanisms. Despite their appealing theoretical properties, existing computational tools for DPGs assume full knowledge of the game model and operate in a centralized fashion, limiting their applicability in realistic settings where agents have access only to their own private experience. This paper takes a step towards addressing this gap by studying model-free equilibrium learning in Karma DPGs. First, we analyze the setting in which a novel agent joins a Karma DPG already at its Stationary Nash Equilibrium (SNE) and learns a policy via Deep Q-Networks (DQN) without knowledge of the game model. Leveraging recent convergence results for DQN, we establish a suboptimality bound consisting of a DQN approximation error of order $O(1/\sqrt{N_s})$ and a mean field perturbation error of order $O(1/N)$, where $N_s$ is the replay buffer size and $N$ is the population size. Second, we consider the challenging problem of learning the SNE from scratch. We show empirically that combining deep RL with fictitious play and smoothed policy iteration allows agents to converge, in a model-free fashion, to a configuration close to the centrally computed SNE. Together, these contributions support the vision of Karma economies as practical tools for fair resource allocation.

preprint2024arXiv

Optimization Algorithms as Robust Feedback Controllers

Mathematical optimization is one of the cornerstones of modern engineering research and practice. Yet, throughout all application domains, mathematical optimization is, for the most part, considered to be a numerical discipline. Optimization problems are formulated to be solved numerically with specific algorithms running on microprocessors. An emerging alternative is to view optimization algorithms as dynamical systems. Besides being insightful in itself, this perspective liberates optimization methods from specific numerical and algorithmic aspects and opens up new possibilities to endow complex real-world systems with sophisticated self-optimizing behavior. Towards this goal, it is necessary to understand how numerical optimization algorithms can be converted into feedback controllers to enable robust "closed-loop optimization". In this article, we focus on recent control designs under the name of "feedback-based optimization" which implement optimization algorithms directly in closed loop with physical systems. In addition to a brief overview of selected continuous-time dynamical systems for optimization, our particular emphasis in this survey lies on closed-loop stability as well as the robust enforcement of physical and operational constraints in closed-loop implementations. To bypass accessing partial model information of physical systems, we further elaborate on fully data-driven and model-free operations. We highlight an emerging application in autonomous reserve dispatch in power systems, where the theory has transitioned to practice by now. We also provide short expository reviews of pioneering applications in communication networks and electricity grids, as well as related research streams, including extremum seeking and pertinent methods from model predictive and process control, to facilitate high-level comparisons with the main topic of this survey.

preprint2023arXiv

A Classification of Feedback Loops and Their Relation to Biases in Automated Decision-Making Systems

Prediction-based decision-making systems are becoming increasingly prevalent in various domains. Previous studies have demonstrated that such systems are vulnerable to runaway feedback loops, e.g., when police are repeatedly sent back to the same neighborhoods regardless of the actual rate of criminal activity, which exacerbate existing biases. In practice, the automated decisions have dynamic feedback effects on the system itself that can perpetuate over time, making it difficult for short-sighted design choices to control the system's evolution. While researchers started proposing longer-term solutions to prevent adverse outcomes (such as bias towards certain groups), these interventions largely depend on ad hoc modeling assumptions and a rigorous theoretical understanding of the feedback dynamics in ML-based decision-making systems is currently missing. In this paper, we use the language of dynamical systems theory, a branch of applied mathematics that deals with the analysis of the interconnection of systems with dynamic behaviors, to rigorously classify the different types of feedback loops in the ML-based decision-making pipeline. By reviewing existing scholarly work, we show that this classification covers many examples discussed in the algorithmic fairness community, thereby providing a unifying and principled framework to study feedback loops. By qualitative analysis, and through a simulation example of recommender systems, we show which specific types of ML biases are affected by each type of feedback loop. We find that the existence of feedback loops in the ML-based decision-making pipeline can perpetuate, reinforce, or even reduce ML biases.

preprint2022arXiv

Adaptive Real-Time Grid Operation via Online Feedback Optimization with Sensitivity Estimation

In this paper we propose an approach based on an Online Feedback Optimization (OFO) controller with grid input-output sensitivity estimation for real-time grid operation, e.g., at subsecond time scales. The OFO controller uses grid measurements as feedback to update the value of the controllable elements in the grid, and track the solution of a time-varying AC Optimal Power Flow (AC-OPF). Instead of relying on a full grid model, e.g., grid admittance matrix, OFO only requires the steady-state sensitivity relating a change in the controllable inputs, e.g., power injections set-points, to a change in the measured outputs, e.g., voltage magnitudes. Since an inaccurate sensitivity may lead to a model-mismatch and jeopardize the performance, we propose a recursive least-squares estimation that enables OFO to learn the sensitivity from measurements during real-time operation, turning OFO into a model-free approach. We analytically certify the convergence of the proposed OFO with sensitivity estimation, and validate its performance on a simulation using the IEEE 123-bus test feeder, and comparing it against a state-of-the-art OFO with constant sensitivity.

preprint2022arXiv

Cross-layer Design for Real-Time Grid Operation: Estimation, Optimization and Power Flow

In this paper, we propose a combined Online Feedback Optimization (OFO) and dynamic estimation approach for a real-time power grid operation under time-varying conditions. A dynamic estimation uses grid measurements to generate the information required by an OFO controller, that incrementally steers the controllable power injections set-points towards the solutions of a time-varying AC Optimal Power Flow (AC-OPF) problem. More concretely, we propose a quadratic programming-based OFO that guarantees satisfying the grid operational constraints, like admissible voltage limits. Within the estimation, we design an online power flow solver that efficiently computes power flow approximations in real time. Finally, we certify the stability and convergence of this combined approach under time-varying conditions, and we validate its effectiveness on a simulation with a test feeder and high resolution consumption data.

preprint2022arXiv

Experimental Validation of Feedback Optimization in Power Distribution Grids

We consider the problem of controlling the voltage of a distribution feeder using the reactive power capabilities of inverters. On a real distribution grid, we compare the local Volt/VAr droop control recommended in recent grid codes, a centralized dispatch based on optimal power flow (OPF) programming, and a feedback optimization (FO) controller that we propose. The local droop control yields suboptimal regulation, as predicted analytically. The OPF-based dispatch strategy requires an accurate grid model and measurement of all loads on the feeder in order to achieve proper voltage regulation. However, in the experiment, the OPF-based strategy violates voltage constraints due to inevitable model mismatch and uncertainties. Our proposed FO controller, on the other hand, satisfies the constraints and does not require load measurements or any grid state estimation. The only needed model knowledge is the sensitivity of the voltages with respect to reactive power, which can be obtained from data. As we show, an approximation of these sensitivities is also sufficient, which makes the approach essentially model-free, easy to tune, compatible with the current sensing and control infrastructure, and remarkably robust to measurement noise. We expect these properties to be fundamental features of FO for power systems and not specific to Volt/VAr regulation or to distribution grids.

preprint2022arXiv

Experimental Validation of Fully Distributed Peer-to-Peer Optimal Voltage Control with Minimal Model Requirements

This paper addresses the problem of voltage regulation in a power distribution grid using the reactive power injections of grid-connected power inverters. We first discuss how purely local voltage control schemes cannot regulate the voltages within a desired range under all circumstances and may even yield detrimental control decisions. Communication and, through that, coordination are therefore needed. On the other hand, short-range peer-to-peer communication and knowledge of electric distances between neighbouring controllers are sufficient for this task. We implement such a peer-to-peer controller and test it on a 400~V distribution feeder with asynchronous communication channels, confirming its viability on real-life systems. Finally, we analyze the scalability of this approach with respect to the number of agents on the feeder that participate in the voltage regulation task.

preprint2021arXiv

A Dynamic Population Model of Strategic Interaction and Migration under Epidemic Risk

In this paper, we show how a dynamic population game can model the strategic interaction and migration decisions made by a large population of agents in response to epidemic prevalence. Specifically, we consider a modified susceptible-asymptomatic-infected-recovered (SAIR) epidemic model over multiple zones. Agents choose whether to activate (i.e., interact with others), how many other agents to interact with, and which zone to move to in a time-scale which is comparable with the epidemic evolution. We define and analyze the notion of equilibrium in this game, and investigate the transient behavior of the epidemic spread in a range of numerical case studies, providing insights on the effects of the agents' degree of future awareness, strategic migration decisions, as well as different levels of lockdown and other interventions. One of our key findings is that the strategic behavior of agents plays an important role in the progression of the epidemic and can be exploited in order to design suitable epidemic control measures.

preprint2021arXiv

Optimal Placement of Virtual Inertia in Power Grids

A major transition in the operation of electric power grids is the replacement of synchronous machines by distributed generation connected via power electronic converters. The accompanying "loss of rotational inertia" and the fluctuations by renewable sources jeopardize the system stability, as testified by the ever-growing number of frequency incidents. As a remedy, numerous studies demonstrate how virtual inertia can be emulated through various devices, but few of them address the question of "where" to place this inertia. It is however strongly believed that the placement of virtual inertia hugely impacts system efficiency, as demonstrated by recent case studies. In this article, we carry out a comprehensive analysis in an attempt to address the optimal inertia placement problem. We consider a linear network-reduced power system model along with an H2 performance metric accounting for the network coherency. The optimal inertia placement problem turns out to be non-convex, yet we provide a set of closed-form global optimality results for particular problem instances as well as a computational approach resulting in locally optimal solutions. Further, we also consider the robust inertia allocation problem, wherein the optimization is carried out accounting for the worst-case disturbance location. We illustrate our results with a three-region power grid case study and compare our locally optimal solution with different placement heuristics in terms of different performance metrics.

preprint2021arXiv

Posetal Games: Efficiency, Existence, and Refinement of Equilibria in Games with Prioritized Metrics

Modern applications require robots to comply with multiple, often conflicting rules and to interact with the other agents. We present Posetal Games as a class of games in which each player expresses a preference over the outcomes via a partially ordered set of metrics. This allows one to combine hierarchical priorities of each player with the interactive nature of the environment. By contextualizing standard game theoretical notions, we provide two sufficient conditions on the preference of the players to prove existence of pure Nash Equilibria in finite action sets. Moreover, we define formal operations on the preference structures and link them to a refinement of the game solutions, showing how the set of equilibria can be systematically shrunk. The presented results are showcased in a driving game where autonomous vehicles select from a finite set of trajectories. The results demonstrate the interpretability of results in terms of minimum-rank-violation for each player.

preprint2021arXiv

Sampled-Data Online Feedback Equilibrium Seeking: Stability and Tracking

This paper proposes a general framework for constructing feedback controllers that drive complex dynamical systems to "efficient" steady-state (or slowly varying) operating points. Efficiency is encoded using generalized equations which can model a broad spectrum of useful objectives, such as optimality or equilibria (e.g. Nash, Wardrop, etc.) in noncooperative games. The core idea of the proposed approach is to directly implement iterative solution (or equilibrium seeking) algorithms in closed loop with physical systems. Sufficient conditions for closed-loop stability and robustness are derived; these also serve as the first closed-loop stability results for sampled-data feedback-based optimization. Numerical simulations of smart building automation and game-theoretic robotic swarm coordination support the theoretical results.

preprint2020arXiv

Non-convex Feedback Optimization with Input and Output Constraints

In this paper, we present a novel control scheme for feedback optimization. That is, we propose a discrete-time controller that can steer the steady state of a physical plant to the solution of a constrained optimization problem without numerically solving the problem. Our controller can be interpreted as a discretization of a continuous-time projected gradient flow. Compared to other schemes used for feedback optimization, such as saddle-point flows or inexact penalty methods, our algorithm combines several desirable properties: It asymptotically enforces constraints on the plant steady-state outputs, and temporary constraint violations can be easily quantified. Our algorithm requires only reduced model information in the form of steady-state input-output sensitivities of the plant. Further, as we prove in this paper, global convergence is guaranteed even for non-convex problems. Finally, our algorithm is straightforward to tune, since the step-size is the only tuning parameter.

preprint2020arXiv

Projected Dynamical Systems on Irregular, Non-Euclidean Domains for Nonlinear Optimization

Continuous-time projected dynamical systems are an elementary class of discontinuous dynamical systems with trajectories that remain in a feasible domain by means of projecting outward-pointing vector fields. They are essential when modeling physical saturation in control systems, constraints of motion, as well as studying projection-based numerical optimization algorithms. Motivated by the emerging application of feedback-based continuous-time optimization schemes that rely on the physical system to enforce nonlinear hard constraints, we study the fundamental properties of these dynamics on general locally-Euclidean sets. Among others, we propose the use of Krasovskii solutions, show their existence on nonconvex, irregular subsets of low-regularity Riemannian manifolds, and investigate how they relate to conventional Carathéodory solutions. Furthermore, we establish conditions for uniqueness, thereby introducing a generalized definition of prox-regularity which is suitable for non-flat domains. Finally, we use these results to study the stability and convergence of projected gradient flows as an illustrative application of our framework. We provide simple counter-examples for our main results to illustrate the necessity of our already weak assumptions.

preprint2018arXiv

Generic Existence of Unique Lagrange Multipliers in AC Optimal Power Flow

Solutions to nonlinear, nonconvex optimization problems can fail to satisfy the KKT optimality conditions even when they are optimal. This is due to the fact that unless constraint qualifications (CQ) are satisfied, Lagrange multipliers may fail to exist. Even if the KKT conditions are applicable, the multipliers may not be unique. These possibilities also affect AC optimal power flow (OPF) problems which are routinely solved in power systems planning, scheduling and operations. The complex structure -- in particular the presence of the nonlinear power flow equations which naturally exhibit a structural degeneracy -- make any attempt to establish CQs for the entire class of problems very challenging. In this paper, we resort to tools from differential topology to show that for AC OPF problems in various contexts the linear independence constraint qualification is satisfied almost certainly, thus effectively obviating the usual assumption on CQs. Consequently, for any local optimizer there generically exists a unique set of multipliers that satisfy the KKT conditions.

preprint2018arXiv

Stability of Dynamic Feedback Optimization with Applications to Power Systems

We consider the problem of optimizing the steady state of a dynamical system in closed loop. Conventionally, the design of feedback optimization control laws assumes that the system is stationary. However, in reality, the dynamics of the (slow) iterative optimization routines can interfere with the (fast) system dynamics. We provide a study of the stability and convergence of these feedback optimization setups in closed loop with the underlying plant, via a custom-tailored singular perturbation analysis result. Our study is particularly geared towards applications in power systems and the question whether recently developed online optimization schemes can be deployed without jeopardizing dynamic system stability.

preprint2018arXiv

Time-varying Projected Dynamical Systems with Applications to Feedback Optimization of Power Systems

This paper is concerned with the study of continuous-time, non-smooth dynamical systems which arise in the context of time-varying non-convex optimization problems, as for example the feedback-based optimization of power systems. We generalize the notion of projected dynamical systems to time-varying, possibly non-regular, domains and derive conditions for the existence of so-called Krasovskii solutions. The key insight is that for trajectories to exist, informally, the time-varying domain can only contract at a bounded rate whereas it may expand discontinuously. This condition is met, in particular, by feasible sets delimited via piecewise differentiable functions under appropriate constraint qualifications. To illustrate the necessity and usefulness of such a general framework, we consider a simple yet insightful power system example, and we discuss the implications of the proposed conditions for the design of feedback optimization schemes.

preprint2014arXiv

Distributed reactive power feedback control for voltage regulation and loss minimization

We consider the problem of exploiting the microgenerators dispersed in the power distribution network in order to provide distributed reactive power compensation for power losses minimization and voltage regulation. In the proposed strategy, microgenerators are smart agents that can measure their phasorial voltage, share these data with the other agents on a cyber layer, and adjust the amount of reactive power injected into the grid, according to a feedback control law that descends from duality-based methods applied to the optimal reactive power flow problem. Convergence to the configuration of minimum losses and feasible voltages is proved analytically for both a synchronous and an asynchronous version of the algorithm, where agents update their state independently one from the other. Simulations are provided in order to illustrate the performance and the robustness of the algorithm, and the innovative feedback nature of such strategy is discussed.

preprint2012arXiv

A distributed control strategy for reactive power compensation in smart microgrids

We consider the problem of optimal reactive power compensation for the minimization of power distribution losses in a smart microgrid. We first propose an approximate model for the power distribution network, which allows us to cast the problem into the class of convex quadratic, linearly constrained, optimization problems. We then consider the specific problem of commanding the microgenerators connected to the microgrid, in order to achieve the optimal injection of reactive power. For this task, we design a randomized, gossip-like optimization algorithm. We show how a distributed approach is possible, where microgenerators need to have only a partial knowledge of the problem parameters and of the state, and can perform only local measurements. For the proposed algorithm, we provide conditions for convergence together with an analytic characterization of the convergence speed. The analysis shows that, in radial networks, the best performance can be achieved when we command cooperation among units that are neighbors in the electric topology. Numerical simulations are included to validate the proposed model and to confirm the analytic results about the performance of the proposed algorithm.

preprint2009arXiv

Engineering Stable Discrete-Time Quantum Dynamics via a Canonical QR Decomposition

We analyze the asymptotic behavior of discrete-time, Markovian quantum systems with respect to a subspace of interest. Global asymptotic stability of subspaces is relevant to quantum information processing, in particular for initializing the system in pure states or subspace codes. We provide a linear-algebraic characterization of the dynamical properties leading to invariance and attractivity of a given quantum subspace. We then construct a design algorithm for discrete-time feedback control that allows to stabilize a target subspace, proving that if the control problem is feasible, then the algorithm returns an effective control choice. In order to prove this result, a canonical QR matrix decomposition is derived, and also used to establish the control scheme potential for the simulation of open-system dynamics.

Saverio Bolognani

What is connected

Connect this record

See the researcher in context

Building this map preview

19 published item(s)

Towards Model-Free Learning in Dynamic Population Games: An Application to Karma Economies

Optimization Algorithms as Robust Feedback Controllers

A Classification of Feedback Loops and Their Relation to Biases in Automated Decision-Making Systems

Adaptive Real-Time Grid Operation via Online Feedback Optimization with Sensitivity Estimation

Cross-layer Design for Real-Time Grid Operation: Estimation, Optimization and Power Flow

Experimental Validation of Feedback Optimization in Power Distribution Grids

Experimental Validation of Fully Distributed Peer-to-Peer Optimal Voltage Control with Minimal Model Requirements

A Dynamic Population Model of Strategic Interaction and Migration under Epidemic Risk

Optimal Placement of Virtual Inertia in Power Grids

Posetal Games: Efficiency, Existence, and Refinement of Equilibria in Games with Prioritized Metrics

Sampled-Data Online Feedback Equilibrium Seeking: Stability and Tracking

Non-convex Feedback Optimization with Input and Output Constraints

Projected Dynamical Systems on Irregular, Non-Euclidean Domains for Nonlinear Optimization

Generic Existence of Unique Lagrange Multipliers in AC Optimal Power Flow

Stability of Dynamic Feedback Optimization with Applications to Power Systems

Time-varying Projected Dynamical Systems with Applications to Feedback Optimization of Power Systems

Distributed reactive power feedback control for voltage regulation and loss minimization

A distributed control strategy for reactive power compensation in smart microgrids

Engineering Stable Discrete-Time Quantum Dynamics via a Canonical QR Decomposition