Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
22works
0followers
20topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

22 published item(s)

preprint2023arXiv

Probabilistic design of optimal sequential decision-making algorithms in learning and control

This survey is focused on certain sequential decision-making problems that involve optimizing over probability functions. We discuss the relevance of these problems for learning and control. The survey is organized around a framework that combines a problem formulation and a set of resolution methods. The formulation consists of an infinite-dimensional optimization problem. The methods come from approaches to search optimal solutions in the space of probability functions. Through the lenses of this overarching framework we revisit popular learning and control algorithms, showing that these naturally arise from suitable variations on the formulation mixed with different resolution methods. A running example, for which we make the code available, complements the survey. Finally, a number of challenges arising from the survey are also outlined.

preprint2022arXiv

A smart electric bike for smart cities

This is a Masters Thesis completed at University College Dublin, Ireland in 2017 which involved augmenting an off-the-shelf electric bike with sensors to enable new services to be delivered to cyclists in cities. The application of primary interest was to control the cyclist's ventilation rate based on the concentration of local air pollutants. Detailed modelling and system design is presented for our Cyberphysical system which consisted of a modified BTwin e-bike, Cycle Analyst sensors, the cyclist themselves, a Bluetooth connected smartphone and our algorithms. Control algorithms to regulate the proportion of power the cyclist provided as a proxy for their ventilation rate were proposed and validated in a basic way, which were later proven significantly further in Further Work (see IEEE Transactions on Intelligent Transportation Systems paper: https://ieeexplore.ieee.org/abstract/document/8357977). The basic idea was to provide more electrical assistance to cyclists in areas of high air pollution to reduce the cyclist ventilation rate and thereby the amount of air pollutants inhaled. This presents an interesting control challenge due to the human-in-the-loop characteristics and the potential for impactful real life applications. A background literature review is provided on energy as it relates to cycling and some other applications are also discussed. A link to a video which demonstrates the system is provided, and also to a blog published by IBM Research about the system.

preprint2022arXiv

Anomalous sorption kinetics of self-interacting particles by a spherical trap

In this paper we propose a computational framework for the investigation of the correlated motion between positive and negative ions exposed to the attraction of a bubble surface that mimics the (oscillating) cell membrane. The correlated diffusion of surfactants is described by a Poisson-Nernst-Planck (PNP) system, in which the drift term is given by the gradient of a potential which includes both the effect of the bubble and the Coulomb interaction between the carriers. The latter term is obtained from the solution of a self-consistent Poisson equation. For very short Debye lengths one can adopt the so called Quasi-Neutral limit which drastically simplifies the system, thus allowing for much faster numerical simulations. The paper has four main objectives. The first one is to present a PNP model that describes ion charges in presence of a trap. The second one is to provide benchmark tests for the validation of simplified multiscale models under current development [1]. The third one is to explore the relevance of the term describing the interaction among the apolar tails of the anions. The last one is to quantitatively explore the validity of the Quasi-Neutral limit by comparison with detailed numerical simulation for smaller and smaller Debye lengths. In order to reach these goals, we propose a simple and efficient Alternate Direction Implicit method for the numerical solution of the non-linear PNP system, which guarantees second order accuracy both in space and time, without requiring solution of nonlinear equation at each time step. New semi-implicit scheme for a simplified PNP system near quasi neutrality is also proposed.

preprint2022arXiv

External control of a genetic toggle switch via Reinforcement Learning

We investigate the problem of using a learning-based strategy to stabilize a synthetic toggle switch via an external control approach. To overcome the data efficiency problem that would render the algorithm unfeasible for practical use in synthetic biology, we adopt a sim-to-real paradigm where the policy is learnt via training on a simplified model of the toggle switch and it is then subsequently exploited to control a more realistic model of the switch parameterized from in-vivo experiments. Our in-silico experiments confirm the viability of the approach suggesting its potential use for in-vivo control implementations.

preprint2022arXiv

Implicit and semi-implicit well-balanced finite-volume methods for systems of balance laws

The aim of this work is to design implicit and semi-implicit high-order well-balanced finite-volume numerical methods for 1D systems of balance laws. The strategy introduced by two of the authors in a previous paper for explicit schemes based on the application of a well-balanced reconstruction operator has been applied. The well-balanced property is preserved when quadrature formulas are used to approximate the averages and the integral of the source term in the cells. Concerning the time evolution, this technique is combined with a time discretization method of type RK-IMEX or RK-implicit. The methodology will be applied to several systems of balance laws.

preprint2022arXiv

On a probabilistic approach to synthesize control policies from example datasets

This paper is concerned with the design of control policies from example datasets. The case considered is when just a black box description of the system to be controlled is available and the system is affected by actuation constraints. These constraints are not necessarily fulfilled by the (possibly, noisy) example data and the system under control is not necessarily the same as the one from which these data are collected. In this context, we introduce a number of theoretical results to compute a control policy from example datasets that: (i) makes the behavior of the closed-loop system similar to the one illustrated in the data; (ii) guarantees compliance with the constraints. We recast the control problem as a finite-horizon optimal control problem and give an explicit expression for its optimal solution. Moreover, we turn our findings into an algorithmic procedure. The procedure gives a systematic tool to compute the policy. The effectiveness of our approach is illustrated via a numerical example, where we use real data collected from test drives to synthesize a control policy for the merging of a car on a highway.

preprint2022arXiv

On the design of scalable networks rejecting first order disturbances

This paper is concerned with the problem of designing distributed control protocols for network systems affected by delays and disturbances consisting of a first-order polynomial component and a residual signal. Specifically, we propose the use of a multiplex architecture to design distributed control protocols to reject polynomial disturbances up to ramps and guarantee a scalability property that prohibits the amplification of residual disturbances. For this architecture, we give a sufficient condition on the control protocols to guarantee scalability and ramps rejection. The effectiveness of the result, which can be used to study networks of nonlinearly coupled nonlinear agents, is illustrated via a robot formation control problem.

preprint2022arXiv

Scalability in nonlinear network systems affected by delays and disturbances

This paper is concerned with the study of scalability in nonlinear heterogeneous networks affected by communication delays and disturbances. After formalizing the notion of scalability, we give two sufficient conditions to assess this property. Our results can be used to study leader-follower and leaderless networks and also allow to consider the case when the desired configuration of the system changes over time. We show how our conditions can be turned into design guidelines to guarantee scalability and illustrate their effectiveness via numerical examples.

preprint2021arXiv

A local velocity grid conservative semi-Lagrangian schemes for BGK model

Most numerical schemes proposed for solving BGK models for rarefied gas dynamics are based on the discrete velocity approximation. Since such approach uses fixed velocity grids, one must secure a sufficiently large domain with fine velocity grids to resolve the structure of distribution functions. When one treats high Mach number problems, the computational cost becomes prohibitively expensive. In this paper, we propose a velocity adaptation technique in the semi-Lagrangian framework for BGK model. The velocity grid will be set locally in time and space, according to mean velocity and temperature. We apply a weighted minimization approach to impose conservation. We presented several numerical tests that illustrate the effectiveness of our proposed scheme.

preprint2021arXiv

A meshfree arbitrary Lagrangian-Eulerian method for the BGK model of the Boltzmann equation with moving boundaries

In this paper we present a novel technique for the simulation of moving boundaries and moving rigid bodies immersed in a rarefied gas using an Eulerian-Lagrangian formulation based on least square method. The rarefied gas is simulated by solving the Bhatnagar-Gross-Krook (BGK) model for the Boltzmann equation of rarefied gas dynamics. The BGK model is solved by an Arbitrary Lagrangian-Eulerian (ALE) method, where grid-points/particles are moved with the mean velocity of the gas. The computational domain for the rarefied gas changes with time due to the motion of the boundaries. To allow a simpler handling of the interface motion we have used a meshfree method based on a least-square approximation for the reconstruction procedures required for the scheme. We have considered a one way, as well as a two-way coupling of boundaries/rigid bodies and gas flow. The numerical results are compared with analytical as well as with Direct Simulation Monte Carlo (DSMC) solutions of the Boltzmann equation. Convergence studies are performed for one-dimensional and two-dimensional test-cases. Several further test problems and applications illustrate the versatility of the approach.

preprint2021arXiv

BGK models for inert mixtures: comparison and applications

Consistent BGK models for inert mixtures are compared, first in their kinetic behavior and then versus the hydrodynamic limits that can be derived in different collision-dominated regimes. The comparison is carried out both analytically and numerically, for the latter using an asymptotic preserving semi-Lagrangian scheme for the BGK models. Application to the plane shock wave in a binary mixture of noble gases is also presented.

preprint2021arXiv

Intermittent non-pharmaceutical strategies to mitigate the COVID-19 epidemic in a network model of Italy via constrained optimization

This paper is concerned with the design of intermittent non-pharmaceutical strategies to mitigate the spread of the COVID-19 epidemic exploiting network epidemiological models. Specifically, by studying a variational equation for the dynamics of the infected in a network model of the epidemic spread, we derive, using contractivity arguments, a condition that can be used to guarantee that, in epidemiological terms, the effective reproduction number is less than unity. This condition has three advantages: (i) it is easily computable; (ii) it is directly related to the model parameters; (iii) it can be used to enforce a scalability condition that prohibits the amplification of disturbances within the network system. We then include satisfaction of such a condition as a constraint in a Model Predictive Control problem so as to mitigate (or suppress) the spread of the epidemic while minimizing the economic impact of the interventions. A data-driven model of Italy as a network of three macro-regions (North, Center, and South), whose parameters are identified from real data, is used to illustrate and evaluate the effectiveness of the proposed control strategy.

preprint2021arXiv

Lack of practical identifiability may hamper reliable predictions in COVID-19 epidemic models

Compartmental models are widely adopted to describe and predict the spreading of infectious diseases. The unknown parameters of such models need to be estimated from the data. Furthermore, when some of the model variables are not empirically accessible, as in the case of asymptomatic carriers of COVID-19, they have to be obtained as an outcome of the model. Here, we introduce a framework to quantify how the uncertainty in the data impacts the determination of the parameters and the evolution of the unmeasured variables of a given model. We illustrate how the method is able to characterize different regimes of identifiability, even in models with few compartments. Finally, we discuss how the lack of identifiability in a realistic model for COVID-19 may prevent reliable forecasting of the epidemic dynamics.

preprint2021arXiv

Matrix measures, stability and contraction theory for dynamical systems on time scales

This paper is concerned with the study of the stability of dynamical systems evolving on time scales. We first {formalize the notion of matrix measures on time scales, prove some of their key properties and make use of this notion to study both linear and nonlinear dynamical systems on time scales.} Specifically, we start with considering linear time-varying systems and, for these, we prove a time scale analogous of an upper bound due to Coppel. We make use of this upper bound to give stability and input-to-state stability conditions for linear time-varying systems. {Then, we consider nonlinear time-varying dynamical systems on time scales and} establish a sufficient condition for the convergence of the solutions. Finally, after linking our results to the existence of a Lyapunov function, we make use of our approach to study certain epidemic dynamics and complex networks. For the former, we give a sufficient condition on the parameters of a SIQR model on time scales ensuring that its solutions converge to the disease-free solution. For the latter, we first give a sufficient condition for pinning controllability of complex time scale networks and then use this condition to study certain collective opinion dynamics. The theoretical results are complemented with simulations.

preprint2020arXiv

Convergence estimates of a semi-Lagrangian scheme for the ellipsoidal BGK model for polyatomic molecules

In this paper, we propose a new semi-Lagrangian scheme for the polyatomic ellipsoidal BGK model. In order to avoid time step restrictions coming from convection term and small Knudsen number, we combine a semi-Lagrangian approach for the convection term with an implicit treatment for the relaxation term. We show how to explicitly solve the implicit step, thus obtaining an efficient and stable scheme for any Knudsen number. We also derive an explicit error estimate on the convergence of the proposed scheme for every fixed value of the Knudsen number.

preprint2020arXiv

Driving Reinforcement Learning with Models

In this paper we propose a new approach to complement reinforcement learning (RL) with model-based control (in particular, Model Predictive Control - MPC). We introduce an algorithm, the MPC augmented RL (MPRL) that combines RL and MPC in a novel way so that they can augment each other's strengths. We demonstrate the effectiveness of the MPRL by letting it play against the Atari game Pong. For this task, the results highlight how MPRL is able to outperform both RL and MPC when these are used individually.

preprint2020arXiv

Intermittent yet coordinated regional strategies can alleviate the COVID-19 epidemic: a network model of the Italian case

The COVID-19 epidemic that emerged in Wuhan China at the end of 2019 hit Italy particularly hard, yielding the implementation of strict national lockdown rules (Phase 1). There is now a hot ongoing debate in Italy and abroad on what the best strategy is to restart a country to exit a national lockdown (Phase 2). Previous studies have focused on modelling possible restarting scenarios at the national level, overlooking the fact that Italy, as other nations around the world, is divided in administrative regions who can independently oversee their own share of the Italian National Health Service. In this study, we show that regionalism, and heterogeneity between regions, is essential to understand the spread of the epidemic and, more importantly, to design effective post Lock-Down strategies to control the disease. To achieve this, we model Italy as a network of regions and parameterize the model of each region on real data spanning almost two months from the initial outbreak. Using the model, we confirm the effectiveness at the regional level of the national lockdown strategy implemented so far by the Italian government to mitigate the spread of the disease and show its efficacy at the regional level. We also propose that differentiated, albeit coordinated, regional interventions can be effective in Phase 2 to restart the country and avoid future recurrence of the epidemic, while avoiding saturation of the regional health systems and mitigating impact on costs. Our study and methodology can be easily extended to other levels of granularity (provinces or counties in the same region or states in other federal countries, etc.) to support policy- and decision-makers.

preprint2020arXiv

Multi-moment maps on nearly Kähler six-manifolds

We study multi-moment maps induced by a two-torus action on the four homogeneous nearly Kähler six-manifolds. Their explicit expression and stationary orbits are derived. The configuration of fixed-points and one-dimensional orbits is worked out for generic six-manifolds equipped with an $\mathrm{SU}(3)$-structure admitting a two-torus symmetry. Projecting the subspaces obtained to the orbit space yields a trivalent graph. We illustrate this result concretely on the homogeneous nearly Kähler examples.

preprint2020arXiv

On the synthesis of control policies from noisy example datasets: a probabilistic approach

In this note we consider the problem of synthesizing optimal control policies for a system from noisy datasets. We present a novel algorithm that takes as input the available dataset and, based on these inputs, computes an optimal policy for possibly stochastic and nonlinear systems that also satisfies actuation constraints. The algorithm relies on solid theoretical foundations, which have their key roots into a probabilistic interpretation of dynamical systems. The effectiveness of our approach is illustrated by considering an autonomous car use case. For such use case, we make use of our algorithm to synthesize a control policy from noisy data allowing the car to merge onto an intersection, while satisfying additional constraints on the variance of the car speed

preprint2020arXiv

Tutoring Reinforcement Learning via Feedback Control

We introduce a control-tutored reinforcement learning (CTRL) algorithm. The idea is to enhance tabular learning algorithms by means of a control strategy with limited knowledge of the system model. By tutoring the learning process, the learning rate can be substantially reduced. We use the classical problem of stabilizing an inverted pendulum as a benchmark to numerically illustrate the advantages and disadvantages of the approach.

preprint2019arXiv

Resilient consensus for multi-agent systems subject to differential privacy requirements

We consider multi-agent systems interacting over directed network topologies where a subset of agents is adversary/faulty and where the non-faulty agents have the goal of reaching consensus, while fulfilling a differential privacy requirement on their initial conditions. To address this problem, we develop an update law for the non-faulty agents. Specifically, we propose a modification of the so-called Mean-Subsequence-Reduced (MSR) algorithm, the Differentially Private MSR (DP-MSR) algorithm, and characterize three important properties of the algorithm: correctness, accuracy and differential privacy. We show that if the network topology is $(2f +1)$-robust, then the algorithm allows the non-faulty agents to reach consensus despite the presence of up to $f$ faulty agents and we characterize the accuracy of the algorithm. Furthermore, we also show in two important cases that our distributed algorithm can be tuned to guarantees differential privacy of the initial conditions and the differential privacy requirement is related to the maximum network degree. The results are illustrated via simulations.

preprint2017arXiv

Exploiting nodes symmetries to control synchronization and consensus patterns in multiagent systems

We present new conditions to obtain synchronization and consensus patterns in complex network systems. The key idea is to exploit symmetries of the nodes' vector fields to induce a desired synchronization/consensus pattern, where nodes are clustered in different groups each converging towards a different synchronized evolution. We show that the new conditions we present offer a systematic methodology to design a distributed network controller able to drive a network of interest towards a desired synchronization/consensus pattern.