Researcher profile

Anouck Girard

Anouck Girard contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
10 works
0 followers
8 topics
4 close collaborators

Published work

10 published items

Preprint, 2022, arXiv

Robust Action Governor for Uncertain Piecewise Affine Systems with Non-convex Constraints and Safe Reinforcement Learning

The action governor is an add-on scheme to a nominal control loop that monitors and adjusts the control actions to enforce safety specifications expressed as pointwise-in-time state and control constraints. In this paper, we introduce the Robust Action Governor (RAG) for systems whose dynamics can be represented by discrete-time Piecewise Affine (PWA) models with both parametric and additive uncertainties and that are subject to non-convex constraints. We develop the theoretical properties and computational approaches for the RAG. After that, we introduce the use of the RAG for realizing safe Reinforcement Learning (RL), i.e., ensuring all-time constraint satisfaction during the online RL exploration-and-exploitation process. This development enables safe real-time evolution of the control policy and adaptation to changes in the operating environment and system parameters (due to aging, damage, etc.). We illustrate the effectiveness of the RAG in constraint enforcement and in safe RL by applying it to a soft-landing problem for a mass-spring-damper system.

Preprint, 2021, arXiv

Beating humans in a penny-matching game by leveraging cognitive hierarchy theory and Bayesian learning

It is a long-standing goal of artificial intelligence (AI) to be superior to human beings in decision making. Games are suitable for testing AI capabilities of making good decisions in non-numerical tasks. In this paper, we develop a new AI algorithm to play the penny-matching game considered in Shannon's "mind-reading machine" (1953) against human players. In particular, we exploit cognitive hierarchy theory and Bayesian learning techniques to continually evolve a model for predicting human player decisions, and let the AI player make decisions according to the model predictions to pursue the best chance of winning. Experimental results show that our AI algorithm beats 27 out of 30 volunteer human players.
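As a rough illustration of the Bayesian-learning ingredient mentioned in this abstract, the sketch below keeps a Beta-Bernoulli style count of how often a player chooses heads given their previous move, and predicts the more likely next move. It is a deliberately simplified stand-in: it does not implement cognitive hierarchy theory or the paper's algorithm, and the class name `PennyPredictor`, its methods, and the choice of "previous move" as context are all assumptions made for this example.

```python
from collections import defaultdict


class PennyPredictor:
    """Hypothetical predictor of a player's next move (0 = tails, 1 = heads)."""

    def __init__(self, prior: float = 1.0) -> None:
        # counts[context] = [pseudo-count for tails, pseudo-count for heads];
        # the context is the player's previous move (None before the first move).
        self.counts = defaultdict(lambda: [prior, prior])
        self.last = None

    def prob_heads(self) -> float:
        """Posterior mean probability that the next move is heads."""
        tails, heads = self.counts[self.last]
        return heads / (heads + tails)

    def predict(self) -> int:
        """Most likely next move; a matching player would play this value."""
        return 1 if self.prob_heads() >= 0.5 else 0

    def update(self, move: int) -> None:
        """Record the observed move and advance the context."""
        self.counts[self.last][move] += 1.0
        self.last = move


# Example: a player who strictly alternates tails, heads, tails, heads, ...
predictor = PennyPredictor()
for observed in [0, 1, 0, 1, 0, 1]:
    predictor.update(observed)
next_guess = predictor.predict()
```

After the alternating sequence ends on heads, the counts for the "previous move was heads" context favor tails, so the predictor guesses tails next.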

Preprint, 2021, arXiv

Coordinated Receding-Horizon Control of Battery Electric Vehicle Speed and Gearshift Using Relaxed Mixed Integer Nonlinear Programming

In this paper, we propose an approach to coordinated receding-horizon control of vehicle speed and transmission gearshift for automated battery electric vehicles (BEVs) to achieve improved energy efficiency. The introduction of multi-speed transmissions in BEVs creates an opportunity to manipulate the operating point of electric motors under given vehicle speed and acceleration command, thus providing the potential to further improve the energy efficiency. However, co-optimization of vehicle speed and transmission gearshift leads to a mixed integer nonlinear program (MINLP), solving which can be computationally very challenging. In this paper, we propose a novel continuous relaxation technique to treat such MINLPs that makes it possible to compute solutions with conventional nonlinear programming solvers. After analyzing its theoretical properties, we use it to solve the optimization problem involved in coordinated receding-horizon control of BEV speed and gearshift. Through simulation studies, we show that co-optimizing vehicle speed and transmission gearshift can achieve considerably greater energy efficiency than optimizing them sequentially, and the proposed relaxation technique can reduce the online computational cost to a level that is comparable to the time available for real-time implementation.

Preprint, 2020, arXiv

A Game Theoretic Approach for Parking Spot Search with Limited Parking Lot Information

We propose a game theoretic approach to address the problem of searching for available parking spots in a parking lot and picking the "optimal" one to park in. The approach exploits limited information provided by the parking lot, i.e., its layout and the current number of cars in it. Considering the fact that such information is or can be easily made available for many structured parking lots, the proposed approach can be applicable without requiring major updates to existing parking facilities. For large parking lots, a sampling-based strategy is integrated with the proposed approach to overcome the associated computational challenge. The proposed approach is compared against a state-of-the-art heuristic-based parking spot search strategy in the literature through simulation studies and demonstrates its advantage in terms of achieving lower cost function values.

Preprint, 2020, arXiv

Action Governor for Discrete-Time Linear Systems with Non-Convex Constraints

This paper introduces an add-on, supervisory scheme, referred to as Action Governor (AG), for discrete-time linear systems to enforce exclusion-zone avoidance requirements. It does so by monitoring, and minimally modifying when necessary, the nominal control signal to a constraint-admissible one. The AG operates based on set-theoretic techniques and online optimization. This paper establishes its theoretical foundation, discusses its computational realization, and uses two simulation examples to illustrate its effectiveness.
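The "monitor and minimally modify" idea in this abstract can be sketched for a toy scalar system. The example below is only an illustration under strong assumptions: it checks safety one step ahead over a finite candidate grid, whereas the paper relies on set-theoretic techniques and online optimization. The function `action_governor`, the dynamics `step`, the exclusion zone in `is_safe`, and the candidate grid are all hypothetical.

```python
from typing import Callable, Iterable, Optional


def action_governor(
    x: float,
    u_nominal: float,
    step: Callable[[float, float], float],
    is_safe: Callable[[float], bool],
    candidates: Iterable[float],
) -> Optional[float]:
    """Return the nominal action if its successor state is safe; otherwise
    return the safe candidate closest to it, or None if no candidate is safe."""
    if is_safe(step(x, u_nominal)):
        return u_nominal
    safe = [u for u in candidates if is_safe(step(x, u))]
    if not safe:
        return None
    return min(safe, key=lambda u: abs(u - u_nominal))


def step(x: float, u: float) -> float:
    """Assumed toy dynamics: x_next = x + u."""
    return x + u


def is_safe(x: float) -> bool:
    """Assumed exclusion zone: the open interval (1, 2) must be avoided."""
    return not (1.0 < x < 2.0)


grid = [-1.0, -0.5, 0.0, 0.5, 1.0, 1.5, 2.0]

# The nominal action 1.0 would move the state from 0.4 to 1.4, inside the zone,
# so the governor substitutes the nearest admissible action from the grid.
u_applied = action_governor(0.4, 1.0, step, is_safe, grid)
```

A one-step check like this does not by itself guarantee that a safe action will exist at later steps; that recursive property is what a precomputed safe set would provide.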

Preprint, 2020, arXiv

Deep Reinforcement Learning with Enhanced Safety for Autonomous Highway Driving

In this paper, we present a safe deep reinforcement learning system for automated driving. The proposed framework leverages merits of both rule-based and learning-based approaches for safety assurance. Our safety system consists of two modules, namely handcrafted safety and dynamically-learned safety. The handcrafted safety module is a heuristic safety rule based on common driving practice that ensures a minimum relative gap to a traffic vehicle. On the other hand, the dynamically-learned safety module is a data-driven safety rule that learns safety patterns from driving data. Specifically, the dynamically-learned safety module incorporates a model lookahead beyond the immediate reward of reinforcement learning to predict safety longer into the future. If one of the future states leads to a near-miss or collision, then a negative reward is assigned to the reward function to avoid collision and accelerate the learning process. We demonstrate the capability of the proposed framework in a simulation environment with varying traffic density. Our results show the superior capabilities of the policy enhanced with the dynamically-learned safety module.
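The lookahead-and-penalize mechanism described in this abstract can be sketched as a small reward-shaping function. This is a minimal sketch under stated assumptions, not the paper's method: the paper learns its safety rule from driving data, while the example uses a hand-written constant-speed gap model. The name `shaped_reward` and all parameters (`closing_speeds`, `dt`, `min_gap`, `penalty`) are hypothetical.

```python
from typing import Sequence


def shaped_reward(
    reward: float,
    gap: float,
    closing_speeds: Sequence[float],
    dt: float,
    min_gap: float,
    penalty: float,
) -> float:
    """Subtract a penalty from the immediate reward if the predicted gap to the
    lead vehicle falls below min_gap at any step of the lookahead horizon."""
    predicted_gap = gap
    for closing_speed in closing_speeds:
        # Assumed model: the gap shrinks by closing_speed * dt at each step.
        predicted_gap -= closing_speed * dt
        if predicted_gap < min_gap:
            return reward - penalty
    return reward


# Closing at 2 m/s from a 10 m gap reaches 4 m on the third step (below 5 m).
risky = shaped_reward(1.0, 10.0, [2.0, 2.0, 2.0], 1.0, 5.0, 10.0)
# Closing at 1 m/s leaves a 7 m gap after three steps, so no penalty applies.
safe = shaped_reward(1.0, 10.0, [1.0, 1.0, 1.0], 1.0, 5.0, 10.0)
```

The point of looking several steps ahead is that the agent receives the negative signal before the near-miss actually occurs, rather than only at the moment of collision.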

Preprint, 2020, arXiv

Game-theoretic Modeling of Traffic in Unsignalized Intersection Network for Autonomous Vehicle Control Verification and Validation

For the foreseeable future, autonomous vehicles (AVs) will operate in traffic together with human-driven vehicles. Their planning and control systems need extensive testing, including early-stage testing in simulations where the interactions among autonomous/human-driven vehicles are represented. Motivated by the need for such simulation tools, we propose a game-theoretic approach to modeling vehicle interactions, in particular, for urban traffic environments with unsignalized intersections. We develop traffic models with heterogeneous (in terms of their driving styles) and interactive vehicles based on our proposed approach, and use them for virtual testing, evaluation, and calibration of AV control systems. For illustration, we consider two AV control approaches, analyze their characteristics and performance based on the simulation results with our developed traffic models, and optimize the parameters of one of them.

Preprint, 2020, arXiv

Suboptimal Nonlinear Model Predictive Control Strategies for Tracking Near Rectilinear Halo Orbits

Near Rectilinear Halo Orbits (NRHOs), a subclass of halo orbits around the L1 and L2 Lagrange points, are promising candidates for future lunar gateways in cis-lunar space and as staging orbits for lunar missions. Closed-loop control is beneficial to compensate for orbital perturbations and potential instabilities while maintaining a spacecraft on NRHOs and performing relative motion maneuvers. This paper investigates the use of nonlinear model predictive control (NMPC) coupled with low-thrust actuators for station-keeping on NRHOs. It is demonstrated through numerical simulations that NMPC is able to stabilize a spacecraft to a reference orbit and handle control constraints. Further, it is shown that the computational burden of NMPC can be managed using specialized optimization routines and suboptimal approaches without jeopardizing closed-loop performance.

Preprint, 2020, arXiv

Vision-Based Autonomous Driving: A Model Learning Approach

We present an integrated approach for perception and control for an autonomous vehicle and demonstrate this approach in a high-fidelity urban driving simulator. Our approach first builds a model for the environment, then trains a policy exploiting the learned model to identify the action to take at each time-step. To build a model for the environment, we leverage several deep learning algorithms. To that end, first we train a variational autoencoder to encode the input image into an abstract latent representation. We then utilize a recurrent neural network to predict the latent representation of the next frame and handle temporal information. Finally, we utilize an evolutionary-based reinforcement learning algorithm to train a controller based on these latent representations to identify the action to take. We evaluate our approach in CARLA, a high-fidelity urban driving simulator, and conduct an extensive generalization study. Our results demonstrate that our approach outperforms several previously reported approaches in terms of the percentage of successfully completed episodes for a lane keeping task.

Preprint, 2019, arXiv

A Novel Approach for Optimal Trajectory Design with Multiple Operation Modes of Propulsion System, Part 2

Equipping a spacecraft with multiple solar-powered electric engines (of the same or different types) compounds the task of optimal trajectory design due to the presence of both real-valued inputs (power input to each engine in addition to the direction of thrust vector) and discrete variables (number of active engines). Each engine can be switched on/off independently and the "optimal" operating power of each engine depends on the available solar power, which depends on the distance from the Sun. Application of the Composite Smooth Control (CSC) framework to a heliocentric fuel-optimal trajectory optimization from the Earth to the comet 67P/Churyumov-Gerasimenko is demonstrated, which presents a new approach to deal with multiple-engine problems. Operation of engine clusters with 4, 6, 10 and even 20 engines of the same type can be optimized. Moreover, engine clusters with different/mixed electric engines are considered with either 2, 3 or 4 different types of engines. Remarkably, the CSC framework allows us 1) to reduce the original multi-point boundary-value problem to a two-point boundary-value problem (TPBVP), and 2) to solve the resulting TPBVPs using a single-shooting solution scheme and with a random initialization of the missing costates. While the approach we present is a continuous neighbor of the discontinuous extremals, we show that the discontinuous necessary conditions are satisfied in the asymptotic limit. We believe this is the first indirect method to accommodate a multi-mode control of this level of complexity with realistic engine performance curves. The results are interesting and promising for dealing with a large family of such challenging multi-mode optimal control problems.