Researcher profile

Ilya Kolmanovsky

Ilya Kolmanovsky contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
29 works
0 followers
8 topics
4 close collaborators

Published work

29 published item(s)

preprint | 2024 | arXiv

Less Conservative Robust Reference Governors and Their Applications

The application of reference governors to systems with unmeasured set-bounded disturbances can lead to conservative solutions. This conservatism can be reduced by estimating the disturbance from output measurements and canceling it in the nominal control law. In this paper, a reference governor based on such an approach is considered, and time-varying sets bounding the disturbance and state estimation errors are derived. Consequently, the traditional implementation of a reference governor, which exploits a constraint-admissible positively invariant set of constant commands and initial states, is replaced by one that utilizes a time-dependent sequence of similar sets (which are not necessarily nested). Examples are reported, including two applications to longitudinal control of aircraft that illustrate the handling of elevator uncertainty and wing icing.
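To make the basic reference governor idea concrete, here is a minimal sketch of a nominal, disturbance-free scalar governor. It is not the robust, estimator-based scheme of the paper: the plant coefficients `A` and `B`, the bound `X_MAX`, and the function names are all assumptions chosen for illustration. At each step the governor moves the applied command from its previous value toward the desired reference only as far as the predicted state trajectory stays within the constraint.

```python
# Hypothetical scalar plant x[k+1] = A*x[k] + B*v[k] with constraint x <= X_MAX.
A, B, X_MAX = 0.5, 0.5, 1.0

def worst_case_state(x, v, horizon=50):
    """Largest state reached if the command v is held constant from state x."""
    worst = x
    for _ in range(horizon):
        x = A * x + B * v
        worst = max(worst, x)
    # Include the steady state B/(1-A)*v so the check also covers
    # times beyond the finite prediction horizon.
    return max(worst, B / (1.0 - A) * v)

def governor_step(x, v_prev, r, grid=100):
    """Move the command from v_prev toward r as far as the prediction stays admissible."""
    best = v_prev  # kappa = 0 keeps the previous (assumed admissible) command
    for i in range(1, grid + 1):
        v = v_prev + (i / grid) * (r - v_prev)
        if worst_case_state(x, v) <= X_MAX:
            best = v
    return best

def simulate(r=2.0, steps=100):
    """Run the governed loop from rest; return the final command and the peak state."""
    x, v, peak = 0.0, 0.0, 0.0
    for _ in range(steps):
        v = governor_step(x, v, r)
        x = A * x + B * v
        peak = max(peak, x)
    return v, peak
```

In this toy setup, applying the reference `r = 2.0` directly would drive the state toward 2 and violate the bound, whereas the governed command stops at the largest admissible value. A line search over a grid is used here only for readability; bisection or a closed-form expression would be the usual choice for linear systems.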

preprint | 2022 | arXiv

A Computationally Governed Log-domain Interior-point Method for Model Predictive Control

This paper introduces a computationally efficient approach for solving Model Predictive Control (MPC) reference tracking problems with state and control constraints. The approach consists of three key components: first, a log-domain interior-point quadratic programming method that forms the basis of the overall approach; second, a method of warm-starting this optimizer by using the MPC solution from the previous timestep; and third, a computational governor that bounds the suboptimality of the warm-start by altering the reference command provided to the MPC problem. As a result, the closed-loop system is altered in a manner so that MPC solutions can be computed using fewer optimizer iterations per timestep. In a numerical experiment, the computational governor reduces the worst-case computation time of a standard MPC implementation by 90%, while maintaining good closed-loop performance.

preprint | 2022 | arXiv

Benefits of Feedforward for Model Predictive Airpath Control of Diesel Engines

This paper investigates options to complement a diesel engine airpath feedback controller with a feedforward. The control objective is to track the intake manifold pressure and exhaust gas recirculation (EGR) rate targets by manipulating the EGR valve and variable geometry turbine (VGT) while satisfying state and input constraints. The feedback controller is based on rate-based Model Predictive Control (MPC) that provides integral action for tracking. Two options for the feedforward are considered: one based on a look-up table that specifies the feedforward as a function of engine speed and fuel injection rate, and another based on a (non-rate-based) MPC that generates dynamic feedforward trajectories. The controllers are designed and verified using a high-fidelity engine model in GT-Power and exploit a low-order rate-based linear parameter-varying (LPV) model for prediction, which is identified from transient response data generated by the GT-Power model. It is shown that the combination of feedforward and feedback MPC has the potential to improve the performance and robustness of the control design. In particular, the feedback MPC without feedforward can lose stability at low engine speeds, while MPC-based feedforward results in the best transient response. Mechanisms by which feedforward is able to assist in stabilization and improve performance are discussed.

preprint | 2022 | arXiv

Development of a Model Predictive Airpath Controller for a Diesel Engine on a High-Fidelity Engine Model with Transient Thermal Dynamics

This paper presents the results of a model predictive controller (MPC) development for diesel engine air-path regulation. The control objective is to track the intake manifold pressure and exhaust gas recirculation (EGR) rate targets by manipulating the EGR valve and variable geometry turbine (VGT) while satisfying state and control constraints. The MPC controller is designed and verified using a high-fidelity engine model in GT-Power. The controller exploits a low-order rate-based linear parameter-varying (LPV) model for prediction, which is identified from transient response data generated by the GT-Power model. It is shown that transient engine thermal dynamics influence the airpath dynamics, specifically the intake manifold pressure response; however, MPC demonstrates robustness against inaccuracies in modeling these thermal dynamics. In particular, we show that MPC can be successfully implemented using a rate-based prediction model with two inputs (EGR and VGT positions) identified from data with steady-state wall temperature dynamics; however, closed-loop performance can be improved if a prediction model (i) is identified from data with transient thermal dynamics, and (ii) has the fuel injection rate as an extra model input. Further, the MPC calibration process across the engine operating range to achieve improved performance is addressed. As the MPC calibration is shown to be sensitive to the operating conditions, a fast calibration process is proposed.

preprint | 2022 | arXiv

Implementing Optimization-Based Control Tasks in Cyber-Physical Systems With Limited Computing Capacity

A common aspect of today's cyber-physical systems is that multiple optimization-based control tasks may execute in a shared processor. Such control tasks make use of online optimization and thus have large execution times; hence, their sampling periods must be large as well to satisfy real-time schedulability conditions. However, larger sampling periods may cause worse control performance. The goal of our work is to develop a robust-to-early-termination optimization approach that can be used to effectively solve onboard optimization problems involved in controlling the system despite the presence of unpredictable, variable, and limited computing capacity. The significance of the developed approach is that the optimization iterations can be stopped at any time instant with a guaranteed feasible solution; as a result, optimization-based control tasks can be implemented with a small sampling period (and consequently with a minimum degradation in the control performance).

preprint | 2022 | arXiv

Integrated Adaptive Control and Reference Governors for Constrained Systems with State-Dependent Uncertainties

This paper presents an adaptive reference governor (RG) framework for a linear system with matched nonlinear uncertainties that can depend on both time and states, subject to both state and input constraints. The proposed framework leverages an L1 adaptive controller (L1AC) that estimates and compensates for the uncertainties, and provides guaranteed transient performance, in terms of uniform bounds on the error between actual states and inputs and those of a nominal (i.e., uncertainty-free) system. The uniform performance bounds provided by the L1AC are used to tighten the pre-specified state and control constraints. A reference governor is then designed for the nominal system using the tightened constraints, and guarantees robust constraint satisfaction. Moreover, the conservatism introduced by the constraint tightening can be systematically reduced by tuning some parameters within the L1AC. Compared with existing solutions, the proposed adaptive RG framework can potentially yield less conservative results for constraint enforcement due to the removal of uncertainty propagation along a prediction horizon, and improved tracking performance due to the inherent uncertainty compensation mechanism. Simulation results for a flight control example illustrate the efficacy of the proposed framework.

preprint | 2022 | arXiv

MPC-Based Emergency Vehicle-Centered Multi-Intersection Traffic Control

This paper proposes a traffic control scheme to alleviate traffic congestion in a network of interconnected signaled lanes/roads. The proposed scheme is emergency vehicle-centered, meaning that it provides an efficient and timely routing for emergency vehicles. In the proposed scheme, model predictive control is utilized to control inlet traffic flows by means of network gates, as well as the configuration of traffic lights across the network. Two schemes are considered in this paper: i) centralized; and ii) decentralized. In the centralized scheme, a central unit controls the entire network. This scheme provides the optimal solution, even though it might not fulfil real-time computation requirements for large networks. In the decentralized scheme, each intersection has its own control unit, which sends local information to an aggregator. The main responsibility of this aggregator is to receive local information from all control units across the network as well as the emergency vehicle, to augment the received information, and to share it with the control units. Since the decision-making in the decentralized scheme is local and the aggregator should fulfil the above-mentioned tasks during a traffic cycle which takes a long period of time, the decentralized scheme is suitable for large networks, even though it may provide a sub-optimal solution. Extensive simulation studies are carried out to validate the proposed schemes and assess their performance. Notably, the obtained results reveal that traveling times of emergency vehicles can be reduced by up to ~50% using the centralized scheme and by up to ~30% using the decentralized scheme, without causing congestion in other lanes.

preprint | 2022 | arXiv

Robust Action Governor for Uncertain Piecewise Affine Systems with Non-convex Constraints and Safe Reinforcement Learning

The action governor is an add-on scheme to a nominal control loop that monitors and adjusts the control actions to enforce safety specifications expressed as pointwise-in-time state and control constraints. In this paper, we introduce the Robust Action Governor (RAG) for systems whose dynamics can be represented using discrete-time Piecewise Affine (PWA) models with both parametric and additive uncertainties and subject to non-convex constraints. We develop the theoretical properties and computational approaches for the RAG. After that, we introduce the use of the RAG for realizing safe Reinforcement Learning (RL), i.e., ensuring all-time constraint satisfaction during the online RL exploration-and-exploitation process. This development enables safe real-time evolution of the control policy and adaptation to changes in the operating environment and system parameters (due to aging, damage, etc.). We illustrate the effectiveness of the RAG in constraint enforcement and safe RL using the RAG by considering their applications to a soft-landing problem of a mass-spring-damper system.

preprint | 2022 | arXiv

ROTEC: Robust to Early Termination Command Governor for Systems with Limited Computing Capacity

A Command Governor (CG) is an optimization-based add-on scheme to a nominal closed-loop system. It is used to enforce state and control constraints by modifying reference commands. This paper considers the implementation of a CG on embedded processors that have limited computing resources and must execute multiple control and diagnostics functions; consequently, the time available for CG computations is limited and may vary over time. To address this issue, a robust-to-early-termination command governor is developed which embeds the solution of a CG problem into the internal states of a virtual continuous-time dynamical system which runs in parallel to the process. This virtual system is built so that its trajectory converges to the optimal solution (with a tunable convergence rate), and provides a sub-optimal but feasible solution whenever its evolution is terminated. This allows the designer to implement a CG strategy with a small sampling period (and consequently with a minimum degradation in its performance), while maintaining its constraint-handling capabilities. Simulations are carried out to assess the effectiveness of the developed scheme in satisfying performance requirements and real-time schedulability conditions for a practical vehicle rollover example.

preprint | 2021 | arXiv

An Analysis of Closed-Loop Stability for Linear Model Predictive Control Based on Time-Distributed Optimization

Time-distributed Optimization (TDO) is an approach for reducing the computational burden of Model Predictive Control (MPC). When using TDO, optimization iterations are distributed over time by maintaining a running solution estimate and updating it at each sampling instant. In this paper, TDO applied to input-constrained linear MPC is studied in detail, and analytic expressions for the system gains and a bound on the number of optimization iterations per sampling instant required to guarantee closed-loop stability are derived. Further, it is shown that the closed-loop stability of TDO-based MPC can be guaranteed using multiple mechanisms, including increasing the number of solver iterations, preconditioning the optimal control problem, adjusting the MPC cost matrices, and reducing the length of the receding horizon. These results in a linear system setting also provide insights and guidelines that could be more broadly applicable, e.g., to nonlinear MPC.
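The mechanism of keeping a running solution estimate and refining it with a fixed iteration budget per sample can be illustrated with a small sketch. Everything below is an assumption made for illustration (a scalar plant, a one-step prediction horizon, a projected-gradient solver, and the particular numbers); it is not the setting or the stability bound analyzed in the paper.

```python
A, B = 1.2, 1.0          # hypothetical unstable scalar plant x[k+1] = A*x + B*u
Q, R = 1.0, 0.1          # weights of a one-step-horizon MPC cost
U_MAX = 1.0              # input constraint |u| <= U_MAX

def clip(u):
    """Project u onto the admissible input interval."""
    return max(-U_MAX, min(U_MAX, u))

def tdo_update(x, u, iterations, step=0.2):
    """Run a fixed number of projected-gradient iterations on
    J(u) = Q*(A*x + B*u)**2 + R*u**2, warm-started at the running estimate u."""
    for _ in range(iterations):
        grad = 2.0 * Q * B * (A * x + B * u) + 2.0 * R * u
        u = clip(u - step * grad)
    return u

def simulate(x0=0.5, iterations=1, steps=60):
    """Closed loop in which the solver is never run to convergence."""
    x, u = x0, 0.0
    for _ in range(steps):
        u = tdo_update(x, u, iterations)  # estimate is carried over between samples
        x = A * x + B * u
    return x
```

With these illustrative numbers, `simulate(iterations=1)` still drives the state toward the origin even though the optimizer takes a single step per sample; raising `iterations` brings the applied input closer to the exact MPC solution at each instant.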

preprint | 2021 | arXiv

Beating humans in a penny-matching game by leveraging cognitive hierarchy theory and Bayesian learning

It is a long-standing goal of artificial intelligence (AI) to be superior to human beings in decision making. Games are suitable for testing AI capabilities of making good decisions in non-numerical tasks. In this paper, we develop a new AI algorithm to play the penny-matching game considered in Shannon's "mind-reading machine" (1953) against human players. In particular, we exploit cognitive hierarchy theory and Bayesian learning techniques to continually evolve a model for predicting human player decisions, and let the AI player make decisions according to the model predictions to pursue the best chance of winning. Experimental results show that our AI algorithm beats 27 out of 30 volunteer human players.

preprint | 2021 | arXiv

Coordinated Receding-Horizon Control of Battery Electric Vehicle Speed and Gearshift Using Relaxed Mixed Integer Nonlinear Programming

In this paper, we propose an approach to coordinated receding-horizon control of vehicle speed and transmission gearshift for automated battery electric vehicles (BEVs) to achieve improved energy efficiency. The introduction of multi-speed transmissions in BEVs creates an opportunity to manipulate the operating point of electric motors under given vehicle speed and acceleration command, thus providing the potential to further improve the energy efficiency. However, co-optimization of vehicle speed and transmission gearshift leads to a mixed integer nonlinear program (MINLP), solving which can be computationally very challenging. In this paper, we propose a novel continuous relaxation technique to treat such MINLPs that makes it possible to compute solutions with conventional nonlinear programming solvers. After analyzing its theoretical properties, we use it to solve the optimization problem involved in coordinated receding-horizon control of BEV speed and gearshift. Through simulation studies, we show that co-optimizing vehicle speed and transmission gearshift can achieve considerably greater energy efficiency than optimizing them sequentially, and the proposed relaxation technique can reduce the online computational cost to a level that is comparable to the time available for real-time implementation.

preprint | 2021 | arXiv

Experimental Validation of Eco-Driving and Eco-Heating Strategies for Connected and Automated HEVs

This paper presents experimental results that validate eco-driving and eco-heating strategies developed for connected and automated vehicles (CAVs). By exploiting vehicle-to-infrastructure (V2I) communications, traffic signal timing, and queue length estimations, optimized and smoothed speed profiles for the ego-vehicle are generated to reduce energy consumption. Next, the planned eco-trajectories are incorporated into a real-time predictive optimization framework that coordinates the cabin thermal load (in cold weather) with the speed preview, i.e., eco-heating. To enable eco-heating, the engine coolant (as the only heat source for cabin heating) and the cabin air are leveraged as two thermal energy storages. Our eco-heating strategy stores thermal energy in the engine coolant and cabin air while the vehicle is driving at high speeds, and releases the stored energy slowly during vehicle stops for cabin heating without forcing the engine to idle to provide the heating source. To test and validate these solutions, a power-split hybrid electric vehicle (HEV) has been instrumented for cabin thermal management, allowing heating, ventilation, and air conditioning (HVAC) system inputs (cabin temperature setpoint and blower flow rate) to be regulated in real time. Experiments were conducted to demonstrate the energy-saving benefits of eco-driving and eco-heating strategies over real-world city driving cycles at different cold ambient temperatures. The data confirmed average fuel savings of 14.5% and 4.7% achieved by eco-driving and eco-heating, respectively, offering a combined energy saving of more than 19% compared to the baseline vehicle driven by a human driver with a constant-heating strategy.

preprint | 2020 | arXiv

A constraint-separation principle in model predictive control

In this brief, we consider the constrained optimization problem underpinning model predictive control (MPC). We show that this problem can be decomposed into an unconstrained optimization problem with the same cost function as the original problem and a constrained optimization problem with a modified cost function and dynamics that have been precompensated according to the solution of the unconstrained problem. In the case of linear systems subject to a quadratic cost, the unconstrained problem has the familiar LQR solution and the constrained problem reduces to a minimum-norm projection. This implies that solving linear MPC problems is equivalent to precompensating a system using LQR and applying MPC to penalize only the control input. We propose to call this a constraint-separation principle and discuss the utility of both constraint separation and general decomposition in the design of MPC schemes and the development of numerical solvers for MPC problems.

preprint | 2020 | arXiv

A Game Theoretic Approach for Parking Spot Search with Limited Parking Lot Information

We propose a game theoretic approach to address the problem of searching for available parking spots in a parking lot and picking the "optimal" one to park. The approach exploits limited information provided by the parking lot, i.e., its layout and the current number of cars in it. Considering the fact that such information is or can be easily made available for many structured parking lots, the proposed approach can be applicable without requiring major updates to existing parking facilities. For large parking lots, a sampling-based strategy is integrated with the proposed approach to overcome the associated computational challenge. The proposed approach is compared against a state-of-the-art heuristic-based parking spot search strategy in the literature through simulation studies and demonstrates its advantage in terms of achieving lower cost function values.

preprint | 2020 | arXiv

A Sum-of-Squares-Based Procedure to Approximate the Pontryagin Difference of Semialgebraic Sets

The P-difference between two sets $\mathcal{A}$ and $\mathcal{B}$ is the set of all points, $\mathcal{C}$, such that the addition of $\mathcal{B}$ to any of the points in $\mathcal{C}$ is contained in $\mathcal{A}$. Such a set difference plays an important role in robust model predictive control and in set-theoretic control. In this paper, we demonstrate that an inner approximation of the P-difference between two semialgebraic sets can be computed using sum-of-squares programming, and we illustrate the procedure using several computational examples.
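For reference, the verbal definition above corresponds to the standard set-builder form of the Pontryagin (P-) difference. The interval example that follows is an illustration added here, not one taken from the paper.

$$
\mathcal{A} \ominus \mathcal{B} = \{\, c : c + b \in \mathcal{A} \ \text{for all } b \in \mathcal{B} \,\}
$$

For instance, with the intervals $\mathcal{A} = [-3, 3]$ and $\mathcal{B} = [-1, 1]$, the condition $c + b \in \mathcal{A}$ for every $b \in \mathcal{B}$ requires $c + 1 \le 3$ and $c - 1 \ge -3$, so $\mathcal{A} \ominus \mathcal{B} = [-2, 2]$.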

preprint | 2020 | arXiv

Action Governor for Discrete-Time Linear Systems with Non-Convex Constraints

This paper introduces an add-on, supervisory scheme, referred to as Action Governor (AG), for discrete-time linear systems to enforce exclusion-zone avoidance requirements. It does so by monitoring, and minimally modifying when necessary, the nominal control signal to a constraint-admissible one. The AG operates based on set-theoretic techniques and online optimization. This paper establishes its theoretical foundation, discusses its computational realization, and uses two simulation examples to illustrate its effectiveness.

preprint | 2020 | arXiv

Deep Reinforcement Learning with Enhanced Safety for Autonomous Highway Driving

In this paper, we present a safe deep reinforcement learning system for automated driving. The proposed framework leverages the merits of both rule-based and learning-based approaches for safety assurance. Our safety system consists of two modules, namely handcrafted safety and dynamically-learned safety. The handcrafted safety module is a heuristic safety rule based on common driving practice that ensures a minimum relative gap to a traffic vehicle. On the other hand, the dynamically-learned safety module is a data-driven safety rule that learns safety patterns from driving data. Specifically, the dynamically-learned safety module incorporates a model lookahead beyond the immediate reward of reinforcement learning to predict safety longer into the future. If one of the future states leads to a near-miss or collision, then a negative reward will be assigned to the reward function to avoid collision and accelerate the learning process. We demonstrate the capability of the proposed framework in a simulation environment with varying traffic density. Our results show the superior capabilities of the policy enhanced with the dynamically-learned safety module.

preprint | 2020 | arXiv

Elimination of Redundant Polynomial Constraints and Its Use in Constrained Control

The reduction of constraints to obtain minimal representations of sets is a very common problem in many engineering applications. While well-established methodologies exist for the case of linear constraints, the problem of how to detect redundant non-linear constraints is an open problem. In this paper we present a novel methodology based on Sum of Squares for the elimination of redundant polynomial constraints. The paper also presents some relevant applications of the presented method to constrained control problems. In particular, we show how the proposed method can be used in the Model Predictive Control and in the Reference Governor frameworks to reduce the computational burden of the online algorithms. Furthermore, this method can also be used to eliminate the terminal constraints in MPC in a simple way that is independent from the cost function.

preprint | 2020 | arXiv

Engine and Aftertreatment Co-Optimization of Connected HEVs via Multi-Range Vehicle Speed Planning and Prediction

Connected vehicles (CVs) have situational awareness that can be exploited for control and optimization of the powertrain system. While extensive studies have been carried out for energy efficiency improvement of CVs via eco-driving and planning, the implication of such technologies on the thermal responses of CVs has not been fully investigated. One of the key challenges in leveraging connectivity for optimization-based thermal management of CVs is the relatively slow thermal dynamics, which necessitate the use of a long prediction horizon to achieve the best performance. Long-term prediction of the CV speed, unlike the V2V/V2I-based short-range prediction, is difficult and error-prone. The multiple timescales inherent to power and thermal systems call for a variable timescale optimization framework with access to short- and long-term vehicle speed preview. To this end, a model predictive controller (MPC) with a multi-range speed preview for integrated power and thermal management (iPTM) of connected hybrid electric vehicles (HEVs) is presented in this paper. The MPC is formulated to manage the power-split between the engine and the battery while enforcing the power and thermal (engine coolant and catalytic converter temperatures) constraints. The MPC exploits prediction and optimization over a shorter receding horizon and longer shrinking horizon. Over the longer shrinking horizon, the vehicle speed estimation is based on the data collected from the connected vehicles traveling on the same route as the ego-vehicle. Simulation results of applying the MPC over real-world urban driving cycles in Ann Arbor, MI are presented to demonstrate the effectiveness and fuel-saving potentials of the proposed iPTM strategy under the uncertainty associated with long-term predictions of the CV's speed.

preprint | 2020 | arXiv

Game-theoretic Modeling of Traffic in Unsignalized Intersection Network for Autonomous Vehicle Control Verification and Validation

For the foreseeable future, autonomous vehicles (AVs) will operate in traffic together with human-driven vehicles. Their planning and control systems need extensive testing, including early-stage testing in simulations where the interactions among autonomous/human-driven vehicles are represented. Motivated by the need for such simulation tools, we propose a game-theoretic approach to modeling vehicle interactions, in particular, for urban traffic environments with unsignalized intersections. We develop traffic models with heterogeneous (in terms of their driving styles) and interactive vehicles based on our proposed approach, and use them for virtual testing, evaluation, and calibration of AV control systems. For illustration, we consider two AV control approaches, analyze their characteristics and performance based on the simulation results with our developed traffic models, and optimize the parameters of one of them.

preprint | 2020 | arXiv

Integrated Power and Thermal Management of Connected HEVs via Multi-Horizon MPC

In this paper, a multi-horizon model predictive controller (MH-MPC) is developed for integrated power and thermal management (iPTM) of a power-split hybrid electric vehicle (HEV). The proposed MH-MPC leverages an accurate short-horizon vehicle speed preview and an approximate forecast over a longer shrinking horizon till the end of the driving cycle. This multiple-horizon scheme is developed to cope with fast and slow dynamics associated with power and thermal responses. The main objective of the proposed MH-MPC is to minimize fuel consumption and enforce the power and thermal constraints on the battery state-of-charge and engine coolant temperature, while meeting the driving (traction) and cabin air conditioning (heating) demands. The proposed MH-MPC allows for exploiting the engine coolant as thermal energy storage, providing more flexibility for the HEV energy flow optimization. The simulation results show that the proposed MH-MPC provides near-optimal results in reference to the Dynamic Programming (DP) solution with an affordable computational cost. Moreover, compared with a more conventional MPC strategy, the MH-MPC can leverage the speed previews with different resolutions effectively to achieve the desired performance with satisfactory robustness.

preprint | 2020 | arXiv

Scalable Vehicle Team Continuum Deformation Coordination with Eigen Decomposition

The continuum deformation leader-follower cooperative control strategy models vehicles in a multi-agent system as particles of a deformable body. A desired continuum deformation is defined based on leaders' trajectories and acquired by followers in real-time through local communication. The existing continuum deformation theory requires followers to be placed inside the convex simplex defined by leaders. This constraint is relaxed in this paper. We prove that under suitable assumptions any $n+1$ ($n = 1, 2, 3$) vehicles forming an $n$-D simplex can be selected as leaders while followers, arbitrarily positioned inside or outside the leading simplex, can acquire a desired continuum deformation in a decentralized fashion. The paper's second contribution is to assign a one-to-one mapping between leaders' smooth trajectories and homogeneous deformation features obtained by continuum deformation eigen-decomposition. Therefore, a safe and smooth continuum deformation coordination can be planned either by shaping homogeneous transformation features or by choosing appropriate leader trajectories. This is beneficial to efficiently plan and guarantee inter-agent collision avoidance in a large-scale group. A simulation case study is reported in which a virtual convex simplex contains a quadcopter vehicle team at any time $t$; A* search is applied to optimize quadcopter team continuum deformation in an obstacle-laden environment.

preprint | 2020 | arXiv

Suboptimal Nonlinear Model Predictive Control Strategies for Tracking Near Rectilinear Halo Orbits

Near Rectilinear Halo Orbits (NRHOs), a subclass of halo orbits around the L1 and L2 Lagrange points, are promising candidates for future lunar gateways in cis-lunar space and as staging orbits for lunar missions. Closed-loop control is beneficial to compensate for orbital perturbations and potential instabilities while maintaining spacecraft on NRHOs and performing relative motion maneuvers. This paper investigates the use of nonlinear model predictive control (NMPC) coupled with low-thrust actuators for station-keeping on NRHOs. It is demonstrated through numerical simulations that NMPC is able to stabilize a spacecraft to a reference orbit and handle control constraints. Further, it is shown that the computational burden of NMPC can be managed using specialized optimization routines and suboptimal approaches without jeopardizing closed-loop performance.

preprint | 2020 | arXiv

Vision-Based Autonomous Driving: A Model Learning Approach

We present an integrated approach for perception and control for an autonomous vehicle and demonstrate this approach in a high-fidelity urban driving simulator. Our approach first builds a model for the environment, then trains a policy exploiting the learned model to identify the action to take at each time-step. To build a model for the environment, we leverage several deep learning algorithms. To that end, first we train a variational autoencoder to encode the input image into an abstract latent representation. We then utilize a recurrent neural network to predict the latent representation of the next frame and handle temporal information. Finally, we utilize an evolutionary-based reinforcement learning algorithm to train a controller based on these latent representations to identify the action to take. We evaluate our approach in CARLA, a high-fidelity urban driving simulator, and conduct an extensive generalization study. Our results demonstrate that our approach outperforms several previously reported approaches in terms of the percentage of successfully completed episodes for a lane keeping task.

preprint2019arXiv

A Novel Approach for Optimal Trajectory Design with Multiple Operation Modes of Propulsion System, Part 2

Equipping a spacecraft with multiple solar-powered electric engines (of the same or different types) compounds the task of optimal trajectory design due to the presence of both real-valued inputs (power input to each engine in addition to the direction of the thrust vector) and discrete variables (number of active engines). Each engine can be switched on/off independently, and the "optimal" operating power of each engine depends on the available solar power, which depends on the distance from the Sun. Application of the Composite Smooth Control (CSC) framework to a heliocentric fuel-optimal trajectory optimization from the Earth to the comet 67P/Churyumov-Gerasimenko is demonstrated, which presents a new approach to deal with multiple-engine problems. Operation of engine clusters with 4, 6, 10 and even 20 engines of the same type can be optimized. Moreover, engine clusters with different/mixed electric engines are considered with either 2, 3 or 4 different types of engines. Remarkably, the CSC framework allows us 1) to reduce the original multi-point boundary-value problem to a two-point boundary-value problem (TPBVP), and 2) to solve the resulting TPBVPs using a single-shooting solution scheme and with a random initialization of the missing costates. While the approach we present is a continuous neighbor of the discontinuous extremals, we show that the discontinuous necessary conditions are satisfied in the asymptotic limit. We believe this is the first indirect method to accommodate a multi-mode control of this level of complexity with realistic engine performance curves. The results are interesting and promising for dealing with a large family of such challenging multi-mode optimal control problems.

preprint2019arXiv

FBstab: A Stabilized Semismooth Quadratic Programming Algorithm with Applications in Model Predictive Control

This paper introduces the proximally stabilized Fischer-Burmeister method (FBstab), a new algorithm for convex quadratic programming that synergistically combines the proximal point algorithm with a primal-dual semismooth Newton-type method. FBstab is numerically robust, easy to warmstart, handles degenerate primal-dual solutions, detects infeasibility/unboundedness and requires only that the Hessian matrix be positive semidefinite. We outline the algorithm, provide convergence and convergence rate proofs, report some numerical results from model predictive control benchmarks, and also include experimental results. We show that FBstab is competitive with, and often superior to, state-of-the-art methods, has attractive scaling properties, and is especially promising for model predictive control applications.
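
For readers unfamiliar with the name, the Fischer-Burmeister function is a standard device for turning complementarity conditions (a >= 0, b >= 0, a*b = 0) into a single equation that a semismooth Newton-type method can attack. The sketch below shows only that function and its zero set; it is not the FBstab algorithm, and the helper names and tolerance are illustrative assumptions. Some references define the function with the opposite sign, which leaves the zero set unchanged.

```python
import math


def fischer_burmeister(a: float, b: float) -> float:
    """Fischer-Burmeister function: zero exactly when a >= 0, b >= 0, a*b = 0."""
    # math.hypot(a, b) computes sqrt(a*a + b*b) without intermediate overflow.
    return a + b - math.hypot(a, b)


def is_complementary(a: float, b: float, tol: float = 1e-9) -> bool:
    """Hypothetical helper: test complementarity through the FB residual."""
    return abs(fischer_burmeister(a, b)) <= tol


if __name__ == "__main__":
    # A multiplier/slack pair with one entry at zero satisfies complementarity.
    print(is_complementary(0.0, 3.0))
    # Both strictly positive: the product is nonzero, so the residual is nonzero.
    print(is_complementary(1.0, 1.0))
    # A negative entry violates the sign condition.
    print(is_complementary(-1.0, 0.0))
```

In a quadratic programming setting, a and b would typically be a constraint slack and its multiplier, so driving the residual to zero enforces the complementarity part of the optimality conditions.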

preprint2019arXiv

Model Reference Adaptive Control Allocation for Constrained Systems with Guaranteed Closed Loop Stability

This paper proposes an adaptive control allocation approach for uncertain over-actuated systems with actuator saturation. The proposed method does not require uncertainty estimation or a persistent excitation assumption. Using the element-wise non-symmetric projection algorithm, the adaptive parameters are restricted to satisfy certain optimality conditions leading to overall closed loop system stability. Furthermore, a sliding mode controller with a time-varying sliding surface, working in tandem with the adaptive control allocation, is proposed to guarantee the outer loop stability and reference tracking in the presence of control allocation errors and disturbances. Simulation results are provided, where the Aerodata Model in Research Environment is used as an over-actuated system with actuator saturation, to demonstrate the effectiveness of the proposed method.

preprint2019arXiv

Time Distributed Optimization for Model Predictive Control: Stability, Robustness, and Constraint Satisfaction

Time distributed optimization is an implementation strategy that can significantly reduce the computational burden of model predictive control by exploiting its robustness to incomplete optimization. When using this strategy, optimization iterations are distributed over time by maintaining a running solution estimate for the optimal control problem and updating it at each sampling instant. The resulting controller can be viewed as a dynamic compensator which is placed in closed-loop with the plant. This paper presents a general systems theoretic analysis framework for time distributed optimization. The coupled plant-optimizer system is analyzed using input-to-state stability concepts and sufficient conditions for stability and constraint satisfaction are derived. When applied to time distributed sequential quadratic programming, the framework significantly extends the existing theoretical analysis for the real-time iteration scheme. Numerical simulations are presented that demonstrate the effectiveness of the scheme.
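
The idea of keeping a running solution estimate and spending only a few optimizer iterations per sampling instant can be illustrated with a toy loop. The sketch below is an assumption-laden illustration, not the paper's framework: it uses a hypothetical unstable scalar plant, a one-step-horizon quadratic cost, and plain gradient descent in place of sequential quadratic programming. All numerical values and the function name are chosen for illustration only.

```python
def simulate(iters_per_step: int, steps: int = 60) -> float:
    """Run a toy plant with a warm-started, partially solved one-step MPC.

    Returns the final state. All parameters below are illustrative assumptions.
    """
    a, b = 1.2, 1.0   # hypothetical unstable scalar plant: x+ = a*x + b*u
    q, r = 1.0, 0.1   # weights of the cost J(u) = q*(a*x + b*u)**2 + r*u**2
    alpha = 0.2       # gradient step size (deliberately not an exact solve)
    x, u = 1.0, 0.0   # initial state and initial running solution estimate

    for _ in range(steps):
        # Refine the running estimate u with a fixed iteration budget,
        # warm-started from the value carried over from the previous sample.
        for _ in range(iters_per_step):
            grad = 2.0 * q * b * (a * x + b * u) + 2.0 * r * u
            u -= alpha * grad
        # Apply the (generally suboptimal) input to the plant.
        x = a * x + b * u
    return x


if __name__ == "__main__":
    # One gradient iteration per sample already drives this plant to the origin.
    print(abs(simulate(iters_per_step=1)))
    # With no iterations the input stays at zero and the open-loop state diverges.
    print(abs(simulate(iters_per_step=0)))
```

Here the pair (x, u) evolves as a coupled plant-optimizer system: the optimizer state u is carried between samples rather than recomputed from scratch, which is the sense in which the controller behaves like a dynamic compensator.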