Researcher profile

Murat Arcak

Murat Arcak contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
16 works
0 followers
5 topics
4 close collaborators

Published work

16 published items

preprint | 2024 | arXiv

Stability Margins of Neural Network Controllers

We present a method to train neural network controllers with guaranteed stability margins. The method is applicable to linear time-invariant plants interconnected with uncertainties and nonlinearities that are described by integral quadratic constraints. The type of stability margin we consider is the disk margin. Our training method alternates between a training step to maximize reward and a stability margin-enforcing step. In the stability margin-enforcing step, we solve a semidefinite program to project the controller into the set of controllers for which we can certify the desired disk margin.

preprint | 2022 | arXiv

Detecting Braess Routes: an Algorithm Accounting for Queuing Delays With an Extended Graph

The Braess paradox is a counter-intuitive phenomenon whereby adding roads to a network results in higher travel time at equilibrium. In this paper we present an algorithm to detect the occurrence of this paradox in real-world networks with the help of an improved graph representation accounting for queues. The addition of queues to the network representation enables a closer match with real data. Moreover, we search for routes causing this phenomenon ("Braess routes") rather than links, and advocate removing such routes virtually from navigation systems so that the associated links can continue to serve other routes. Our algorithm relies on a convex optimization problem utilizing Beckmann potentials for road links as well as queues, and results in a route reconfiguration with reduced delay. We assume the availability of historical data to build the optimization model. We also assume the existence of a centralized navigation system to manage the routing options and remove the Braess routes. The theoretical solution demonstrates up to 12% delay reduction in a network from Montgomery County, Maryland. We validate the improvement with simulations.
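The paradox itself can be illustrated with the classic four-node textbook network, which is not the extended queue graph or the Beckmann-potential optimization described in the abstract. The sketch below is a minimal illustration under assumed values: the network, the 4000-traveller demand, the link costs, and the function names `route_times` and `is_wardrop_equilibrium` are all hypothetical.

```python
# Classic Braess network (illustrative assumption, not the paper's model):
#   Start -> A : flow / 100      A -> End : 45
#   Start -> B : 45              B -> End : flow / 100
#   optional zero-cost shortcut A -> B
# Routes: "top" = Start-A-End, "bottom" = Start-B-End, "zig" = Start-A-B-End.

def route_times(flows):
    """Return the travel time of each route for the given route flows."""
    flow_start_a = flows["top"] + flows["zig"]   # load on link Start -> A
    flow_b_end = flows["bottom"] + flows["zig"]  # load on link B -> End
    return {
        "top": flow_start_a / 100 + 45,
        "bottom": 45 + flow_b_end / 100,
        "zig": flow_start_a / 100 + flow_b_end / 100,
    }

def is_wardrop_equilibrium(flows, available):
    """True if every used route is no slower than any available route."""
    times = route_times(flows)
    best = min(times[r] for r in available)
    return all(times[r] <= best + 1e-9 for r in available if flows[r] > 0)

# Equilibrium without the shortcut: travellers split evenly.
without_shortcut = {"top": 2000, "bottom": 2000, "zig": 0}
# Equilibrium with the shortcut: everyone takes the zig-zag route.
with_shortcut = {"top": 0, "bottom": 0, "zig": 4000}

time_without = route_times(without_shortcut)["top"]
time_with = route_times(with_shortcut)["zig"]
```

In this toy network the equilibrium travel time rises from 65 to 80 time units once the zig-zag route becomes available, so hiding that single route from travellers restores the faster equilibrium while every link stays in place.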

preprint | 2022 | arXiv

Population Games With Erlang Clocks: Convergence to Nash Equilibria For Pairwise Comparison Dynamics

The prevailing methodology for analyzing population games and evolutionary dynamics in the large population limit assumes that a Poisson process (or clock) inherent to each agent determines when the agent can revise its strategy. Hence, such an approach presupposes exponentially distributed inter-revision intervals, and is inadequate for cases where each strategy entails a sequence of sub-tasks (sub-strategies) that must be completed before a new revision time occurs. This article proposes a methodology for such cases under the premise that a sub-strategy's duration is exponentially distributed, leading to Erlang-distributed inter-revision intervals. We assume that a so-called pairwise-comparison protocol captures the agents' revision preferences to render our analysis concrete. The presence of sub-strategies brings on additional dynamics that are incompatible with existing models and results. Our main contributions are twofold, both derived for a deterministic approximation valid for large populations. We prove convergence of the population's state to the Nash equilibrium set when a potential game generates a payoff for the strategies. We use system-theoretic passivity to determine conditions under which this convergence is guaranteed for contractive games.

preprint | 2022 | arXiv

Safe-by-Design Planner-Tracker Synthesis

We present a safe-by-design trajectory planning and tracking framework for nonlinear dynamical systems using a hierarchy of system models. The planning layer uses a low-fidelity model to plan a feasible trajectory satisfying the planning constraints, and the tracking layer utilizes the high-fidelity model to design a controller that restricts the error states between the low- and high-fidelity models to a bounded set. The low-fidelity model enables the planning to be performed online (e.g. using Model Predictive Control) and the tracking controller and error bound are derived offline (e.g. using sum-of-squares programming). To provide freedom in the choice of the low-fidelity model, we allow the tracking error to depend on both the states and inputs of the planner. The goal of this article is to provide a tutorial review of this hierarchical framework and to illustrate it with examples, including a design for vehicle obstacle avoidance.

preprint | 2021 | arXiv

Attitude Trajectory Optimization for Agile Satellites in Autonomous Remote Sensing Constellation

Agile attitude maneuvering maximizes the utility of remote sensing satellite constellations. By taking into account a satellite's physical properties and its actuator specifications, we may leverage the full performance potential of the attitude control system to conduct agile remote sensing beyond conventional slew-and-stabilize maneuvers. Employing a constellation of agile satellites, coordinated by an autonomous and responsive scheduler, can significantly increase overall response rate, revisit time and global coverage for the mission. In this paper, we use recent advances in sequential convex programming based trajectory optimization to enable rapid target acquisition, pointing and tracking capabilities for a scheduler-based constellation. We present two problem formulations. The Minimum-Time Slew Optimal Control Problem determines the minimum time, required energy, and optimal trajectory to slew between any two orientations given nonlinear quaternion kinematics, gyrostat and actuator dynamics, and state/input constraints. By gridding the space of 3D rotations and efficiently solving this problem on the grid, we produce lookup tables or parametric fits off-line that can then be used on-line by a scheduler to compute accurate estimates of minimum-time and maneuver energy for real-time constellation scheduling. The Minimum-Effort Multi-Target Pointing Optimal Control Problem is used on-line by each satellite to produce continuous attitude-state and control-input trajectories that realize a given schedule while minimizing attitude error and control effort. The optimal trajectory may then be achieved by a low-level tracking controller. We demonstrate our approach with an example of a reference satellite in Sun-synchronous orbit passing over globally distributed Earth-observation targets.

preprint | 2021 | arXiv

Escaping spurious local minimum trajectories in online time-varying nonconvex optimization

A major limitation of online algorithms that track the optimizers of time-varying nonconvex optimization problems is that they focus on a specific local minimum trajectory, which may lead to poor spurious local solutions. In this paper, we show that the natural temporal variation may help simple online tracking methods find and track time-varying global minima. To this end, we investigate the properties of a time-varying projected gradient flow system with inertia, which can be regarded as the continuous-time limit of (1) the optimality conditions for a discretized sequential optimization problem with a proximal regularization and (2) the online tracking scheme. We introduce the notion of the dominant trajectory and show that the inherent temporal variation could re-shape the landscape of the Lagrange functional and help a proximal algorithm escape the spurious local minimum trajectories if the global minimum trajectory is dominant. For a problem with twice continuously differentiable objective function and constraints, sufficient conditions are derived to guarantee that no matter how a local search method is initialized, it will track a time-varying global solution after some time. The results are illustrated on a benchmark example with many local minima.

preprint | 2021 | arXiv

Stability Analysis using Quadratic Constraints for Systems with Neural Network Controllers

A method is presented to analyze the stability of feedback systems with neural network controllers. Two stability theorems are given to prove asymptotic stability and to compute an ellipsoidal inner-approximation to the region of attraction (ROA). The first theorem addresses linear time-invariant systems, and merges Lyapunov theory with local (sector) quadratic constraints to bound the nonlinear activation functions in the neural network. The second theorem allows the system to include perturbations such as unmodeled dynamics, slope-restricted nonlinearities, and time delay, using integral quadratic constraints (IQCs) to capture their input/output behavior. This in turn allows for off-by-one IQCs to refine the description of activation functions by capturing their slope restrictions. Both results rely on semidefinite programming to approximate the ROA. The method is illustrated on systems with neural networks trained to stabilize a nonlinear inverted pendulum as well as vehicle lateral dynamics with actuator uncertainty.

preprint | 2020 | arXiv

Approximate abstractions of control systems with an application to aggregation

Previous approaches to constructing abstractions for control systems rely on geometric conditions or, in the case of an interconnected control system, a condition on the interconnection topology. Since these conditions are not always satisfiable, we relax the restrictions on the choice of abstractions, instead opting to select ones which nearly satisfy such conditions via optimization-based approaches. To quantify the resulting effect on the error between the abstraction and concrete control system, we introduce the notions of practical simulation functions and practical storage functions. We show that our approach facilitates the procedure of aggregation, where one creates an abstraction by partitioning agents into aggregate areas. We demonstrate the results on an application where we regulate the temperature in three separate zones of a building.

preprint | 2020 | arXiv

Backward Reachability using Integral Quadratic Constraints for Uncertain Nonlinear Systems

A method is proposed to compute robust inner-approximations to the backward reachable set for uncertain nonlinear systems. It also produces a robust control law that drives trajectories starting in these sets to the target set. The method merges dissipation inequalities and integral quadratic constraints (IQCs) with both hard and soft IQC factorizations. Computational algorithms are presented using the generalized S-procedure and sum-of-squares techniques. The use of IQCs in backward reachability analysis allows for a variety of perturbations including parametric uncertainty, unmodeled dynamics, nonlinearities, and uncertain time delays. The method is demonstrated on two examples, including a 6-state quadrotor with actuator uncertainties.

preprint | 2020 | arXiv

Bayesian Safe Learning and Control with Sum-of-Squares Analysis and Polynomial Kernels

We propose an iterative method to safely learn the unmodeled dynamics of a nonlinear system using Bayesian Gaussian process (GP) models with polynomial kernel functions. The method maintains safety by ensuring that the system state stays within the region of attraction (ROA) of a stabilizing control policy while collecting data. A quadratic programming based exploration control policy is computed to keep the exploration trajectory inside an inner-approximation of the ROA and to maximize the information gained from the trajectory. A prior GP model, which incorporates prior information about the unknown dynamics, is used to construct an initial stabilizing policy. As the GP model is updated with data, it is used to synthesize a new policy and a larger ROA, which increases the range of safe exploration. The use of polynomial kernels allows us to compute ROA inner-approximations and stabilizing control laws for the model using sum-of-squares programming. We also provide a probabilistic guarantee of safety which ensures that the policy computed using the learned model stabilizes the true dynamics with high confidence.

preprint | 2020 | arXiv

Improving Urban Traffic Throughput with Vehicle Platooning: Theory and Experiments

In this paper we present a model-predictive control (MPC) based approach for vehicle platooning in an urban traffic setting. Our primary goal is to demonstrate that vehicle platooning has the potential to significantly increase throughput at intersections, which can create bottlenecks in the traffic flow. To do so, our approach relies on vehicle connectivity: vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I) communication. In particular, we introduce a customized V2V message set which features a velocity forecast, i.e. a prediction on the future velocity trajectory, which enables platooning vehicles to accurately maintain short following distances, thereby increasing throughput. Furthermore, V2I communication allows platoons to react immediately to changes in the state of nearby traffic lights, e.g. when the traffic phase becomes green, enabling additional gains in traffic efficiency. We present our design of the vehicle platooning system, and then evaluate performance by estimating the potential gains in terms of throughput using our results from simulation, as well as experiments conducted with real test vehicles on a closed track. Lastly, we briefly overview our demonstration of vehicle platooning on public roadways in Arcadia, CA.

preprint | 2020 | arXiv

Optimal assignment of collaborating agents in multi-body asset-guarding games

We study a multi-body asset-guarding game in missile defense where teams of interceptor missiles collaborate to defend a non-maneuvering asset against a group of threat missiles. We approach the problem in two steps. We first formulate an assignment problem where we optimally assign subsets of collaborating interceptors to each threat so that all threats are intercepted as far away from the asset as possible. We assume that each interceptor is controlled by a collaborative guidance law derived from linear quadratic dynamic games. Our results include a 6-DOF simulation of a 5-interceptor versus 3-threat missile engagement where each agent is modeled as a missile airframe controlled by an autopilot. Despite the assumption of linear dynamics in our collaborative guidance law and the unmodeled dynamics in the simulation environment (e.g., varying density and gravity), we show that the simulated trajectories match well with those predicted by our approach. Furthermore, we show that a more agile threat, with greater speed and acceleration, can be intercepted by inferior interceptors when they collaborate. We believe the concepts introduced in this paper may be applied in asymmetric missile defense scenarios, including defense against advanced cruise missiles and hypersonic vehicles.

preprint | 2020 | arXiv

Passivity-based distributed acquisition and station-keeping control of a satellite constellation in areostationary orbit

We present a distributed control law to assemble a cluster of satellites into an equally-spaced, planar constellation in a desired circular orbit about a planet. We assume each satellite only uses local information, transmitted through communication links with neighboring satellites. The same control law is used to maintain relative angular positions in the presence of disturbance forces. The stability of the constellation in the desired orbit is proved using a compositional approach. We first show the existence and uniqueness of an equilibrium of the interconnected system. We then certify each satellite and communication link is equilibrium-independent passive with respective storage functions. By leveraging the skew symmetric coupling structure of the constellation and the equilibrium-independent passivity property of each subsystem, we show that the equilibrium of the interconnected system is stable with a Lyapunov function composed of the individual subsystem storage functions. We further prove that the angular velocity of each satellite converges to the desired value necessary to maintain circular, areostationary orbit. Finally, we present simulation results to demonstrate the efficacy of the proposed control law in acquisition and station-keeping of an equally-spaced satellite constellation in areostationary orbit despite the presence of unmodeled disturbance forces.

preprint | 2020 | arXiv

PIRK: Scalable Interval Reachability Analysis for High-Dimensional Nonlinear Systems

Reachability analysis is a critical tool for the formal verification of dynamical systems and the synthesis of controllers for them. Due to their computational complexity, many reachability analysis methods are restricted to systems with relatively small dimensions. One significant reason for such limitation is that those approaches, and their implementations, are not designed to leverage parallelism. They use algorithms that are designed to run serially within one compute unit and they cannot utilize widely available high-performance computing (HPC) platforms such as many-core CPUs, GPUs and cloud-computing services. This paper presents PIRK, a tool to efficiently compute reachable sets for general nonlinear systems of extremely high dimensions. PIRK has been tested on several systems, with state dimensions ranging from ten up to 4 billion. The scalability of PIRK's parallel implementations is found to be highly favorable.

preprint | 2020 | arXiv

Reachability Analysis Using Dissipation Inequalities For Uncertain Nonlinear Systems

We propose a method to outer bound forward reachable sets on finite horizons for uncertain nonlinear systems with polynomial dynamics. This method makes use of time-dependent polynomial storage functions that satisfy appropriate dissipation inequalities that account for time-varying uncertain parameters, L2 disturbances, and perturbations characterized by integral quadratic constraints (IQCs) with both hard and soft factorizations. In fact, to our knowledge, this is the first result introducing IQCs to reachability analysis, thus allowing for various types of uncertainty, including unmodeled dynamics. The generalized S-procedure and Sum-of-Squares techniques are used to derive algorithms with the goal of finding the tightest outer bound with a desired shape. Both pedagogical and practically motivated examples are presented, including a 7-state F-18 aircraft model.

preprint | 2019 | arXiv

Continuous Abstraction of Nonlinear Systems using Sum-of-Squares Programming

We present a control design procedure for nonlinear control systems in which we represent a potentially high dimensional system with a low dimensional continuous-state abstraction. The abstraction generates a reference which the original system follows with a low level controller. We propose sum-of-squares programming as a tool to design this controller and to provide an upper bound on the relative error between the system and its abstraction. We compute the low level controller simultaneously with a simulation function that gives the boundedness guarantee for the relative error.