Source author record

Soon-Jo Chung

Soon-Jo Chung appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC eess.SY Systems and Control Robotics Machine Learning math.DS Artificial Intelligence math.PR Information Theory math.IT math.ST nlin.AO Statistics Theory

Catalog footprint

What is connected

18works

13topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Competitive Control with Delayed Imperfect Information

This paper studies the impact of imperfect information in online control with adversarial disturbances. In particular, we consider both delayed state feedback and inexact predictions of future disturbances. We introduce a greedy, myopic policy that yields a constant competitive ratio against the offline optimal policy. We also analyze the fundamental limits of online control with limited information by showing that our competitive ratio bounds for the greedy, myopic policy in the adversarial setting match (up to lower-order terms) lower bounds in the stochastic setting.

preprint2022arXiv

Incremental Nonlinear Stability Analysis of Stochastic Systems Perturbed by Lévy Noise

We present a theoretical framework for characterizing incremental stability of nonlinear stochastic systems perturbed by compound Poisson shot noise and finite-measure Lévy noise. For each noise type, we compare trajectories of the perturbed system with distinct noise sample paths against trajectories of the nominal, unperturbed system. We show that for a finite number of jumps arising from the noise process, the mean-squared error between the trajectories exponentially converge towards a bounded error ball across a finite interval of time under practical boundedness assumptions. The convergence rate for shot noise systems is the same as the exponentially-stable nominal system, but with a tradeoff between the parameters of the shot noise process and the size of the error ball. The convergence rate and the error ball for the Lévy noise system are shown to be nearly direct sums of the respective quantities for the shot and white noise systems separately, a result which is analogous to the Lévy-Khintchine theorem. We demonstrate our results using several numerical case studies.

preprint2022arXiv

Meta-Learning-Based Robust Adaptive Flight Control Under Uncertain Wind Conditions

Realtime model learning proves challenging for complex dynamical systems, such as drones flying in variable wind conditions. Machine learning technique such as deep neural networks have high representation power but is often too slow to update onboard. On the other hand, adaptive control relies on simple linear parameter models can update as fast as the feedback control loop. We propose an online composite adaptation method that treats outputs from a deep neural network as a set of basis functions capable of representing different wind conditions. To help with training, meta-learning techniques are used to optimize the network output useful for adaptation. We validate our approach by flying a drone in an open air wind tunnel under varying wind conditions and along challenging trajectories. We compare the result with other adaptive controller with different basis function sets and show improvement over tracking and prediction errors.

preprint2022arXiv

Trajectory Optimization of Chance-Constrained Nonlinear Stochastic Systems for Motion Planning Under Uncertainty

We present gPC-SCP: Generalized Polynomial Chaos-based Sequential Convex Programming to compute a sub-optimal solution for a continuous-time chance-constrained stochastic nonlinear optimal control (SNOC) problem. The approach enables motion planning for robotic systems under uncertainty. The gPC-SCP method involves two steps. The first step is to derive a surrogate problem of \emph{deterministic} nonlinear optimal control (DNOC) with convex constraints by using gPC expansion and the distributionally-robust convex subset of the chance constraints. The second step is to solve the DNOC problem using sequential convex programming for trajectory generation and control. We prove that in the unconstrained case, the optimal value of the DNOC converges to that of SNOC asymptotically and that any feasible solution of the constrained DNOC is a feasible solution of the chance-constrained SNOC. We also present the predictor-corrector extension (gPC-SCP$^\mathrm{PC}$) for real-time motion trajectory generation in the presence of stochastic uncertainty. In the gPC-SCP$^\mathrm{PC}$ method, we first predict the uncertainty using the gPC method and then optimize the motion plan to accommodate the uncertainty. We empirically demonstrate the efficacy of the gPC-SCP and the gPC-SCP$^\mathrm{PC}$ methods for the following two test cases: 1) collision checking under uncertainty in actuation and physical parameters and 2) collision checking with stochastic obstacle model for 3DOF and 6DOF robotic systems. We validate the effectiveness of the gPC-SCP method on the 3DOF robotic spacecraft testbed.

preprint2021arXiv

Neural Stochastic Contraction Metrics for Learning-based Control and Estimation

We present Neural Stochastic Contraction Metrics (NSCM), a new design framework for provably-stable robust control and estimation for a class of stochastic nonlinear systems. It uses a spectrally-normalized deep neural network to construct a contraction metric, sampled via simplified convex optimization in the stochastic setting. Spectral normalization constrains the state-derivatives of the metric to be Lipschitz continuous, thereby ensuring exponential boundedness of the mean squared distance of system trajectories under stochastic disturbances. The NSCM framework allows autonomous agents to approximate optimal stable control and estimation policies in real-time, and outperforms existing nonlinear control and estimation techniques including the state-dependent Riccati equation, iterative LQR, EKF, and the deterministic neural contraction metric, as illustrated in simulation results.

preprint2021arXiv

Online Optimization with Memory and Competitive Control

This paper presents competitive algorithms for a novel class of online optimization problems with memory. We consider a setting where the learner seeks to minimize the sum of a hitting cost and a switching cost that depends on the previous $p$ decisions. This setting generalizes Smoothed Online Convex Optimization. The proposed approach, Optimistic Regularized Online Balanced Descent, achieves a constant, dimension-free competitive ratio. Further, we show a connection between online optimization with memory and online control with adversarial disturbances. This connection, in turn, leads to a new constant-competitive policy for a rich class of online control problems.

preprint2021arXiv

The Power of Predictions in Online Control

We study the impact of predictions in online Linear Quadratic Regulator control with both stochastic and adversarial disturbances in the dynamics. In both settings, we characterize the optimal policy and derive tight bounds on the minimum cost and dynamic regret. Perhaps surprisingly, our analysis shows that the conventional greedy MPC approach is a near-optimal policy in both stochastic and adversarial settings. Specifically, for length-$T$ problems, MPC requires only $O(\log T)$ predictions to reach $O(1)$ dynamic regret, which matches (up to lower-order terms) our lower bound on the required prediction horizon for constant regret.

preprint2020arXiv

Distributed Adaptive Reinforcement Learning: A Method for Optimal Routing

In this paper, a learning-based optimal transportation algorithm for autonomous taxis and ridesharing vehicles is presented. The goal is to design a mechanism to solve the routing problem for multiple autonomous vehicles and multiple customers in order to maximize the transportation company's profit. As a result, each vehicle selects the customer whose request maximizes the company's profit in the long run. To solve this problem, the system is modeled as a Markov Decision Process (MDP) using past customers data. By solving the defined MDP, a centralized high-level planning recommendation is obtained, where this offline solution is used as an initial value for the real-time learning. Then, a distributed SARSA reinforcement learning algorithm is proposed to capture the model errors and the environment changes, such as variations in customer distributions in each area, traffic, and fares, thereby providing optimal routing policies in real-time. Vehicles, or agents, use only their local information and interaction, such as current passenger requests and estimates of neighbors' tasks and their optimal actions, to obtain the optimal policies in a distributed fashion. An optimal adaptive rate is introduced to make the distributed SARSA algorithm capable of adapting to changes in the environment and tracking the time-varying optimal policies. Furthermore, a game-theory-based task assignment algorithm is proposed, where each agent uses the optimal policies and their values from distributed SARSA to select its customer from the set of local available requests in a distributed manner. Finally, the customers data provided by the city of Chicago is used to validate the proposed algorithms.

preprint2020arXiv

GLAS: Global-to-Local Safe Autonomy Synthesis for Multi-Robot Motion Planning with End-to-End Learning

We present GLAS: Global-to-Local Autonomy Synthesis, a provably-safe, automated distributed policy generation for multi-robot motion planning. Our approach combines the advantage of centralized planning of avoiding local minima with the advantage of decentralized controllers of scalability and distributed computation. In particular, our synthesized policies only require relative state information of nearby neighbors and obstacles, and compute a provably-safe action. Our approach has three major components: i) we generate demonstration trajectories using a global planner and extract local observations from them, ii) we use deep imitation learning to learn a decentralized policy that can run efficiently online, and iii) we introduce a novel differentiable safety module to ensure collision-free operation, thereby allowing for end-to-end policy training. Our numerical experiments demonstrate that our policies have a 20% higher success rate than optimal reciprocal collision avoidance, ORCA, across a wide range of robot and obstacle densities. We demonstrate our method on an aerial swarm, executing the policy on low-end microcontrollers in real-time.

preprint2020arXiv

Neural-Swarm: Decentralized Close-Proximity Multirotor Control Using Learned Interactions

In this paper, we present Neural-Swarm, a nonlinear decentralized stable controller for close-proximity flight of multirotor swarms. Close-proximity control is challenging due to the complex aerodynamic interaction effects between multirotors, such as downwash from higher vehicles to lower ones. Conventional methods often fail to properly capture these interaction effects, resulting in controllers that must maintain large safety distances between vehicles, and thus are not capable of close-proximity flight. Our approach combines a nominal dynamics model with a regularized permutation-invariant Deep Neural Network (DNN) that accurately learns the high-order multi-vehicle interactions. We design a stable nonlinear tracking controller using the learned model. Experimental results demonstrate that the proposed controller significantly outperforms a baseline nonlinear tracking controller with up to four times smaller worst-case height tracking errors. We also empirically demonstrate the ability of our learned model to generalize to larger swarm sizes.

preprint2020arXiv

Robust Regression for Safe Exploration in Control

We study the problem of safe learning and exploration in sequential control problems. The goal is to safely collect data samples from operating in an environment, in order to learn to achieve a challenging control goal (e.g., an agile maneuver close to a boundary). A central challenge in this setting is how to quantify uncertainty in order to choose provably-safe actions that allow us to collect informative data and reduce uncertainty, thereby achieving both improved controller safety and optimality. To address this challenge, we present a deep robust regression model that is trained to directly predict the uncertainty bounds for safe exploration. We derive generalization bounds for learning, and connect them with safety and stability bounds in control. We demonstrate empirically that our robust regression approach can outperform the conventional Gaussian process (GP) based safe exploration in settings where it is difficult to specify a good GP prior.

preprint2019arXiv

Neural Lander: Stable Drone Landing Control using Learned Dynamics

Precise near-ground trajectory control is difficult for multi-rotor drones, due to the complex aerodynamic effects caused by interactions between multi-rotor airflow and the environment. Conventional control methods often fail to properly account for these complex effects and fall short in accomplishing smooth landing. In this paper, we present a novel deep-learning-based robust nonlinear controller (Neural Lander) that improves control performance of a quadrotor during landing. Our approach combines a nominal dynamics model with a Deep Neural Network (DNN) that learns high-order interactions. We apply spectral normalization (SN) to constrain the Lipschitz constant of the DNN. Leveraging this Lipschitz property, we design a nonlinear feedback linearization controller using the learned model and prove system stability with disturbance rejection. To the best of our knowledge, this is the first DNN-based nonlinear feedback controller with stability guarantees that can utilize arbitrarily large neural nets. Experimental results demonstrate that the proposed controller significantly outperforms a Baseline Nonlinear Tracking Controller in both landing and cross-table trajectory tracking cases. We also empirically show that the DNN generalizes well to unseen data outside the training domain.

preprint2016arXiv

Distributed Estimation using Bayesian Consensus Filtering

We present the Bayesian consensus filter (BCF) for tracking a moving target using a networked group of sensing agents and achieving consensus on the best estimate of the probability distributions of the target's states. Our BCF framework can incorporate nonlinear target dynamic models, heterogeneous nonlinear measurement models, non-Gaussian uncertainties, and higher-order moments of the locally estimated posterior probability distribution of the target's states obtained using Bayesian filters. If the agents combine their estimated posterior probability distributions using a logarithmic opinion pool, then the sum of Kullback--Leibler divergences between the consensual probability distribution and the local posterior probability distributions is minimized. Rigorous stability and convergence results for the proposed BCF algorithm with single or multiple consensus loops are presented. Communication of probability distributions and computational methods for implementing the BCF algorithm are discussed along with a numerical example.

preprint2016arXiv

Probabilistic and Distributed Control of a Large-Scale Swarm of Autonomous Agents

We present a novel method for guiding a large-scale swarm of autonomous agents into a desired formation shape in a distributed and scalable manner. Our Probabilistic Swarm Guidance using Inhomogeneous Markov Chains (PSG-IMC) algorithm adopts an Eulerian framework, where the physical space is partitioned into bins and the swarm's density distribution over each bin is controlled. Each agent determines its bin transition probabilities using a time-inhomogeneous Markov chain. These time-varying Markov matrices are constructed by each agent in real-time using the feedback from the current swarm distribution, which is estimated in a distributed manner. The PSG-IMC algorithm minimizes the expected cost of the transitions per time instant, required to achieve and maintain the desired formation shape, even when agents are added to or removed from the swarm. The algorithm scales well with a large number of agents and complex formation shapes, and can also be adapted for area exploration applications. We demonstrate the effectiveness of this proposed swarm guidance algorithm by using results of numerical simulations and hardware experiments with multiple quadrotors.

preprint2014arXiv

Target Assignment in Robotic Networks: Distance Optimality Guarantees and Hierarchical Strategies

We study the problem of multi-robot target assignment to minimize the total distance traveled by the robots until they all reach an equal number of static targets. In the first half of the paper, we present a necessary and sufficient condition under which true distance optimality can be achieved for robots with limited communication and target-sensing ranges. Moreover, we provide an explicit, non-asymptotic formula for computing the number of robots needed to achieve distance optimality in terms of the robots' communication and target-sensing ranges with arbitrary guaranteed probabilities. The same bounds are also shown to be asymptotically tight. In the second half of the paper, we present suboptimal strategies for use when the number of robots cannot be chosen freely. Assuming first that all targets are known to all robots, we employ a hierarchical communication model in which robots communicate only with other robots in the same partitioned region. This hierarchical communication model leads to constant approximations of true distance-optimal solutions under mild assumptions. We then revisit the limited communication and sensing models. By combining simple rendezvous-based strategies with a hierarchical communication model, we obtain decentralized hierarchical strategies that achieve constant approximation ratios with respect to true distance optimality. Results of simulation show that the approximation ratio is as low as 1.4.

preprint2011arXiv

Application of Synchronization to Formation Flying Spacecraft: Lagrangian Approach

This article presents a unified synchronization framework with application to precision formation flying spacecraft. Central to the proposed innovation, in applying synchronization to both translational and rotational dynamics in the Lagrangian form, is the use of the distributed stability and performance analysis tool, called contraction analysis that yields exact nonlinear stability proofs. The proposed decentralized tracking control law synchronizes the attitude of an arbitrary number of spacecraft into a common time-varying trajectory with global exponential convergence. Moreover, a decentralized translational tracking control law based on phase synchronization is presented, thus enabling coupled translational and rotational maneuvers. While the translational dynamics can be adequately controlled by linear control laws, the proposed method permits highly nonlinear systems with nonlinearly coupled inertia matrices such as the attitude dynamics of spacecraft whose large and rapid slew maneuvers justify the nonlinear control approach. The proposed method integrates both the trajectory tracking and synchronization problems in a single control framework.

preprint2011arXiv

Neurobiologically Inspired Control of Engineered Flapping Flight

This article presents a new control approach and a dynamic model for engineered flapping flight with many interacting degrees of freedom. This paper explores the applications of neurobiologically inspired control systems in the form of Central Pattern Generators (CPG) to control flapping flight dynamics. A rigorous mathematical and control theoretic framework to design complex three dimensional wing motions is presented based on phase synchronization of nonlinear oscillators. In particular, we show the flapping flying dynamics without a tail or traditional aerodynamic control surfaces can be effectively controlled by a reduced set of CPG parameters that generate phase-synchronized or symmetry-breaking oscillatory motions of two main wings. Furthermore, by using Hopf bifurcation, we show that tailless aircraft alternating between flapping and gliding can be effectively stabilized by smooth wing motions driven by the CPG network. Results of numerical simulation with a full six degree-of-freedom flight dynamic model validate the effectiveness of the proposed neurobiologically inspired control approach.

preprint2010arXiv

On Synchronization of Coupled Hopf-Kuramoto Oscillators with Phase Delays

This paper presents new methods and results on almost global synchronization of coupled Hopf nonlinear oscillators, which are commonly used as the dynamic model of engineered central pattern generators (CPGs). On balanced graphs, any positive coupling gain is proven to induce almost global asymptotic synchronization, and a threshold value for truly global exponential synchronization is also computed. Furthermore, a hierarchical connection between coupled Hopf oscillators and Kuramoto oscillators is identified. Finally, a new result on the synchronization of Kuramoto oscillators with arbitrary time-varying heterogeneous frequencies and delays is derived.

Soon-Jo Chung

What is connected

Connect this record

See the researcher in context

Building this map preview

18 published item(s)

Competitive Control with Delayed Imperfect Information

Incremental Nonlinear Stability Analysis of Stochastic Systems Perturbed by Lévy Noise

Meta-Learning-Based Robust Adaptive Flight Control Under Uncertain Wind Conditions

Trajectory Optimization of Chance-Constrained Nonlinear Stochastic Systems for Motion Planning Under Uncertainty

Neural Stochastic Contraction Metrics for Learning-based Control and Estimation

Online Optimization with Memory and Competitive Control

The Power of Predictions in Online Control

Distributed Adaptive Reinforcement Learning: A Method for Optimal Routing

GLAS: Global-to-Local Safe Autonomy Synthesis for Multi-Robot Motion Planning with End-to-End Learning

Neural-Swarm: Decentralized Close-Proximity Multirotor Control Using Learned Interactions

Robust Regression for Safe Exploration in Control

Neural Lander: Stable Drone Landing Control using Learned Dynamics

Distributed Estimation using Bayesian Consensus Filtering

Probabilistic and Distributed Control of a Large-Scale Swarm of Autonomous Agents

Target Assignment in Robotic Networks: Distance Optimality Guarantees and Hierarchical Strategies

Application of Synchronization to Formation Flying Spacecraft: Lagrangian Approach

Neurobiologically Inspired Control of Engineered Flapping Flight

On Synchronization of Coupled Hopf-Kuramoto Oscillators with Phase Delays