Source author record

Jean-Jacques E. Slotine

Jean-Jacques E. Slotine appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

math.OC, Systems and Control, eess.SY, Machine Learning, math.DS, Robotics, Biological Physics, Artificial Intelligence, Cell Behavior, math.CA, nlin.AO, nlin.CD

Catalog footprint

What is connected

18 works

12 topics

4 close collaborators


preprint · 2022 · arXiv

Adaptive Variants of Optimal Feedback Policies

The stable combination of optimal feedback policies with online learning is studied in a new control-theoretic framework for uncertain nonlinear systems. The framework can be systematically used in transfer learning and sim-to-real applications, where an optimal policy learned for a nominal system needs to remain effective in the presence of significant variations in parameters. Given unknown parameters within a bounded range, the resulting adaptive control laws guarantee convergence of the closed-loop system to the state of zero cost. Online adjustment of the learning rate is used as a key stability mechanism, and preserves certainty equivalence when designing optimal policies without assuming uncertainty to be within the control range. The approach is illustrated on the familiar mountain car problem, where it yields near-optimal performance despite the presence of parametric model uncertainty.

preprint · 2022 · arXiv

Nonparametric adaptive control and prediction: theory and randomized algorithms

A key assumption in the theory of nonlinear adaptive control is that the uncertainty of the system can be expressed in the linear span of a set of known basis functions. While this assumption leads to efficient algorithms, it limits applications to very specific classes of systems. We introduce a novel nonparametric adaptive algorithm that estimates an infinite-dimensional density over parameters online to learn an unknown dynamics in a reproducing kernel Hilbert space. Surprisingly, the resulting control input admits an analytical expression that enables its implementation despite its underlying infinite-dimensional structure. While this adaptive input is rich and expressive - subsuming, for example, traditional linear parameterizations - its computational complexity grows linearly with time, making it comparatively more expensive than its parametric counterparts. Leveraging the theory of random Fourier features, we provide an efficient randomized implementation that recovers the complexity of classical parametric methods while provably retaining the expressivity of the nonparametric input. In particular, our explicit bounds only depend polynomially on the underlying parameters of the system, allowing our proposed algorithms to efficiently scale to high-dimensional systems. As an illustration of the method, we demonstrate the ability of the randomized approximation algorithm to learn a predictive model of a 60-dimensional system consisting of ten point masses interacting through Newtonian gravitation. By reinterpretation as a gradient flow on a specific loss, we conclude with a natural extension of our kernel-based adaptive algorithms to deep neural networks. We show empirically that the extra expressivity afforded by deep representations can lead to improved performance at the expense of closed-loop stability that is rigorously guaranteed and consistently observed for kernel machines.
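
The random Fourier feature idea mentioned in this abstract can be illustrated in a few lines. The sketch below is not the authors' algorithm: it only shows, under the assumption of a Gaussian kernel, how a finite random feature map approximates a kernel evaluation. The names `gaussian_kernel`, `make_rff` and `rff_kernel`, the bandwidth `sigma` and the feature count are all illustrative choices.

```python
import math
import random


def gaussian_kernel(x, y, sigma=1.0):
    """Exact Gaussian kernel k(x, y) = exp(-||x - y||^2 / (2 sigma^2))."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def make_rff(dim, num_features, sigma=1.0, seed=0):
    """Draw random frequencies and phases once; return the feature map z(x)."""
    rng = random.Random(seed)
    # Frequencies are sampled from the kernel's spectral density, N(0, 1/sigma^2).
    freqs = [[rng.gauss(0.0, 1.0 / sigma) for _ in range(dim)]
             for _ in range(num_features)]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(num_features)]
    scale = math.sqrt(2.0 / num_features)

    def features(x):
        return [scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
                for w, b in zip(freqs, phases)]

    return features


def rff_kernel(features, x, y):
    """Approximate k(x, y) by the inner product of the two feature vectors."""
    return sum(a * b for a, b in zip(features(x), features(y)))


if __name__ == "__main__":
    x, y = [0.3, -0.2, 0.5], [0.1, 0.4, -0.3]
    feats = make_rff(dim=3, num_features=4000)
    print("exact :", gaussian_kernel(x, y))
    print("approx:", rff_kernel(feats, x, y))
```

Because the feature vector has a fixed length, a model that is linear in these features has a cost per update that does not grow with time, which is the trade-off the abstract describes between the nonparametric input and its randomized approximation.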

preprint · 2022 · arXiv

The role of optimization geometry in single neuron learning

Recent numerical experiments have demonstrated that the choice of optimization geometry used during training can impact generalization performance when learning expressive nonlinear model classes such as deep neural networks. These observations have important implications for modern deep learning but remain poorly understood due to the difficulty of the associated nonconvex optimization problem. Towards an understanding of this phenomenon, we analyze a family of pseudogradient methods for learning generalized linear models under the square loss - a simplified problem containing both nonlinearity in the model parameters and nonconvexity of the optimization which admits a single neuron as a special case. We prove non-asymptotic bounds on the generalization error that sharply characterize how the interplay between the optimization geometry and the feature space geometry sets the out-of-sample performance of the learned model. Experimentally, selecting the optimization geometry as suggested by our theory leads to improved performance in generalized linear model estimation problems such as nonlinear and nonconvex variants of sparse vector recovery and low-rank matrix sensing.
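
To make "pseudogradient" concrete, here is a minimal sketch of one familiar update of this kind for a single sigmoid neuron under the square loss: the step uses the residual times the input and omits the derivative of the link function that the true gradient would carry. Whether this exact update belongs to the family analyzed in the paper is an assumption, and the data generator, step size and function names are hypothetical.

```python
import math
import random


def sigmoid(s):
    return 1.0 / (1.0 + math.exp(-s))


def dot(a, b):
    return sum(p * q for p, q in zip(a, b))


def make_data(w_true, num_samples=50, seed=0):
    """Noiseless samples y = sigmoid(w_true . x) with inputs of norm at most 1."""
    rng = random.Random(seed)
    xs = []
    for _ in range(num_samples):
        x = [rng.uniform(-1.0, 1.0) for _ in w_true]
        norm = math.sqrt(dot(x, x))
        xs.append([xi / max(norm, 1.0) for xi in x])
    ys = [sigmoid(dot(w_true, x)) for x in xs]
    return xs, ys


def pseudogradient_fit(xs, ys, steps=200, lr=1.0):
    """Batch pseudogradient iteration starting from w = 0."""
    w = [0.0] * len(xs[0])
    m = len(xs)
    for _ in range(steps):
        residuals = [y - sigmoid(dot(w, x)) for x, y in zip(xs, ys)]
        # The true square-loss gradient would also multiply each term by the
        # sigmoid derivative at w . x; the pseudogradient drops that factor.
        step = [sum(r * x[j] for r, x in zip(residuals, xs)) / m
                for j in range(len(w))]
        w = [wj + lr * sj for wj, sj in zip(w, step)]
    return w


if __name__ == "__main__":
    w_true = [1.0, -2.0, 0.5]
    xs, ys = make_data(w_true)
    print("estimate:", pseudogradient_fit(xs, ys))
```

This version measures distance in the Euclidean sense; the abstract's point is that other choices of optimization geometry change which solution is reached and how well it generalizes.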

preprint · 2021 · arXiv

Neural Stochastic Contraction Metrics for Learning-based Control and Estimation

We present Neural Stochastic Contraction Metrics (NSCM), a new design framework for provably-stable robust control and estimation for a class of stochastic nonlinear systems. It uses a spectrally-normalized deep neural network to construct a contraction metric, sampled via simplified convex optimization in the stochastic setting. Spectral normalization constrains the state-derivatives of the metric to be Lipschitz continuous, thereby ensuring exponential boundedness of the mean squared distance of system trajectories under stochastic disturbances. The NSCM framework allows autonomous agents to approximate optimal stable control and estimation policies in real-time, and outperforms existing nonlinear control and estimation techniques including the state-dependent Riccati equation, iterative LQR, EKF, and the deterministic neural contraction metric, as illustrated in simulation results.

preprint · 2020 · arXiv

Contraction Metrics in Adaptive Nonlinear Control

Lyapunov stability theory is the bedrock of direct adaptive control. Fundamentally, Lyapunov stability requires constructing a distance-like function which must decrease with time to ensure stability. Feedback linearization, backstepping, and sum-of-squares optimization are common approaches for constructing such a distance function, but require the system to possess certain inherent/structural properties or involve solving a non-convex optimization problem. These restrictions/complexities arise because Lyapunov stability theory relies on constructing an explicit distance function. This work uses contraction metrics to derive an adaptive controller for stabilizable nonlinear systems by constructing a distance-like function differentially rather than explicitly. Because stabilizability is in fact equivalent to the existence of a contraction metric, the proposed approach is significantly more general than available results in the literature. In particular, the method can be applied to underactuated systems. More broadly, it can also be used in transfer learning where a feedback controller has been carefully learned for a nominal system, but needs to remain effective in the presence of significant but structured variations in parameters. Simulation results illustrate the approach.
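
For readers unfamiliar with the "differential" construction, the standard contraction condition from the contraction-analysis literature can be written as below. The notation is assumed here and does not come from the abstract: f is the (closed-loop) vector field, δx a virtual displacement between neighbouring trajectories, M(x,t) a uniformly positive definite metric, and λ > 0 the contraction rate.

```latex
\dot{x} = f(x,t), \qquad
\delta\dot{x} = \frac{\partial f}{\partial x}\,\delta x, \qquad
V(x,\delta x,t) = \delta x^{\top} M(x,t)\,\delta x,
\\[4pt]
\dot{M} + M\,\frac{\partial f}{\partial x}
        + \frac{\partial f}{\partial x}^{\top} M \preceq -2\lambda M
\quad\Longrightarrow\quad
\dot{V} \le -2\lambda V .
```

The function V plays the role of the distance-like function, but it is defined on infinitesimal displacements rather than on the state itself, which is why no explicit global distance function has to be constructed.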

preprint · 2020 · arXiv

Learning Stability Certificates from Data

Many existing tools in nonlinear control theory for establishing stability or safety of a dynamical system can be distilled to the construction of a certificate function that guarantees a desired property. However, algorithms for synthesizing certificate functions typically require a closed-form analytical expression of the underlying dynamics, which rules out their use on many modern robotic platforms. To circumvent this issue, we develop algorithms for learning certificate functions only from trajectory data. We establish bounds on the generalization error - the probability that a certificate will not certify a new, unseen trajectory - when learning from trajectories, and we convert such generalization error bounds into global stability guarantees. We demonstrate empirically that certificates for complex dynamics can be efficiently learned, and that the learned certificates can be used for downstream tasks such as adaptive control.

preprint · 2020 · arXiv

Robust Adaptive Control Barrier Functions: An Adaptive & Data-Driven Approach to Safety (Extended Version)

A new framework is developed for control of constrained nonlinear systems with structured parametric uncertainties. Forward invariance of a safe set is achieved through online parameter adaptation and data-driven model estimation. The new adaptive data-driven safety paradigm is merged with a recent adaptive control algorithm for systems nominally contracting in closed-loop. This unification is more general than other safety controllers as closed-loop contraction does not require the system be invertible or in a particular form. Additionally, the approach is less expensive than nonlinear model predictive control as it does not require a full desired trajectory, but rather only a desired terminal state. The approach is illustrated on the pitch dynamics of an aircraft with uncertain nonlinear aerodynamics.

preprint · 2016 · arXiv

Asymptotic Solution to the Rayleigh Problem of Dynamic Soaring

Albatrosses can travel a thousand kilometers daily over the oceans. This feat is achieved through dynamic soaring, a non-flapping flight strategy where propulsive energy is extracted from horizontal wind shears. Dynamic soaring has been described as a sequence of half-turns connecting upwind climbs and downwind dives through the surface shear layer. We analytically and numerically investigate the aerodynamically optimal flight trajectory for varying shear thicknesses. Contrary to current thinking, but consistent with GPS recordings of flying albatrosses, in thin shears the optimal trajectory is composed of small angle arcs. Essentially, the albatross is a flying sailboat, sequentially acting as sail and keel, and most efficient when remaining crosswind. Our analysis constitutes a general framework for dynamic soaring, and more broadly energy extraction in complex winds.

preprint · 2015 · arXiv

A contraction based, singular perturbation approach to near-decomposability in complex systems

We revisit the classical concept of near-decomposability in complex systems, introduced by Herbert Simon in his foundational article The Architecture of Complexity, by developing an explicit quantitative analysis based on singular perturbations and nonlinear contraction theory. Complex systems are often modular and hierarchic, and a central question is whether the whole system behaves approximately as the "sum of its parts", or whether feedbacks between modules qualitatively modify the modules' behavior, and perhaps also generate instabilities. We show that, when the individual nonlinear modules are contracting (i.e., forget their initial conditions exponentially), a critical separation of timescales exists between the dynamics of the modules and that of the macro system, below which it behaves approximately as the stable sum of its parts. Our analysis is fully nonlinear and provides explicit conditions and error bounds, thus both quantifying and qualifying existing results on near-decomposability.
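
The phrase "forget their initial conditions exponentially" can be checked numerically on a toy module. The scalar system below is a hypothetical example, not one from the paper: its Jacobian is -1 - 3x², which never exceeds -1, so any two trajectories driven by the same input should approach each other at least as fast as exp(-t).

```python
import math


def simulate(x0, dt=0.01, steps=500):
    """Forward-Euler integration of x' = -x - x**3 + sin(t) from x(0) = x0."""
    x, t = x0, 0.0
    for _ in range(steps):
        x += dt * (-x - x ** 3 + math.sin(t))
        t += dt
    return x


def final_gap(x0_a, x0_b, dt=0.01, steps=500):
    """Distance between two trajectories that share the same input."""
    return abs(simulate(x0_a, dt, steps) - simulate(x0_b, dt, steps))


if __name__ == "__main__":
    horizon = 0.01 * 500  # total simulated time
    gap0 = abs(2.0 - (-1.0))
    print("final gap        :", final_gap(2.0, -1.0))
    print("exponential bound:", gap0 * math.exp(-horizon))
```

The final gap stays under the exponential bound even though both trajectories keep moving with the sinusoidal input: contraction is a statement about the distance between trajectories, not about convergence to a fixed point.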

preprint · 2015 · arXiv

Combination Properties of Weakly Contracting Systems

A note on the property of weak contraction, which implies that all bounded solutions of a nonlinear system converge to a (possibly non-unique) equilibrium. We provide some simple results about interconnections of such systems, and a brief discussion.

preprint · 2014 · arXiv

Control Contraction Metrics, Robust Control and Observer Duality

This paper addresses the problems of stabilization, robust control, and observer design for nonlinear systems. We build upon a recently proposed method based on contraction theory and convex optimization, extending the class of systems to which it is applicable. We prove converse results for mechanical systems and feedback-linearizable systems. Next we consider robust control, and give a simple construction of a controller guaranteeing an L2-gain condition, and discuss connections to nonlinear H-infinity control. Finally, we discuss a "duality" result between nonlinear stabilization problems and observer construction, in the process constructing globally stable reduced-order observers for a class of nonlinear systems.

preprint · 2014 · arXiv

Output-Feedback Control of Nonlinear Systems using Control Contraction Metrics and Convex Optimization

Control contraction metrics (CCMs) are a new approach to nonlinear control design based on contraction theory. The resulting design problems are expressed as pointwise linear matrix inequalities and are well-suited to solution via convex optimization. In this paper, we extend the theory on CCMs by showing that a pair of "dual" observer and controller problems can be solved using pointwise linear matrix inequalities, and that when a solution exists a separation principle holds. That is, a stabilizing output-feedback controller can be found. The procedure is demonstrated using a benchmark problem of nonlinear control: the Moore-Greitzer jet engine compressor model.

preprint · 2013 · arXiv

Control Contraction Metrics and Universal Stabilizability

In this paper we introduce the concept of universal stabilizability: the condition that every solution of a nonlinear system can be globally stabilized. We give sufficient conditions in terms of the existence of a control contraction metric, which can be found by solving a pointwise linear matrix inequality. Extensions to approximate optimal control are straightforward. The conditions we give are necessary and sufficient for linear systems and certain classes of nonlinear systems, and have interesting connections to the theory of control Lyapunov functions.

preprint · 2013 · arXiv

Transverse Contraction Criteria for Existence, Stability, and Robustness of a Limit Cycle

This paper derives a differential contraction condition for the existence of an orbitally-stable limit cycle in an autonomous system. This transverse contraction condition can be represented as a pointwise linear matrix inequality (LMI), thus allowing convex optimization tools such as sum-of-squares programming to be used to search for certificates of the existence of a stable limit cycle. Many desirable properties of contracting dynamics are extended to this context, including preservation of contraction under a broad class of interconnections. In addition, by introducing the concepts of differential dissipativity and transverse differential dissipativity, contraction and transverse contraction can be established for large scale systems via LMI conditions on component subsystems.

preprint · 2011 · arXiv

Application of Synchronization to Formation Flying Spacecraft: Lagrangian Approach

This article presents a unified synchronization framework with application to precision formation flying spacecraft. Central to the proposed innovation, in applying synchronization to both translational and rotational dynamics in the Lagrangian form, is the use of the distributed stability and performance analysis tool called contraction analysis, which yields exact nonlinear stability proofs. The proposed decentralized tracking control law synchronizes the attitude of an arbitrary number of spacecraft into a common time-varying trajectory with global exponential convergence. Moreover, a decentralized translational tracking control law based on phase synchronization is presented, thus enabling coupled translational and rotational maneuvers. While the translational dynamics can be adequately controlled by linear control laws, the proposed method permits highly nonlinear systems with nonlinearly coupled inertia matrices, such as the attitude dynamics of spacecraft whose large and rapid slew maneuvers justify the nonlinear control approach. The proposed method integrates both the trajectory tracking and synchronization problems in a single control framework.

preprint · 2011 · arXiv

Symmetries, Stability, and Control in Nonlinear Systems and Networks

This paper discusses the interplay of symmetries and stability in the analysis and control of nonlinear dynamical systems and networks. Specifically, it combines standard results on symmetries and equivariance with recent convergence analysis tools based on nonlinear contraction theory and virtual dynamical systems. This synergy between structural properties (symmetries) and convergence properties (contraction) is illustrated in the contexts of network motifs arising e.g. in genetic networks, of invariance to environmental symmetries, and of imposing different patterns of synchrony in a network.

preprint · 2010 · arXiv

Global convergence of quorum-sensing networks

In many natural synchronization phenomena, communication between individual elements occurs not directly, but rather through the environment. One of these instances is bacterial quorum sensing, where bacteria release signaling molecules in the environment which in turn are sensed and used for population coordination. Extending this motivation to a general nonlinear dynamical system context, this paper analyzes synchronization phenomena in networks where communication and coupling between nodes are mediated by shared dynamical quantities, typically provided by the nodes' environment. Our model includes the case when the dynamics of the shared variables themselves cannot be neglected or indeed play a central part. Applications to examples from systems biology illustrate the approach.
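
Coupling through a shared environmental variable can be sketched with a toy network. Everything in this example is an assumption made for illustration (the cubic node dynamics, the first-order environment model, the gain `k` and the function names); it is not the model studied in the paper. Each node only senses the common signal `z`, and `z` has dynamics of its own driven by the population mean.

```python
def simulate_quorum(x0, k=3.0, dt=0.01, steps=1000):
    """Euler-integrate nodes that interact only through a shared variable z."""
    x = list(x0)
    z = 0.0
    n = len(x)
    for _ in range(steps):
        mean_x = sum(x) / n
        # Node dynamics: x_i' = x_i - x_i**3 + k * (z - x_i).
        # Every node receives the same signal z and never sees its neighbours.
        new_x = [xi + dt * (xi - xi ** 3 + k * (z - xi)) for xi in x]
        # Environment dynamics: z' = -z + mean(x), so z is not a static average.
        z += dt * (-z + mean_x)
        x = new_x
    return x, z


def spread(values):
    """Largest pairwise distance between node states."""
    return max(values) - min(values)


if __name__ == "__main__":
    start = [-2.0, -0.5, 0.3, 1.7]
    final, z = simulate_quorum(start)
    print("initial spread:", spread(start))
    print("final spread  :", spread(final))
```

With `k` larger than the maximum slope of the uncoupled node dynamics (here 1), each node is a contracting system driven by the same input `z`, so the node states converge to one another regardless of what `z` itself does.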

preprint · 2010 · arXiv

Shaping state and time-dependent convergence rates in non-linear control and observer design

This paper derives, for non-linear, time-varying and feedback-linearizable systems, simple controller designs that achieve specified state- and time-dependent complex convergence rates. This approach can be regarded as a general gain-scheduling technique with a global exponential stability guarantee. Typical applications include the transonic control of an aircraft with strongly Mach- or time-dependent eigenvalues or the state-dependent complex eigenvalue placement of the inverted pendulum. As a generalization of the LTI Luenberger observer, a dual observer design technique is derived for a broad set of non-linear and time-varying systems, where so far straightforward observer techniques were not known. The resulting observer design is illustrated for non-linear chemical plants, the Van der Pol oscillator, the discrete logarithmic map series prediction and the lighthouse navigation problem. These results [23] allow one to shape globally the state- and time-dependent convergence behaviour ideally suited to the non-linear or time-varying system. The technique can also be used to provide analytic robustness guarantees against modelling uncertainties. The derivations are based on non-linear contraction theory [18], a comparatively recent dynamic system analysis tool whose results will be reviewed and extended.
