Source author record

Jorge Cortés

Jorge Cortés appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC Systems and Control eess.SY math.DS Robotics astro-ph.HE Machine Learning math.AP Neurons and Cognition

Catalog footprint

What is connected

14works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions

Establishing stability certificates for closed-loop systems under reinforcement learning (RL) policies is essential to move beyond empirical performance and offer guarantees of system behavior. Classical Lyapunov methods require a strict stepwise decrease in the Lyapunov function but such certificates are difficult to construct for learned policies. The RL value function is a natural candidate but it is not well understood how it can be adapted for this purpose. To gain intuition, we first study the linear quadratic regulator (LQR) problem and make two key observations. First, a Lyapunov function can be obtained from the value function of an LQR policy by augmenting it with a residual term related to the system dynamics and stage cost. Second, the classical Lyapunov decrease requirement can be relaxed to a generalized Lyapunov condition requiring only decrease on average over multiple time steps. Using this intuition, we consider the nonlinear setting and formulate an approach to learn generalized Lyapunov functions by augmenting RL value functions with neural network residual terms. Our approach successfully certifies the stability of RL policies trained on Gymnasium and DeepMind Control benchmarks. We also extend our method to jointly train neural controllers and stability certificates using a multi-step Lyapunov loss, resulting in larger certified inner approximations of the region of attraction compared to the classical Lyapunov approach. Overall, our formulation enables stability certification for a broad class of systems with learned policies by making certificates easier to construct, thereby bridging classical control theory and modern learning-based methods.

preprint2026arXiv

Gradient sampling algorithm for subsmooth functions

This paper considers non-smooth optimization problems where we seek to minimize the pointwise maximum of a continuously parameterized family of functions. Since the objective function is given as the solution to a maximization problem, neither its values nor its gradients are available in closed form, which calls for approximation. Our approach hinges upon extending the so-called gradient sampling algorithm, which approximates the Clarke generalized gradient of the objective function at a point by sampling its derivative at nearby locations. This allows us to select descent directions around points where the function may fail to be differentiable and establish algorithm convergence to a stationary point from any initial condition. Our key contribution is to prove this convergence by alleviating the requirement on continuous differentiability of the objective function on an open set of full measure. We further provide assumptions under which a desired convex subset of the decision space is rendered attractive for the iterates of the algorithm.

preprint2023arXiv

Feasibility Analysis and Regularity Characterization of Distributionally Robust Safe Stabilizing Controllers

This paper studies the well-posedness and regularity of safe stabilizing optimization-based controllers for control-affine systems in the presence of model uncertainty. When the system dynamics contain unknown parameters, a finite set of samples can be used to formulate distributionally robust versions of control barrier function and control Lyapunov function constraints. Control synthesis with such distributionally robust constraints can be achieved by solving a (convex) second-order cone program (SOCP). We provide one necessary and two sufficient conditions to check the feasibility of such optimization problems, characterize their computational complexity and numerically show that they are significantly faster to check than direct use of SOCP solvers. Finally, we also analyze the regularity of the resulting control laws.

preprint2022arXiv

Global Kinetic Modeling of the Intrabinary Shock in Spider Pulsars

Spider pulsars are compact binary systems composed of a millisecond pulsar and a low-mass companion. The relativistic magnetically-dominated pulsar wind impacts onto the companion, ablating it and slowly consuming its atmosphere. The interaction forms an intrabinary shock, a proposed site of particle acceleration. We perform global fully-kinetic particle-in-cell simulations of the intrabinary shock, assuming that the pulsar wind consists of plane-parallel stripes of alternating polarity and that the shock wraps around the companion. We find that particles are efficiently accelerated via shock-driven reconnection. We extract first-principles synchrotron spectra and lightcurves which are in good agreement with X-ray observations: (1) the synchrotron spectrum is nearly flat, $F_ν\propto {\rm const}$; (2) when the pulsar spin axis is nearly aligned with the orbital angular momentum, the light curve displays two peaks, just before and after the pulsar eclipse (pulsar superior conjunction), separated in phase by $\sim 0.8\, {\rm rad}$; (3) the peak flux exceeds the one at inferior conjunction by a factor of ten. We demonstrate that the double-peaked signature in the lightcurve is due to Doppler boosting in the post-shock flow.

preprint2022arXiv

Learning Local Volt/Var Controllers Towards Efficient Network Operation with Stability Guarantees

This paper considers the problem of voltage regulation in distribution networks. The primary motivation is to keep voltages within preassigned operating limits by commanding the reactive power output of distributed energy resources (DERs) deployed in the grid. We develop a framework for developing local Volt/Var control that comprises two main steps. In the first, by exploiting historical data and for each DER, we learn a function representing the desirable equilibrium points for the power network. These points approximate solutions of an Optimal Power Flow (OPF) problem. In the second, we propose a control scheme for steering the network towards these favorable configurations. Theoretical conditions are derived to formally guarantee the stability of the developed control scheme, and numerical simulations illustrate the effectiveness of the proposed approach.

preprint2022arXiv

Selective Inhibition and Recruitment of Linear-Threshold Thalamocortical Networks

Neuroscientific evidence shows that for most brain networks all pathways between cortical regions either pass through the thalamus or a transthalamic parallel route exists for any direct corticocortical connection. This paper seeks to formally study the dynamical behavior of the resulting thalamocortical brain networks with a view to characterizing the inhibitory role played by the thalamus and its benefits. We employ a linear-threshold mesoscale model for individual brain subnetworks and study both hierarchical and star-connected thalamocortical networks. Using tools from singular perturbation theory and switched systems, we show that selective inhibition and recruitment can be achieved in such networks through a combination of feedback and feedforward control. Various simulations throughout the exposition illustrate the benefits resulting from the presence of the thalamus regarding failsafe mechanisms, required control magnitude, and network performance.

preprint2021arXiv

Learning Barrier Functions with Memory for Robust Safe Navigation

Control barrier functions are widely used to enforce safety properties in robot motion planning and control. However, the problem of constructing barrier functions online and synthesizing safe controllers that can deal with the associated uncertainty has received little attention. This paper investigates safe navigation in unknown environments, using onboard range sensing to construct control barrier functions online. To represent different objects in the environment, we use the distance measurements to train neural network approximations of the signed distance functions incrementally with replay memory. This allows us to formulate a novel robust control barrier safety constraint which takes into account the error in the estimated distance fields and its gradient. Our formulation leads to a second-order cone program, enabling safe and stable control synthesis in a priori unknown environments.

preprint2021arXiv

Learning Koopman Eigenfunctions and Invariant Subspaces from Data: Symmetric Subspace Decomposition

This paper develops data-driven methods to identify eigenfunctions of the Koopman operator associated to a dynamical system and subspaces that are invariant under the operator. We build on Extended Dynamic Mode Decomposition (EDMD), a data-driven method that finds a finite-dimensional approximation of the Koopman operator on the span of a predefined dictionary of functions. We propose a necessary and sufficient condition to identify Koopman eigenfunctions based on the application of EDMD forward and backward in time. Moreover, we propose the Symmetric Subspace Decomposition (SSD) algorithm, an iterative method which provably identifies the maximal Koopman-invariant subspace and the Koopman eigenfunctions in the span of the dictionary. We also introduce the Streaming Symmetric Subspace Decomposition (SSSD) algorithm, an online extension of SSD that only requires a small, fixed memory and incorporates new data as is received. Finally, we propose an extension of SSD that approximates Koopman eigenfunctions and invariant subspaces when the dictionary does not contain sufficient informative eigenfunctions.

preprint2020arXiv

Dynamics of Data-driven Ambiguity Sets for Hyperbolic Conservation Laws with Uncertain Inputs

Ambiguity sets of probability distributions are used to hedge against uncertainty about the true probabilities of random quantities of interest (QoIs). When available, these ambiguity sets are constructed from both data (collected at the initial time and along the boundaries of the physical domain) and concentration-of-measure results on the Wasserstein metric. To propagate the ambiguity sets into the future, we use a physics-dependent equation governing the evolution of cumulative distribution functions (CDF) obtained through the method of distributions. This study focuses on the latter step by investigating the spatio-temporal evolution of data-driven ambiguity sets and their associated guarantees when the random QoIs they describe obey hyperbolic partial-differential equations with random inputs. For general nonlinear hyperbolic equations with smooth solutions, the CDF equation is used to propagate the upper and lower envelopes of pointwise ambiguity bands. For linear dynamics, the CDF equation allows us to construct an evolution equation for tighter ambiguity balls. We demonstrate that, in both cases, the ambiguity sets are guaranteed to contain the true (unknown) distributions within a prescribed confidence.

preprint2020arXiv

Event-Triggered Stabilization of Nonlinear Systems with Time-Varying Sensing and Actuation Delay

This paper studies the problem of stabilization of a nonlinear system with time-varying delays in both sensing and actuation using event-triggered control. Our proposed strategy seeks to opportunistically minimize the number of control updates while guaranteeing stabilization and builds on predictor feedback to compensate for arbitrarily large known time-varying delays. We establish, using a Lyapunov approach, the global asymptotic stability of the closed-loop system as long as the open-loop system is globally input-to-state stabilizable in the absence of time delays and sampling. We further prove that the proposed event-triggered law has inter-event times that are uniformly lower bounded and hence does not exhibit Zeno behavior. For the particular case of a stabilizable linear system, we show global exponential stability of the closed-loop system and analyze the trade-off between the rate of exponential convergence and a bound on the sampling frequency. We illustrate these results in simulation and also examine the properties of the proposed event-triggered strategy beyond the class of systems for which stabilization can be guaranteed.

preprint2020arXiv

Resource-Aware Discretization of Accelerated Optimization Flows

This paper tackles the problem of discretizing accelerated optimization flows while retaining their convergence properties. Inspired by the success of resource-aware control in developing efficient closed-loop feedback implementations on digital systems, we view the last sampled state of the system as the resource to be aware of. The resulting variable-stepsize discrete-time algorithms retain by design the desired decrease of the Lyapunov certificate of their continuous-time counterparts. Our algorithm design employs various concepts and techniques from resource-aware control that, in the present context, have interesting parallelisms with the discrete-time implementation of optimization algorithms. These include derivative- and performance-based triggers to monitor the evolution of the Lyapunov function as a way of determining the algorithm stepsize, exploiting sampled information to enhance algorithm performance, and employing high-order holds using more accurate integrators of the original dynamics. Throughout the paper, we illustrate our approach on a newly introduced continuous-time dynamics termed heavy-ball dynamics with displaced gradient, but the ideas proposed here have broad applicability to other globally asymptotically stable flows endowed with a Lyapunov certificate.

preprint2016arXiv

Differentially Private Distributed Convex Optimization via Functional Perturbation

We study a class of distributed convex constrained optimization problems where a group of agents aim to minimize the sum of individual objective functions while each desires that any information about its objective function is kept private. We prove the impossibility of achieving differential privacy using strategies based on perturbing the inter-agent messages with noise when the underlying noise-free dynamics are asymptotically stable. This justifies our algorithmic solution based on the perturbation of individual functions with Laplace noise. To this end, we establish a general framework for differentially private handling of functional data. We further design post-processing steps that ensure the perturbed functions regain the smoothness and convexity properties of the original functions while preserving the differentially private guarantees of the functional perturbation step. This methodology allows us to use any distributed coordination algorithm to solve the optimization problem on the noisy functions. Finally, we explicitly bound the magnitude of the expected distance between the perturbed and true optimizers which leads to an upper bound on the privacy-accuracy trade-off curve. Simulations illustrate our results.

preprint2016arXiv

Distributed saddle-point subgradient algorithms with Laplacian averaging

We present distributed subgradient methods for min-max problems with agreement constraints on a subset of the arguments of both the convex and concave parts. Applications include constrained minimization problems where each constraint is a sum of convex functions in the local variables of the agents. In the latter case, the proposed algorithm reduces to primal-dual updates using local subgradients and Laplacian averaging on local copies of the multipliers associated to the global constraints. For the case of general convex-concave saddle-point problems, our analysis establishes the convergence of the running time-averages of the local estimates to a saddle point under periodic connectivity of the communication digraphs. Specifically, choosing the gradient step-sizes in a suitable way, we show that the evaluation error is proportional to $1/\sqrt{t}$, where $t$ is the iteration step. We illustrate our results in simulation for an optimization scenario with nonlinear constraints coupling the decisions of agents that cannot communicate directly.

preprint2016arXiv

Gramian-based reachability metrics for bilinear networks

This paper studies Gramian-based reachability metrics for bilinear control systems. In the context of complex networks, bilinear systems capture scenarios where an actuator not only can affect the state of a node but also interconnections among nodes. Under the assumption that the input's infinity norm is bounded by some function of the network dynamic matrices, we derive a Gramian-based lower bound on the minimum input energy required to steer the state from the origin to any reachable target state. This result motivates our study of various objects associated to the reachability Gramian to quantify the ease of controllability of the bilinear network: the minimum eigenvalue (worst-case minimum input energy to reach a state), the trace (average minimum input energy to reach a state), and its determinant (volume of the ellipsoid containing the reachable states using control inputs with no more than unit energy). We establish an increasing returns property of the reachability Gramian as a function of the actuators, which in turn allows us to derive a general lower bound on the reachability metrics in terms of the aggregate contribution of the individual actuators. We conclude by examining the effect on the worst-case minimum input energy of the addition of bilinear inputs to difficult-to-control linear symmetric networks. We show that the bilinear networks resulting from the addition of either inputs at a finite number of interconnections or at all self loops with weight vanishing with the network scale remain difficult-to-control. Various examples illustrate our results.

Jorge Cortés

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions

Gradient sampling algorithm for subsmooth functions

Feasibility Analysis and Regularity Characterization of Distributionally Robust Safe Stabilizing Controllers

Global Kinetic Modeling of the Intrabinary Shock in Spider Pulsars

Learning Local Volt/Var Controllers Towards Efficient Network Operation with Stability Guarantees

Selective Inhibition and Recruitment of Linear-Threshold Thalamocortical Networks

Learning Barrier Functions with Memory for Robust Safe Navigation

Learning Koopman Eigenfunctions and Invariant Subspaces from Data: Symmetric Subspace Decomposition

Dynamics of Data-driven Ambiguity Sets for Hyperbolic Conservation Laws with Uncertain Inputs

Event-Triggered Stabilization of Nonlinear Systems with Time-Varying Sensing and Actuation Delay

Resource-Aware Discretization of Accelerated Optimization Flows

Differentially Private Distributed Convex Optimization via Functional Perturbation

Distributed saddle-point subgradient algorithms with Laplacian averaging

Gramian-based reachability metrics for bilinear networks