Researcher profile

Jorge Cortés

Jorge Cortés contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions

Establishing stability certificates for closed-loop systems under reinforcement learning (RL) policies is essential to move beyond empirical performance and offer guarantees of system behavior. Classical Lyapunov methods require a strict stepwise decrease in the Lyapunov function but such certificates are difficult to construct for learned policies. The RL value function is a natural candidate but it is not well understood how it can be adapted for this purpose. To gain intuition, we first study the linear quadratic regulator (LQR) problem and make two key observations. First, a Lyapunov function can be obtained from the value function of an LQR policy by augmenting it with a residual term related to the system dynamics and stage cost. Second, the classical Lyapunov decrease requirement can be relaxed to a generalized Lyapunov condition requiring only decrease on average over multiple time steps. Using this intuition, we consider the nonlinear setting and formulate an approach to learn generalized Lyapunov functions by augmenting RL value functions with neural network residual terms. Our approach successfully certifies the stability of RL policies trained on Gymnasium and DeepMind Control benchmarks. We also extend our method to jointly train neural controllers and stability certificates using a multi-step Lyapunov loss, resulting in larger certified inner approximations of the region of attraction compared to the classical Lyapunov approach. Overall, our formulation enables stability certification for a broad class of systems with learned policies by making certificates easier to construct, thereby bridging classical control theory and modern learning-based methods.

preprint2026arXiv

Gradient sampling algorithm for subsmooth functions

This paper considers non-smooth optimization problems where we seek to minimize the pointwise maximum of a continuously parameterized family of functions. Since the objective function is given as the solution to a maximization problem, neither its values nor its gradients are available in closed form, which calls for approximation. Our approach hinges upon extending the so-called gradient sampling algorithm, which approximates the Clarke generalized gradient of the objective function at a point by sampling its derivative at nearby locations. This allows us to select descent directions around points where the function may fail to be differentiable and establish algorithm convergence to a stationary point from any initial condition. Our key contribution is to prove this convergence by alleviating the requirement on continuous differentiability of the objective function on an open set of full measure. We further provide assumptions under which a desired convex subset of the decision space is rendered attractive for the iterates of the algorithm.

preprint2023arXiv

Feasibility Analysis and Regularity Characterization of Distributionally Robust Safe Stabilizing Controllers

This paper studies the well-posedness and regularity of safe stabilizing optimization-based controllers for control-affine systems in the presence of model uncertainty. When the system dynamics contain unknown parameters, a finite set of samples can be used to formulate distributionally robust versions of control barrier function and control Lyapunov function constraints. Control synthesis with such distributionally robust constraints can be achieved by solving a (convex) second-order cone program (SOCP). We provide one necessary and two sufficient conditions to check the feasibility of such optimization problems, characterize their computational complexity and numerically show that they are significantly faster to check than direct use of SOCP solvers. Finally, we also analyze the regularity of the resulting control laws.

preprint2022arXiv

Global Kinetic Modeling of the Intrabinary Shock in Spider Pulsars

Spider pulsars are compact binary systems composed of a millisecond pulsar and a low-mass companion. The relativistic magnetically-dominated pulsar wind impacts onto the companion, ablating it and slowly consuming its atmosphere. The interaction forms an intrabinary shock, a proposed site of particle acceleration. We perform global fully-kinetic particle-in-cell simulations of the intrabinary shock, assuming that the pulsar wind consists of plane-parallel stripes of alternating polarity and that the shock wraps around the companion. We find that particles are efficiently accelerated via shock-driven reconnection. We extract first-principles synchrotron spectra and lightcurves which are in good agreement with X-ray observations: (1) the synchrotron spectrum is nearly flat, $F_ν\propto {\rm const}$; (2) when the pulsar spin axis is nearly aligned with the orbital angular momentum, the light curve displays two peaks, just before and after the pulsar eclipse (pulsar superior conjunction), separated in phase by $\sim 0.8\, {\rm rad}$; (3) the peak flux exceeds the one at inferior conjunction by a factor of ten. We demonstrate that the double-peaked signature in the lightcurve is due to Doppler boosting in the post-shock flow.

preprint2022arXiv

Learning Local Volt/Var Controllers Towards Efficient Network Operation with Stability Guarantees

This paper considers the problem of voltage regulation in distribution networks. The primary motivation is to keep voltages within preassigned operating limits by commanding the reactive power output of distributed energy resources (DERs) deployed in the grid. We develop a framework for developing local Volt/Var control that comprises two main steps. In the first, by exploiting historical data and for each DER, we learn a function representing the desirable equilibrium points for the power network. These points approximate solutions of an Optimal Power Flow (OPF) problem. In the second, we propose a control scheme for steering the network towards these favorable configurations. Theoretical conditions are derived to formally guarantee the stability of the developed control scheme, and numerical simulations illustrate the effectiveness of the proposed approach.

preprint2022arXiv

Selective Inhibition and Recruitment of Linear-Threshold Thalamocortical Networks

Neuroscientific evidence shows that for most brain networks all pathways between cortical regions either pass through the thalamus or a transthalamic parallel route exists for any direct corticocortical connection. This paper seeks to formally study the dynamical behavior of the resulting thalamocortical brain networks with a view to characterizing the inhibitory role played by the thalamus and its benefits. We employ a linear-threshold mesoscale model for individual brain subnetworks and study both hierarchical and star-connected thalamocortical networks. Using tools from singular perturbation theory and switched systems, we show that selective inhibition and recruitment can be achieved in such networks through a combination of feedback and feedforward control. Various simulations throughout the exposition illustrate the benefits resulting from the presence of the thalamus regarding failsafe mechanisms, required control magnitude, and network performance.

preprint2021arXiv

Learning Barrier Functions with Memory for Robust Safe Navigation

Control barrier functions are widely used to enforce safety properties in robot motion planning and control. However, the problem of constructing barrier functions online and synthesizing safe controllers that can deal with the associated uncertainty has received little attention. This paper investigates safe navigation in unknown environments, using onboard range sensing to construct control barrier functions online. To represent different objects in the environment, we use the distance measurements to train neural network approximations of the signed distance functions incrementally with replay memory. This allows us to formulate a novel robust control barrier safety constraint which takes into account the error in the estimated distance fields and its gradient. Our formulation leads to a second-order cone program, enabling safe and stable control synthesis in a priori unknown environments.

preprint2021arXiv

Learning Koopman Eigenfunctions and Invariant Subspaces from Data: Symmetric Subspace Decomposition

This paper develops data-driven methods to identify eigenfunctions of the Koopman operator associated to a dynamical system and subspaces that are invariant under the operator. We build on Extended Dynamic Mode Decomposition (EDMD), a data-driven method that finds a finite-dimensional approximation of the Koopman operator on the span of a predefined dictionary of functions. We propose a necessary and sufficient condition to identify Koopman eigenfunctions based on the application of EDMD forward and backward in time. Moreover, we propose the Symmetric Subspace Decomposition (SSD) algorithm, an iterative method which provably identifies the maximal Koopman-invariant subspace and the Koopman eigenfunctions in the span of the dictionary. We also introduce the Streaming Symmetric Subspace Decomposition (SSSD) algorithm, an online extension of SSD that only requires a small, fixed memory and incorporates new data as is received. Finally, we propose an extension of SSD that approximates Koopman eigenfunctions and invariant subspaces when the dictionary does not contain sufficient informative eigenfunctions.

preprint2020arXiv

Dynamics of Data-driven Ambiguity Sets for Hyperbolic Conservation Laws with Uncertain Inputs

Ambiguity sets of probability distributions are used to hedge against uncertainty about the true probabilities of random quantities of interest (QoIs). When available, these ambiguity sets are constructed from both data (collected at the initial time and along the boundaries of the physical domain) and concentration-of-measure results on the Wasserstein metric. To propagate the ambiguity sets into the future, we use a physics-dependent equation governing the evolution of cumulative distribution functions (CDF) obtained through the method of distributions. This study focuses on the latter step by investigating the spatio-temporal evolution of data-driven ambiguity sets and their associated guarantees when the random QoIs they describe obey hyperbolic partial-differential equations with random inputs. For general nonlinear hyperbolic equations with smooth solutions, the CDF equation is used to propagate the upper and lower envelopes of pointwise ambiguity bands. For linear dynamics, the CDF equation allows us to construct an evolution equation for tighter ambiguity balls. We demonstrate that, in both cases, the ambiguity sets are guaranteed to contain the true (unknown) distributions within a prescribed confidence.

preprint2020arXiv

Event-Triggered Stabilization of Nonlinear Systems with Time-Varying Sensing and Actuation Delay

This paper studies the problem of stabilization of a nonlinear system with time-varying delays in both sensing and actuation using event-triggered control. Our proposed strategy seeks to opportunistically minimize the number of control updates while guaranteeing stabilization and builds on predictor feedback to compensate for arbitrarily large known time-varying delays. We establish, using a Lyapunov approach, the global asymptotic stability of the closed-loop system as long as the open-loop system is globally input-to-state stabilizable in the absence of time delays and sampling. We further prove that the proposed event-triggered law has inter-event times that are uniformly lower bounded and hence does not exhibit Zeno behavior. For the particular case of a stabilizable linear system, we show global exponential stability of the closed-loop system and analyze the trade-off between the rate of exponential convergence and a bound on the sampling frequency. We illustrate these results in simulation and also examine the properties of the proposed event-triggered strategy beyond the class of systems for which stabilization can be guaranteed.

preprint2020arXiv

Resource-Aware Discretization of Accelerated Optimization Flows

This paper tackles the problem of discretizing accelerated optimization flows while retaining their convergence properties. Inspired by the success of resource-aware control in developing efficient closed-loop feedback implementations on digital systems, we view the last sampled state of the system as the resource to be aware of. The resulting variable-stepsize discrete-time algorithms retain by design the desired decrease of the Lyapunov certificate of their continuous-time counterparts. Our algorithm design employs various concepts and techniques from resource-aware control that, in the present context, have interesting parallelisms with the discrete-time implementation of optimization algorithms. These include derivative- and performance-based triggers to monitor the evolution of the Lyapunov function as a way of determining the algorithm stepsize, exploiting sampled information to enhance algorithm performance, and employing high-order holds using more accurate integrators of the original dynamics. Throughout the paper, we illustrate our approach on a newly introduced continuous-time dynamics termed heavy-ball dynamics with displaced gradient, but the ideas proposed here have broad applicability to other globally asymptotically stable flows endowed with a Lyapunov certificate.