Researcher profile

Richard Linares

Richard Linares contributes to research discovery and scholarly infrastructure.

Trust 21 (Emerging) · Verification L1 · Unclaimed author · 15 works · 0 followers · 14 topics · 4 close collaborators

Published work

15 published items

preprint · 2026 · arXiv

Autonomous Reasoning for Spacecraft Control: A Large Language Model Framework with Group Relative Policy Optimization

This paper presents a learning-based guidance-and-control approach that couples a reasoning-enabled Large Language Model (LLM) with Group Relative Policy Optimization (GRPO). A two-stage procedure consisting of Supervised Fine-Tuning (SFT) to learn formatting and control primitives, followed by GRPO for interaction-driven policy improvement, trains controllers for each environment. The framework is demonstrated on four control problems spanning a gradient of dynamical complexity, from canonical linear systems through nonlinear oscillatory dynamics to three-dimensional spacecraft attitude control with gyroscopic coupling and thrust constraints. Results demonstrate that an LLM with explicit reasoning, optimized via GRPO, can synthesize feasible stabilizing policies under consistent training settings across both linear and nonlinear systems. The two-stage training methodology enables models to generate control sequences while providing human-readable explanations of their decision-making process. This work establishes a foundation for applying GRPO-based reasoning to autonomous control systems, with potential applications in aerospace and other safety-critical domains.

preprint · 2022 · arXiv

A Koopman Operator Tutorial with Orthogonal Polynomials

The Koopman Operator (KO) offers a promising alternative methodology to solve ordinary differential equations analytically. The solution of the dynamical system is analyzed in terms of observables, which are expressed as a linear combination of the eigenfunctions of the system. Coefficients are evaluated via the Galerkin method, using Legendre polynomials as a set of orthogonal basis functions. This tutorial provides a detailed analysis of the Koopman theory, followed by a rigorous explanation of the KO implementation in a computer environment, where a line-by-line description of a MATLAB code solves the Duffing oscillator application.

preprint · 2022 · arXiv

A Koopman-Operator Control Optimization for Relative Motion in Space

A high-order optimal control strategy implemented in the Koopman operator framework is proposed in this work. The new technique exploits the Koopman representation of the solution of the equations of motion to develop an energy-optimal inverse control methodology. The operator theory can reformulate a nonlinear dynamical system of finite dimension into a linear system with an infinite number of dimensions. As a result, the state of any nonlinear dynamics is represented as a linear combination of high-order orthogonal polynomials, which creates the state transition polynomial map of the solution. Since the optimal control technique can be reduced to a two-point boundary value problem, the Koopman map is used to connect the state and control variables in time, such that optimal values are obtained through map inversion and polynomial evaluation. The new technique is applied to rendezvous applications in space, where the relative motion between two satellites is modelled with a high-order polynomial series expansion of the Lagrangian of the system, such that the Clohessy-Wiltshire equations represent the reduction of the high-order model to a linear truncation.

preprint · 2022 · arXiv

A Method for Generating Closely Packed Orbital Shells and the Implication on Orbital Capacity

Shell-wise orbital slotting in Low Earth Orbit (LEO) can improve space safety, simplify space traffic coordination and management, and optimize orbital capacity. This paper describes two methods to generate 2D Lattice Flower Constellations (2D-LFCs) that are defined with respect to either an arbitrary degree or an arbitrary degree and order Earth geopotential. By generating shells that are quasi-periodic and frozen with respect to the Earth geopotential, it is possible to safely stack shells with vertical separation distances smaller than the osculating variation in semi-major axis of each shell or a corresponding Keplerian 2D-LFC propagated under an aspherical geopotential. This helps mitigate the single inclination per shell requirement in prior work by admitting more shells for a given orbital volume while retaining self-safe phasing in each shell. These methods exploit previous work on the Time Distribution Constellation formulation and designs of closed 2D-LFCs under arbitrary Earth geopotentials using repeating ground track orbits. Factors that influence the widths and shapes of these frozen shells are identified. Simplified formulas for estimating shell geometry and thickness are presented. It is shown that sequencing shells to group similar or ascending inclinations improves capacity versus arbitrary inclination ordering.

preprint · 2021 · arXiv

Safe and Uncertainty-Aware Robotic Motion Planning Techniques for Agile On-Orbit Assembly

As access to space and robotic autonomy capabilities move forward, there is simultaneously a growing interest in deploying large, complex space structures to provide new on-orbit capabilities. New space-borne observatories, large orbital outposts, and even futuristic on-orbit manufacturing will be enabled by robotic assembly of space structures using techniques like on-orbit additive manufacturing which can provide flexibility in constructing and even repairing complex hardware. However, the dynamics underlying the robotic assembler during manipulation may operate under inertial uncertainties. Thus, inertial estimation of the robot and the manipulated component system must be considered during structural assembly. The contribution of this work is to address both the motion planning and control for robotic assembly with consideration of the inertial estimation of the combined free-flying robotic assembler and additively manufactured component system. Specifically, the Linear Quadratic Regulator Rapidly-Exploring Randomized Trees (LQR-RRT*) and dynamically feasible path smoothing are used to obtain obstacle-free trajectories for the system. Further, model learning is incorporated explicitly into the planning stages via approximation of the continuous system and accompanying reward of performing safe, objective-oriented motion. Remaining uncertainty can then be dealt with using robust tube model predictive control. By obtaining controlled trajectories that consider both obstacle avoidance and learning of the inertial properties of the free-flyer and manipulated component system, the free-flyer rapidly considers and plans the construction of space structures with enhanced system knowledge. The approach naturally generalizes to repairing, refueling, and re-provisioning space structure components while providing optimal collision-free trajectories under e.g., inertial uncertainty.

preprint · 2020 · arXiv

A set of orbital elements to fully represent the zonal harmonics around an oblate celestial body

This work introduces a new set of orbital elements to fully represent the zonal harmonics problem around an oblate celestial body. This new set of orbital elements makes it possible to obtain a complete linear system for the unperturbed problem and, in addition, a complete polynomial system when considering the perturbation produced by the zonal harmonics from the gravitational force of an oblate celestial body. These orbital elements present no singularities and are able to represent any kind of orbit, including elliptic, parabolic and hyperbolic orbits. In addition, the Poincaré-Lindstedt perturbation method is applied to this formulation to obtain an approximate first-order solution of the problem for the case of the J2 perturbation.

preprint · 2020 · arXiv

Adaptive Generalized ZEM-ZEV Feedback Guidance for Planetary Landing via a Deep Reinforcement Learning Approach

Precision landing on large and small planetary bodies is a technology of utmost importance for future human and robotic exploration of the solar system. In this context, the Zero-Effort-Miss/Zero-Effort-Velocity (ZEM/ZEV) feedback guidance algorithm has been studied extensively and is still a field of active research. The algorithm, although powerful in terms of accuracy and ease of implementation, has some limitations. Therefore, in this paper we present an adaptive guidance algorithm based on classical ZEM/ZEV in which machine learning is used to overcome its limitations and create a closed-loop guidance algorithm that is sufficiently lightweight to be implemented on board spacecraft and flexible enough to adapt to the given constraint scenario. The adopted methodology is an actor-critic reinforcement learning algorithm that learns the parameters of the above-mentioned guidance architecture according to the given problem constraints.

preprint · 2020 · arXiv

Atmospheric Density Uncertainty Quantification for Satellite Conjunction Assessment

Conjunction assessment requires knowledge of the uncertainty in the predicted orbit. Errors in the atmospheric density are a major source of error in the prediction of low Earth orbits. Therefore, accurate estimation of the density and quantification of the uncertainty in the density is required. Most atmospheric density models, however, do not provide an estimate of the uncertainty in the density. In this work, we present a new approach to quantify uncertainties in the density and to include these for calculating the probability of collision Pc. For this, we employ a recently developed dynamic reduced-order density model that enables efficient prediction of the thermospheric density. First, the model is used to obtain accurate estimates of the density and of the uncertainty in the estimates. Second, the density uncertainties are propagated forward simultaneously with orbit propagation to include the density uncertainties for Pc calculation. For this, we account for the effect of cross-correlation in position uncertainties due to density errors on the Pc. Finally, the effect of density uncertainties and cross-correlation on the Pc is assessed. The presented approach provides the distinctive capability to quantify the uncertainty in atmospheric density and to include this uncertainty for conjunction assessment while taking into account the dependence of the density errors on location and time. In addition, the results show that it is important to consider the effect of cross-correlation on the Pc, because ignoring this effect can result in severe underestimation of the collision probability.

preprint · 2020 · arXiv

Autonomous Six-Degree-of-Freedom Spacecraft Docking Maneuvers via Reinforcement Learning

A policy for six-degree-of-freedom docking maneuvers is developed through reinforcement learning and implemented as a feedback control law. Reinforcement learning provides a potential framework for robust, autonomous maneuvers in uncertain environments with low on-board computational cost. Specifically, proximal policy optimization is used to produce a docking policy that is valid over a portion of the six-degree-of-freedom state-space while striving to minimize performance and control costs. Experiments using the simulated Apollo transposition and docking maneuver exhibit the policy's capabilities and provide a comparison with standard optimal control techniques. Furthermore, specific challenges and work-arounds, as well as the benefits and disadvantages of reinforcement learning for docking policies, are discussed to facilitate future research. As such, this work will serve as a foundation for further investigation of learning-based control laws for spacecraft proximity operations in uncertain environments.

preprint · 2020 · arXiv

Decentralized Control of Large Collaborative Swarms using Random Finite Set Theory

Controlling large swarms of robotic agents presents many challenges including, but not limited to, computational complexity due to a large number of agents, uncertainty in the functionality of each agent in the swarm, and uncertainty in the swarm's configuration. The contribution of this work is to decentralize Random Finite Set (RFS) control of large collaborative swarms for control of individual agents. The RFS control formulation assumes that the topology underlying the swarm control is complete and uses the complete graph in a centralized manner. To generalize the control topology in a localized or decentralized manner, sparse LQR is used to sparsify the RFS control gain matrix obtained using iterative LQR. This allows agents to use information of agents near each other (localized topology) or only the agent's own information (decentralized topology) to make a control decision. Sparsity and performance for decentralized RFS control are compared for different degrees of localization in feedback control gains which show that the stability and performance compared to centralized control do not degrade significantly in providing RFS control for large collaborative swarms.

preprint · 2020 · arXiv

Motion Planning and Control for On-Orbit Assembly using LQR-RRT* and Nonlinear MPC

Deploying large, complex space structures is of great interest to the modern scientific world as it can provide new capabilities in obtaining scientific, communicative, and observational information. However, many theoretical mission designs contain complexities that must be constrained by the requirements of the launch vehicle, such as volume and mass. To mitigate such constraints, the use of on-orbit additive manufacturing and robotic assembly allows for the flexibility of building large complex structures including telescopes, space stations, and communication satellites. The contribution of this work is to develop motion planning and control algorithms using the linear quadratic regulator and rapidly-exploring randomized trees (LQR-RRT*), path smoothing, and tracking the trajectory using a closed-loop nonlinear receding horizon control optimizer for a robotic Astrobee free-flyer. By obtaining controlled trajectories that consider obstacle avoidance and dynamics of the vehicle and manipulator, the free-flyer rapidly considers and plans the construction of space structures. The approach is a natural generalization to repairing, refueling, and re-provisioning space structure components while providing optimal collision-free trajectories during operation.

preprint · 2020 · arXiv

Six Degree-of-Freedom Body-Fixed Hovering over Unmapped Asteroids via LIDAR Altimetry and Reinforcement Meta-Learning

We optimize a six-degrees-of-freedom hovering policy using reinforcement meta-learning. The policy maps flash LIDAR measurements directly to on/off spacecraft body-frame thrust commands, allowing hovering at a fixed position and attitude in the asteroid body-fixed reference frame. Importantly, the policy does not require position and velocity estimates, and can operate in environments with unknown dynamics, and without an asteroid shape model or navigation aids. Indeed, during optimization the agent is confronted with a new randomly generated asteroid for each episode, ensuring that it does not learn an asteroid's shape, texture, or environmental dynamics. This allows the deployed policy to generalize well to novel asteroid characteristics, which we demonstrate in our experiments. Moreover, our experiments show that the optimized policy adapts to actuator failure and sensor noise. Although the policy is optimized using randomly generated synthetic asteroids, it is tested on two shape models from actual asteroids: Bennu and Itokawa. We find that the policy generalizes well to these shape models. The hovering controller has the potential to simplify mission planning by allowing asteroid body-fixed hovering immediately upon the spacecraft's arrival at an asteroid. This in turn simplifies shape model generation and allows resource mapping via remote sensing immediately upon arrival at the target asteroid.

preprint · 2019 · arXiv

Adaptive Guidance and Integrated Navigation with Reinforcement Meta-Learning

This paper proposes a novel adaptive guidance system developed using reinforcement meta-learning with a recurrent policy and value function approximator. The use of recurrent network layers allows the deployed policy to adapt in real time to environmental forces acting on the agent. We compare the performance of the DR/DV guidance law, an RL agent with a non-recurrent policy, and an RL agent with a recurrent policy in four challenging environments with unknown but highly variable dynamics. These tasks include a safe Mars landing with random engine failure and a landing on an asteroid with unknown environmental dynamics. We also demonstrate the ability of an RL meta-learning optimized policy to implement a guidance law using observations consisting of only Doppler radar altimeter readings in a Mars landing environment, and LIDAR altimeter readings in an asteroid landing environment, thus integrating guidance and navigation.

preprint · 2019 · arXiv

Real-Time Thermospheric Density Estimation Via Two-Line-Element Data Assimilation

Inaccurate estimates of the thermospheric density are a major source of error in low Earth orbit prediction. To improve orbit prediction, real-time density estimation is required. In this work, we develop a reduced-order dynamic model for the thermospheric density by computing the main spatial modes of the atmosphere and deriving a linear model for the dynamics. The model is then used to estimate the density using two-line element (TLE) data by simultaneously estimating the reduced-order modes and the orbits and ballistic coefficients of several objects using an unscented Kalman filter. Accurate density estimation using the TLEs of 17 objects is demonstrated and validated against CHAMP and GRACE accelerometer-derived densities. Finally, the use of the model for density forecasting is shown.

preprint · 2019 · arXiv

Seeker based Adaptive Guidance via Reinforcement Meta-Learning Applied to Asteroid Close Proximity Operations

Current practice for asteroid close proximity maneuvers requires extremely accurate characterization of the environmental dynamics and precise spacecraft positioning prior to the maneuver. This creates a delay of several months between the spacecraft's arrival and the ability to safely complete close proximity maneuvers. In this work we develop an adaptive integrated guidance, navigation, and control system that can complete these maneuvers in environments with unknown dynamics, with initial conditions spanning a large deployment region, and without a shape model of the asteroid. The system is implemented as a policy optimized using reinforcement meta-learning. The spacecraft is equipped with an optical seeker that locks to either a terrain feature, back-scattered light from a targeting laser, or an active beacon, and the policy maps observations consisting of seeker angles and LIDAR range readings directly to engine thrust commands. The policy implements a recurrent network layer that allows the deployed policy to adapt in real time to both environmental forces acting on the agent and internal disturbances such as actuator failure and center of mass variation. We validate the guidance system through simulated landing maneuvers in a six-degrees-of-freedom simulator. The simulator randomizes the asteroid's characteristics such as solar radiation pressure, density, spin rate, and nutation angle, requiring the guidance and control system to adapt to the environment. We also demonstrate robustness to actuator failure, sensor bias, and changes in the spacecraft's center of mass and inertia tensor. Finally, we suggest a concept of operations for asteroid close proximity maneuvers that is compatible with the guidance system.