Source author record

Kejun Li

Kejun Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Robotics astro-ph.SR cond-mat.mes-hall eess.SY Human-Computer Interaction Machine Learning quant-ph Systems and Control

Catalog footprint

What is connected

8works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2025arXiv

CLF-RL: Control Lyapunov Function Guided Reinforcement Learning

Reinforcement learning (RL) has shown promise in generating robust locomotion policies for bipedal robots, but often suffers from tedious reward design and sensitivity to poorly shaped objectives. In this work, we propose a structured reward shaping framework that leverages model-based trajectory generation and control Lyapunov functions (CLFs) to guide policy learning. We explore two model-based planners for generating reference trajectories: a reduced-order linear inverted pendulum (LIP) model for velocity-conditioned motion planning, and a precomputed gait library based on hybrid zero dynamics (HZD) using full-order dynamics. These planners define desired end-effector and joint trajectories, which are used to construct CLF-based rewards that penalize tracking error and encourage rapid convergence. This formulation provides meaningful intermediate rewards, and is straightforward to implement once a reference is available. Both the reference trajectories and CLF shaping are used only during training, resulting in a lightweight policy at deployment. We validate our method both in simulation and through extensive real-world experiments on a Unitree G1 robot. CLF-RL demonstrates significantly improved robustness relative to the baseline RL policy and better performance than a classic tracking reward RL formulation.

preprint2022arXiv

Natural Multicontact Walking for Robotic Assistive Devices via Musculoskeletal Models and Hybrid Zero Dynamics

Generating stable walking gaits that yield natural locomotion when executed on robotic-assistive devices is a challenging task that often requires hand-tuning by domain experts. This paper presents an alternative methodology, where we propose the addition of musculoskeletal models directly into the gait generation process to intuitively shape the resulting behavior. In particular, we construct a multi-domain hybrid system model that combines the system dynamics with muscle models to represent natural multicontact walking. Provably stable walking gaits can then be generated for this model via the hybrid zero dynamics (HZD) method. We experimentally apply our integrated framework towards achieving multicontact locomotion on a dual-actuated transfemoral prosthesis, AMPRO3, for two subjects. The results demonstrate that enforcing muscle model constraints produces gaits that yield natural locomotion (as analyzed via comparison to motion capture data and electromyography). Moreover, gaits generated with our framework were strongly preferred by the non-disabled prosthetic users as compared to gaits generated with the nominal HZD method, even with the use of systematic tuning methods. We conclude that the novel approach of combining robotic walking methods (specifically HZD) with muscle models successfully generates anthropomorphic robotic-assisted locomotion.

preprint2022arXiv

Nuclear spin polarization and control in a van der Waals material

Van der Waals layered materials are a focus of materials research as they support strong quantum effects and can easily form heterostructures. Electron spins in van der Waals materials played crucial roles in many recent breakthroughs, including topological insulators, two-dimensional (2D) magnets, and spin liquids. However, nuclear spins in van der Waals materials remain an unexplored quantum resource. Here we report the first demonstration of optical polarization and coherent control of nuclear spins in a van der Waals material at room temperature. We use negatively-charged boron vacancy ($V_B^-$) spin defects in hexagonal boron nitride to polarize nearby nitrogen nuclear spins. Remarkably, we observe the Rabi frequency of nuclear spins at the excited-state level anti-crossing of $V_B^-$ defects to be 350 times larger than that of an isolated nucleus, and demonstrate fast coherent control of nuclear spins. We also detect strong electron-mediated nuclear-nuclear spin coupling that is 5 orders of magnitude larger than the direct nuclear spin dipolar coupling, enabling multi-qubit operations. Nitrogen nuclear spins in a triangle lattice will be suitable for large-scale quantum simulation. Our work opens a new frontier with nuclear spins in van der Waals materials for quantum information science and technology.

preprint2022arXiv

POLAR: Preference Optimization and Learning Algorithms for Robotics

Parameter tuning for robotic systems is a time-consuming and challenging task that often relies on domain expertise of the human operator. Moreover, existing learning methods are not well suited for parameter tuning for many reasons including: the absence of a clear numerical metric for `good robotic behavior'; limited data due to the reliance on real-world experimental data; and the large search space of parameter combinations. In this work, we present an open-source MATLAB Preference Optimization and Learning Algorithms for Robotics toolbox (POLAR) for systematically exploring high-dimensional parameter spaces using human-in-the-loop preference-based learning. This aim of this toolbox is to systematically and efficiently accomplish one of two objectives: 1) to optimize robotic behaviors for human operator preference; 2) to learn the operator's underlying preference landscape to better understand the relationship between adjustable parameters and operator preference. The POLAR toolbox achieves these objectives using only subjective feedback mechanisms (pairwise preferences, coactive feedback, and ordinal labels) to infer a Bayesian posterior over the underlying reward function dictating the user's preferences. We demonstrate the performance of the toolbox in simulation and present various applications of human-in-the-loop preference-based learning.

preprint2022arXiv

Safety-Aware Preference-Based Learning for Safety-Critical Control

Bringing dynamic robots into the wild requires a tenuous balance between performance and safety. Yet controllers designed to provide robust safety guarantees often result in conservative behavior, and tuning these controllers to find the ideal trade-off between performance and safety typically requires domain expertise or a carefully constructed reward function. This work presents a design paradigm for systematically achieving behaviors that balance performance and robust safety by integrating safety-aware Preference-Based Learning (PBL) with Control Barrier Functions (CBFs). Fusing these concepts -- safety-aware learning and safety-critical control -- gives a robust means to achieve safe behaviors on complex robotic systems in practice. We demonstrate the capability of this design paradigm to achieve safe and performant perception-based autonomous operation of a quadrupedal robot both in simulation and experimentally on hardware.

preprint2021arXiv

ROIAL: Region of Interest Active Learning for Characterizing Exoskeleton Gait Preference Landscapes

Characterizing what types of exoskeleton gaits are comfortable for users, and understanding the science of walking more generally, require recovering a user's utility landscape. Learning these landscapes is challenging, as walking trajectories are defined by numerous gait parameters, data collection from human trials is expensive, and user safety and comfort must be ensured. This work proposes the Region of Interest Active Learning (ROIAL) framework, which actively learns each user's underlying utility function over a region of interest that ensures safety and comfort. ROIAL learns from ordinal and preference feedback, which are more reliable feedback mechanisms than absolute numerical scores. The algorithm's performance is evaluated both in simulation and experimentally for three non-disabled subjects walking inside of a lower-body exoskeleton. ROIAL learns Bayesian posteriors that predict each exoskeleton user's utility landscape across four exoskeleton gait parameters. The algorithm discovers both commonalities and discrepancies across users' gait preferences and identifies the gait parameters that most influenced user feedback. These results demonstrate the feasibility of recovering gait utility landscapes from limited human trials.

preprint2010arXiv

Relationship between group sunspot number and Wolf sunspot number

Continuous wavelet transform and cross-wavelet transform have been used to investigate the phase periodicity and synchrony of the monthly mean Wolf ($R_{z}$) and group ($R_{g}$) sunspot numbers during the period of June 1795 to December 1995. The Schwabe cycle is the only one common period in Rg and Rz, but it is not well-defined in case of cycles 5-7 of Rg and in case of cycles 5 and 6 of $R_{z}$. In fact, the Schwabe period is slightly different in $R_{g}$ and $R_{z}$ before cycle 12, but from cycle 12 onwards it is almost the same for the two time series. Asynchrony of the two time series is more obviously seen in cycles 5 and 6 than in the following cycles, and usually more obviously seen around the maximum time of a cycle than during the rest of the cycle. $R_{g}$ is found to fit $R_{z}$ better in both amplitudes and peak epoch during the minimum time time of a solar cycle than during the maximum time of the cycle, which should be caused by their different definition, and around the maximum time of a cycle, $R_{g}$ is usually less than $R_{z}$. Asynchrony of $R_{g}$ and $R_{z}$ should somewhat agree with different sunspot cycle characteristics exhibited by themselves.

preprint2010arXiv

The Phase Shifts of the Paired Wings of Butterfly Diagrams

Sunspot groups observed by Royal Greenwich Observatory/US Air Force/NOAA from May 1874 to November 2008 and the Carte Synoptique solar filaments from March 1919 to December 1989 are used to investigate the relative phase shift of the paired wings of butterfly diagrams of sunspot and filament activities. Latitudinal migration of sunspot groups (or filaments) does asynchronously occur in the northern and southern hemispheres, and there is a relative phase shift between the paired wings of their butterfly diagrams in a cycle, making the paired wings spatially asymmetrical on the solar equator. It is inferred that hemispherical solar activity strength should evolve in a similar way within the paired wings of a butterfly diagram in a cycle, making the paired wings just and only keep the phase relationship between the northern and southern hemispherical solar activity strengths, but a relative phase shift between the paired wings of a butterfly diagram should bring about an almost same relative phase shift of hemispheric solar activity strength.

Kejun Li

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

CLF-RL: Control Lyapunov Function Guided Reinforcement Learning

Natural Multicontact Walking for Robotic Assistive Devices via Musculoskeletal Models and Hybrid Zero Dynamics

Nuclear spin polarization and control in a van der Waals material

POLAR: Preference Optimization and Learning Algorithms for Robotics

Safety-Aware Preference-Based Learning for Safety-Critical Control

ROIAL: Region of Interest Active Learning for Characterizing Exoskeleton Gait Preference Landscapes

Relationship between group sunspot number and Wolf sunspot number

The Phase Shifts of the Paired Wings of Butterfly Diagrams