Researcher profile

Fangzhou Liu

Fangzhou Liu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

Do You Have Freestyle? Expressive Humanoid Locomotion via Audio Control

Humans intuitively move to sound, but current humanoid robots lack expressive improvisational capabilities, confined to predefined motions or sparse commands. Generating motion from audio and then retargeting it to robots relies on explicit motion reconstruction, leading to cascaded errors, high latency, and disjointed acoustic-actuation mapping. We propose RoboPerform, the first unified audio-to-locomotion framework that can directly generate music-driven dance and speech-driven co-speech gestures from audio. Guided by the core principle of "motion = content + style", the framework treats audio as implicit style signals and eliminates the need for explicit motion reconstruction. RoboPerform integrates a ResMoE teacher policy for adapting to diverse motion patterns and a diffusion-based student policy for audio style injection. This retargeting-free design ensures low latency and high fidelity. Experimental validation shows that RoboPerform achieves promising results in physical plausibility and audio alignment, successfully transforming robots into responsive performers capable of reacting to audio.

preprint2022arXiv

Adaptive Observer for a Class of Systems with Switched Unknown Parameters Using DREM

In this note, we develop an adaptive observer for a class of nonlinear systems with switched unknown parameters to estimate the states and parameters simultaneously. The main challenge lies in how to eliminate the disturbance effect of zero-input responses caused by the switching on the parameter estimation. These responses depend on the unknown states at switching instants (SASI) and constitute an additive disturbance to the parameter estimation, which obstructs parameter convergence to zero. Our solution is to treat the zero-input responses as excitations instead of disturbances. This is realized by first augmenting the system parameter with the SASI and then developing an estimator for the augmented parameter using the \textit{dynamic regression extension and mixing} (DREM) technique. Thanks to its property of element-wise parameter adaptation, the system parameter estimation is decoupled from the SASI. As a result, the estimation errors of system states and parameters converge to zero asymptotically. Furthermore, the robustness of the proposed adaptive observer is guaranteed in the presence of disturbances and noise. A numerical example validates the effectiveness of the proposed approach.

preprint2022arXiv

Off-Policy Risk-Sensitive Reinforcement Learning Based Constrained Robust Optimal Control

This paper proposes an off-policy risk-sensitive reinforcement learning based control framework for stabilization of a continuous-time nonlinear system that subjects to additive disturbances, input saturation, and state constraints. By introducing pseudo controls and risk-sensitive input and state penalty terms, the constrained robust stabilization problem of the original system is converted into an equivalent optimal control problem of an auxiliary system. Then, aiming at the transformed optimal control problem, we adopt adaptive dynamic programming (ADP) implemented as a single critic structure to get the approximate solution to the value function of the Hamilton-Jacobi-Bellman (HJB) equation, which results in the approximate optimal control policy that is able to satisfy both input and state constraints under disturbances. By replaying experience data to the off-policy weight update law of the critic artificial neural network, the weight convergence is guaranteed. Moreover, to get experience data to achieve a sufficient excitation required for the weight convergence, online and offline algorithms are developed to serve as principled ways to record informative experience data. The equivalence proof demonstrates that the optimal control strategy of the auxiliary system robustly stabilizes the original system without violating input and state constraints. The proofs of system stability and weight convergence are provided. Simulation results reveal the validity of the proposed control framework.

preprint2021arXiv

Model-Free Incremental Adaptive Dynamic Programming Based Approximate Robust Optimal Regulation

This paper presents a new formulation for model-free robust optimal regulation of continuous-time nonlinear systems. The proposed reinforcement learning based approach, referred to as incremental adaptive dynamic programming (IADP), exploits measured data to allow the design of the approximate optimal incremental control strategy, which stabilizes the controlled system incrementally under model uncertainties, environmental disturbances, and input saturation. By leveraging the time delay estimation (TDE) technique, we first exploit sensory data to reduce the requirement of a complete dynamics, where measured data are adopted to construct an incremental dynamics that reflects the system evolution in an incremental form. Then, the resulting incremental dynamics serves to design the approximate optimal incremental control strategy based on adaptive dynamic programming, which is implemented as a simplified single critic structure to get the approximate solution to the value function of the Hamilton-Jacobi-Bellman equation. Furthermore, for the critic artificial neural network, experience data are used to design an off-policy weight update law with guaranteed weight convergence. Rather importantly, to address the unintentionally introduced TDE error, we incorporate a TDE error bound related term into the cost function, whereby the TDE error is attenuated during the optimization process. The system stability proof and the weight convergence proof are provided. Numerical simulations are conducted to validate the effectiveness and superiority of our proposed IADP, especially regarding the reduced control energy expenditure and the enhanced robustness.

preprint2020arXiv

Distributed Link Removal Strategy for Networked Meta-Population Epidemics and its Application to the Control of the COVID-19 Pandemic

In this paper, we investigate the distributed link removal strategy for networked meta-population epidemics. In particular, a deterministic networked susceptible-infected-recovered (SIR) model is considered to describe the epidemic evolving process. In order to curb the spread of epidemics, we present the spectrum-based optimization problem involving the Perron-Frobenius eigenvalue of the matrix constructed by the network topology and transition rates. A modified distributed link removal strategy is developed such that it can be applied to the SIR model with heterogeneous transition rates on weighted digraphs. The proposed approach is implemented to control the COVID-19 pandemic by using the reported infected and recovered data in each state of Germany. The numerical experiment shows that the infected percentage can be significantly reduced by using the distributed link removal strategy.

preprint2020arXiv

Interplay Between Homophily-Based Appraisal Dynamics and Influence-Based Opinion Dynamics: Modeling and Analysis

In social systems, the evolution of interpersonal appraisals and individual opinions are not independent processes but intertwine with each other. Despite extensive studies on both opinion dynamics and appraisal dynamics separately, no previous work has ever combined these two processes together. In this paper, we propose a novel and intuitive model on the interplay between homophily-based appraisal dynamics and influence-based opinion dynamics. We assume that individuals' opinions are updated via the influence network constructed from their interpersonal appraisals, which are in turn updated based on the individual opinions via the homophily mechanism. By theoretical analysis, we characterize the set of equilibria and some transient behavior of our model. Moreover, we establish the equivalence among the convergence of the appraisal network to social balance, the modulus consensus of individual opinions, and the non-vanishing appraisals. Monte Carlo validations further show that the non-vanishing appraisals condition holds for generic initial conditions. Compared with previous works that explain the emergence of social balance via person-to-person homophily mechanism, our model provides an alternative explanation in terms of the person-to-entity homophily mechanism. In addition, our model also describes how individuals' opinions on multiple irrelevant issues become correlated and converge to modulus consensus over time-varying influence networks.

preprint2020arXiv

On the Stability of the Endemic Equilibrium of A Discrete-Time Networked Epidemic Model

Networked epidemic models have been widely adopted to describe propagation phenomena. The endemic equilibrium of these models is of great significance in the field of viral marketing, innovation dissemination, and information diffusion. However, its stability conditions have not been fully explored. In this paper we study the stability of the endemic equilibrium of a networked Susceptible-Infected-Susceptible (SIS) epidemic model with heterogeneous transition rates in a discrete-time manner. We show that the endemic equilibrium, if it exists, is asymptotically stable for any nontrivial initial condition. Under mild assumptions on initial conditions, we further prove that during the spreading process there exists no overshoot with respect to the endemic equilibrium. Finally, we conduct numerical experiments on real-world networks to demonstrate our results.