Researcher profile

Juanjuan Xu

Juanjuan Xu contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 19 - Unverified | Verification L1 | Unclaimed author
5 works
0 followers
3 topics
4 close collaborators

Published work

5 published items

Preprint | 2023 | arXiv

Exact Controllability of Discrete-Time Stochastic System with Multiplicative Noise

This paper is concerned with the exact controllability of discrete-time stochastic systems, one of the basic problems of modern control theory. Though the exact controllability of continuous-time systems governed by Itô stochastic differential equations has been well studied (S. Peng, Progress in Natural Science, 1994), the discrete-time counterpart is still open due to the adaptiveness constraint on the controllers and the difficulty of solving stochastic difference equations with terminal values. The main contribution of this paper is to present both a Gramian matrix criterion and a rank criterion for the exact controllability of discrete-time stochastic systems. The novelty lies in the transformation of the forward stochastic difference equation into a novel backward one.

Preprint | 2023 | arXiv

Reinforcement Learning-Based Optimal Control for Multiplicative-Noise Systems with Input Delay

In this paper, the reinforcement learning (RL)-based optimal control problem is studied for multiplicative-noise systems with input delay, where part of the system dynamics is unknown. To solve a variant of the Riccati-ZXL equations, which is a counterpart of the standard Riccati equation and determines the optimal controller, we first develop a necessary and sufficient stabilizing condition in the form of several Lyapunov-type equations, a parallel to classical Lyapunov theory. Based on this condition, we provide an offline and convergent algorithm for the variant of the Riccati-ZXL equations. Building on the convergent algorithm, we propose an RL-based optimal control design approach for solving the linear quadratic regulation problem with partially unknown system dynamics. Finally, a numerical example is used to evaluate the proposed algorithm.

Preprint | 2022 | arXiv

Distributed Q-Learning for Stochastic LQ Control with Unknown Uncertainty

This paper studies a discrete-time stochastic control problem with linear quadratic criteria over an infinite-time horizon. We focus on a class of control systems whose system matrices are associated with random parameters whose statistical properties are unknown. In particular, we design a distributed Q-learning algorithm to tackle the Riccati equation and derive the optimal controller stabilizing the system. The key technique is to convert the problem of solving the Riccati equation into finding the zero point of a matrix equation, and to devise a distributed stochastic approximation method that computes estimates of the zero point. The convergence analysis proves that the distributed Q-learning algorithm eventually converges to the correct value. A numerical example illustrates that the distributed Q-learning algorithm converges asymptotically.

Preprint | 2020 | arXiv

A New Approach for Solving Delayed Forward and Backward Stochastic Differential Equations

This paper is concerned with the decoupling of delayed linear forward-backward stochastic differential equations (D-FBSDEs), which is much more involved than the delay-free case due to the infinite dimension caused by the delay. A new 'discretization' approach is proposed to obtain the explicit solution to the D-FBSDEs. First, we transform the continuous-time D-FBSDEs into discrete-time form by discretization. Second, we derive the solution of the discrete-time D-FBSDEs by applying backward iterative induction. Finally, the explicit solution of the continuous-time D-FBSDEs is obtained by taking the limit of the solution of the discrete-time form. The proposed approach can be applied to solve more general FBSDEs with delay, which would provide a complete solution to stochastic LQ control with time delay.

Preprint | 2020 | arXiv

The Difference and Unity of Irregular LQ Control and Standard LQ Control and Its Solution

Irregular linear quadratic (LQ) control, formerly called singular LQ control, has been a long-standing problem since the 1970s. This paper shows that a deterministic irregular LQ control problem is solvable for arbitrary initial value if and only if the LQ cost can be rewritten as a regular one by changing the terminal cost $x'(T)Hx(T)$ to $x'(T)[H+P_1(T)]x(T)$, while the optimal controller achieves $P_1(T)x(T)=0$ at the same time. In other words, the irregular controller (if it exists) must do two things at once: minimize the cost and achieve the terminal constraint $P_1(T)x(T)=0$. This clarifies the essential difference between irregular LQ control and standard LQ control, where the controller only minimizes the cost. With this breakthrough, we further study irregular LQ control for stochastic systems with multiplicative noise. A sufficient condition for solvability and the optimal controller are presented based on Riccati equations.