Researcher profile

Jiaji Zhang

Jiaji Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2022arXiv

Model-based Reinforcement Learning with Multi-step Plan Value Estimation

A promising way to improve the sample efficiency of reinforcement learning is model-based methods, in which many explorations and evaluations can happen in the learned models to save real-world samples. However, when the learned model has a non-negligible model error, sequential steps in the model are hard to be accurately evaluated, limiting the model's utilization. This paper proposes to alleviate this issue by introducing multi-step plans to replace multi-step actions for model-based RL. We employ the multi-step plan value estimation, which evaluates the expected discounted return after executing a sequence of action plans at a given state, and updates the policy by directly computing the multi-step policy gradient via plan value estimation. The new model-based reinforcement learning algorithm MPPVE (Model-based Planning Policy Learning with Multi-step Plan Value Estimation) shows a better utilization of the learned model and achieves a better sample efficiency than state-of-the-art model-based RL approaches.

preprint2020arXiv

Proton Tunneling in a Two-Dimensional Potential Energy Surface with a Non-linear System-Bath Interaction: Thermal Suppression of Reaction Rate

We consider a proton-transfer (PT) system described by a proton-transfer reaction (PTR) coordinate and a rate promoting vibrational (RPV) coordinate interacting with a non-Markovian heat-bath. While dynamics of PT processes has been widely discussed using two-dimensional (2D) potential energy surfaces (PES), the role of the heat-bath, in particular, for a realistic form of the system-bath interaction has not been well explored. Previous studies are largely based on one-dimensional model and linear-linear (LL) system-bath interaction. In the present study, we introduce an exponential-linear (EL) system-bath interaction, which is derived from the analysis of a PTR-PRV system in a realistic situation. This interaction mainly causes vibrational dephasing in the PTR mode and population relaxation in the RPV mode. Numerical simulations were carried out using hierarchy equations of motion approach. We analyze the role of the heat-bath interaction on the chemical reaction rate as a function of the system-bath coupling strength at different temperature and for different values of the bath correlation time. A prominent feature of the present result is that while the reaction rate predicted from classical and quantum Kramers theory increases as the temperature increases, the present EL interaction model exhibits opposite temperature dependence. Kramers turn-over profile of the reaction rate as a function of the system-bath coupling is also suppressed in the present EL model turning into a plateau-like curve for larger system-bath interaction strength. Such features arise from the interplay of the vibrational dephasing process in the PTR mode and the population relaxation process in the RPV mode.