Source author record

Jiaji Zhang

Jiaji Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence cond-mat.stat-mech Machine Learning physics.chem-ph

Catalog footprint

What is connected

2works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Model-based Reinforcement Learning with Multi-step Plan Value Estimation

A promising way to improve the sample efficiency of reinforcement learning is model-based methods, in which many explorations and evaluations can happen in the learned models to save real-world samples. However, when the learned model has a non-negligible model error, sequential steps in the model are hard to be accurately evaluated, limiting the model's utilization. This paper proposes to alleviate this issue by introducing multi-step plans to replace multi-step actions for model-based RL. We employ the multi-step plan value estimation, which evaluates the expected discounted return after executing a sequence of action plans at a given state, and updates the policy by directly computing the multi-step policy gradient via plan value estimation. The new model-based reinforcement learning algorithm MPPVE (Model-based Planning Policy Learning with Multi-step Plan Value Estimation) shows a better utilization of the learned model and achieves a better sample efficiency than state-of-the-art model-based RL approaches.

preprint2020arXiv

Proton Tunneling in a Two-Dimensional Potential Energy Surface with a Non-linear System-Bath Interaction: Thermal Suppression of Reaction Rate

We consider a proton-transfer (PT) system described by a proton-transfer reaction (PTR) coordinate and a rate promoting vibrational (RPV) coordinate interacting with a non-Markovian heat-bath. While dynamics of PT processes has been widely discussed using two-dimensional (2D) potential energy surfaces (PES), the role of the heat-bath, in particular, for a realistic form of the system-bath interaction has not been well explored. Previous studies are largely based on one-dimensional model and linear-linear (LL) system-bath interaction. In the present study, we introduce an exponential-linear (EL) system-bath interaction, which is derived from the analysis of a PTR-PRV system in a realistic situation. This interaction mainly causes vibrational dephasing in the PTR mode and population relaxation in the RPV mode. Numerical simulations were carried out using hierarchy equations of motion approach. We analyze the role of the heat-bath interaction on the chemical reaction rate as a function of the system-bath coupling strength at different temperature and for different values of the bath correlation time. A prominent feature of the present result is that while the reaction rate predicted from classical and quantum Kramers theory increases as the temperature increases, the present EL interaction model exhibits opposite temperature dependence. Kramers turn-over profile of the reaction rate as a function of the system-bath coupling is also suppressed in the present EL model turning into a plateau-like curve for larger system-bath interaction strength. Such features arise from the interplay of the vibrational dephasing process in the PTR mode and the population relaxation process in the RPV mode.