Graph explorer

Asynchronous Coagent Networks

Coagent policy gradient algorithms (CPGAs) are reinforcement learning algorithms for training a class of stochastic neural networks called coagent networks. In this work, we prove that CPGAs converge to locally optimal policies. Additionally, we extend prior theory to encompass asynchronous and recurrent coagent networks. These extensions facilitate the straightforward design and analysis of hierarchical reinforcement learning algorithms like the option-critic, and eliminate the need for complex derivations of customized learning rules for these algorithms.

5 nodes4 linksoverview previewAsynchronous Coagent Networks
5 nodes4 links
Asynchronous Coagent Networks5 visible / 5 total nodes / 7 links
Co-authorshipCo-authorshipCo-authorshipAuthorshipAuthorshipAuthorshipTopic signalWAsynchronous Coagent Networkspreprint / 2020AJames E. KostasResearcherAChris NotaResearcherAPhilip S. ThomasResearcherTMachine Learning49008 works
PaperSignal 104 links

Asynchronous Coagent Networks

preprint / 2020

Open