Graph explorer

Dynamics-aware Embeddings

In this paper we consider self-supervised representation learning to improve sample efficiency in reinforcement learning (RL). We propose a forward prediction objective for simultaneously learning embeddings of states and action sequences. These embeddings capture the structure of the environment's dynamics, enabling efficient policy learning. We demonstrate that our action embeddings alone improve the sample efficiency and peak performance of model-free RL on control from low-dimensional states. By combining state and action embeddings, we achieve efficient learning of high-quality policies on goal-conditioned continuous control from pixel observations in only 1-2 million environment steps.

7 nodes9 linksoverview previewDynamics-aware Embeddings
7 nodes9 links
Dynamics-aware Embeddings7 visible / 7 total nodes / 15 links
Related contextCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipAuthorshipWorks onWorks onAuthorshipAuthorshipAuthorshipTopic signalTopic signalWDynamics-aware Embeddingspreprint / 2020AWilliam WhitneyResearcherARajat AgarwalResearcherAKyunghyun ChoResearcherAAbhinav GuptaResearcherTMachine Learning49008 worksTArtificial Intelligence22915 works
PaperSignal 106 links

Dynamics-aware Embeddings

preprint / 2020

Open