Geometric Entropic Exploration

Exploration is essential for solving complex Reinforcement Learning (RL) tasks. Maximum State-Visitation Entropy (MSVE) formulates the exploration problem as a well-defined policy optimization problem whose solution aims at visiting all states as uniformly as possible. This is in contrast to standard uncertainty-based approaches where exploration is transient and eventually vanishes. However, existing approaches to MSVE are theoretically justified only for discrete state-spaces as they are oblivious to the geometry of continuous domains. We address this challenge by introducing Geometric Entropy Maximisation (GEM), a new algorithm that maximises the geometry-aware Shannon entropy of state-visits in both discrete and continuous domains. Our key theoretical contribution is casting geometry-aware MSVE exploration as a tractable problem of optimising a simple and novel noise-contrastive objective function. In our experiments, we show the efficiency of GEM in solving several RL problems with sparse rewards, compared against other deep RL exploration approaches.
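To make the MSVE objective concrete in the discrete setting, the sketch below computes the Shannon entropy of an empirical state-visitation distribution and compares a uniform visitation pattern with a skewed one. It is only an illustration of the quantity that MSVE seeks to maximise, not an implementation of GEM or of its noise-contrastive objective; the function name `visitation_entropy` and the toy visit sequences are assumptions made for this example.

```python
import math
from collections import Counter


def visitation_entropy(states):
    """Shannon entropy (in nats) of the empirical state-visitation distribution.

    `states` is a sequence of hashable discrete state identifiers
    (a hypothetical representation chosen for this illustration).
    """
    counts = Counter(states)
    total = len(states)
    # H(d) = -sum_s d(s) * log d(s), where d(s) is the visit frequency of state s
    return -sum((c / total) * math.log(c / total) for c in counts.values())


# Toy data: four states visited equally often vs. almost all time in state 0.
uniform_visits = [0, 1, 2, 3] * 25
skewed_visits = [0] * 97 + [1, 2, 3]

h_uniform = visitation_entropy(uniform_visits)
h_skewed = visitation_entropy(skewed_visits)

print(f"uniform visitation entropy: {h_uniform:.4f} nats")
print(f"skewed visitation entropy:  {h_skewed:.4f} nats")
```

With four states, the uniform pattern attains the maximum possible entropy, log 4, while the skewed pattern scores lower. This count-based estimate treats every state as an unrelated symbol, which is why it does not carry over directly to continuous domains where the geometry of the state space matters.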

Authors: Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Alaa Saade, Shantanu Thakoor, Bilal Piot, Bernardo Avila Pires, Michal Valko, Thomas Mesnard, Rémi Munos, Tor Lattimore
Topic: Machine Learning


preprint / 2021
