Graph explorer

Hypermodels for Exploration

We study the use of hypermodels to represent epistemic uncertainty and guide exploration. This generalizes and extends the use of ensembles to approximate Thompson sampling. The computational cost of training an ensemble grows with its size, and as such, prior work has typically been limited to ensembles with tens of elements. We show that alternative hypermodels can enjoy dramatic efficiency gains, enabling behavior that would otherwise require hundreds or thousands of elements, and even succeed in situations where ensemble methods fail to learn regardless of size. This allows more accurate approximation of Thompson sampling as well as use of more sophisticated exploration schemes. In particular, we consider an approximate form of information-directed sampling and demonstrate performance gains relative to Thompson sampling. As alternatives to ensembles, we consider linear and neural network hypermodels, also known as hypernetworks. We prove that, with neural network base models, a linear hypermodel can represent essentially any distribution over functions, and as such, hypernetworks are no more expressive.

9 nodes11 linksoverview previewHypermodels for Exploration
9 nodes11 links
Hypermodels for Exploration9 visible / 9 total nodes / 26 links
Related contextWorks onCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipAuthorshipWorks onAuthorshipAuthorshipAuthorshipTopic signalTopic signalAuthorshipAuthorshipWHypermodels for Explorationpreprint / 2020AVikranth DwaracherlaResearcherAXiuyuan LuResearcherAMorteza IbrahimiResearcherAIan OsbandResearcherTMachine Learning49008 worksTmath.OC9232 worksAZheng WenResearcherABenjamin Van RoyResearcher
PaperSignal 108 links

Hypermodels for Exploration

preprint / 2020

Open