Graph explorer

Playable Video Generation

This paper introduces the unsupervised learning problem of playable video generation (PVG). In PVG, we aim at allowing a user to control the generated video by selecting a discrete action at every time step as when playing a video game. The difficulty of the task lies both in learning semantically consistent actions and in generating realistic videos conditioned on the user input. We propose a novel framework for PVG that is trained in a self-supervised manner on a large dataset of unlabelled videos. We employ an encoder-decoder architecture where the predicted action labels act as bottleneck. The network is constrained to learn a rich action space using, as main driving loss, a reconstruction loss on the generated video. We demonstrate the effectiveness of the proposed approach on several datasets with wide environment variety. Further details, code and examples are available on our project page willi-menapace.github.io/playable-video-generation-website.

8 nodes10 linksoverview previewPlayable Video Generation
8 nodes10 links
Playable Video Generation8 visible / 8 total nodes / 20 links
Related contextCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipAuthorshipWorks onWorks onAuthorshipAuthorshipAuthorshipTopic signalTopic signalAuthorshipWPlayable Video Generationpreprint / 2021AWilli MenapaceResearcherAStéphane LathuilièreResearcherASergey TulyakovResearcherAAliaksandr SiarohinResearcherTArtificial Intelligence22915 worksTComputer Vision30606 worksAElisa RicciResearcher
PaperSignal 107 links

Playable Video Generation

preprint / 2021

Open