Graph explorer

Skip-Thought Vectors

We describe an approach for unsupervised learning of a generic, distributed sentence encoder. Using the continuity of text from books, we train an encoder-decoder model that tries to reconstruct the surrounding sentences of an encoded passage. Sentences that share semantic and syntactic properties are thus mapped to similar vector representations. We next introduce a simple vocabulary expansion method to encode words that were not seen as part of training, allowing us to expand our vocabulary to a million words. After training our model, we extract and evaluate our vectors with linear models on 8 tasks: semantic relatedness, paraphrase detection, image-sentence ranking, question-type classification and 4 benchmark sentiment and subjectivity datasets. The end result is an off-the-shelf encoder that can produce highly generic sentence representations that are robust and perform well in practice. We will make our encoder publicly available.

10 nodes11 linksoverview previewSkip-Thought Vectors
10 nodes11 links
Skip-Thought Vectors10 visible / 10 total nodes / 32 links
Related contextCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipCo-authorshipAuthorshipWorks onAuthorshipAuthorshipAuthorshipTopic signalTopic signalAuthorshipAuthorshipAuthorshipWSkip-Thought Vectorspreprint / 2015ARyan KirosResearcherAYukun ZhuResearcherARuslan SalakhutdinovResearcherARichard S. ZemelResearcherTMachine Learning49008 worksTComputation and Language14115 worksAAntonio TorralbaResearcherARaquel UrtasunResearcherASanja FidlerResearcher
PaperSignal 109 links

Skip-Thought Vectors

preprint / 2015

Open