Doc2vec tutorial
▻http://radimrehurek.com/2014/12/doc2vec-tutorial
The latest #gensim release has a new class named #Doc2Vec. All credit for this class, which is an implementation of Quoc Le & Tomáš Mikolov: “Distributed Representations of Sentences and Documents”, as well as for this tutorial, goes to the illustrious Tim Emerick.
Doc2vec (aka paragraph2vec, aka sentence embeddings) adapts the word2vec algorithm to the unsupervised learning of continuous representations for larger blocks of text, such as sentences, paragraphs or entire documents.
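To make "continuous representations for larger blocks of text" concrete, here is a toy sketch of one variant described in the Le & Mikolov paper (PV-DBOW), where each document gets its own vector that is trained to predict the words it contains. This is not gensim's implementation or API: the function name `train_pv_dbow`, its parameters, the uniform negative sampling and the three sample sentences are all assumptions made for illustration, using only the Python standard library.

```python
import math
import random


def train_pv_dbow(docs, dim=16, epochs=50, lr=0.05, negatives=3, seed=0):
    """Toy PV-DBOW (hypothetical helper): one vector per tokenized document."""
    rng = random.Random(seed)
    vocab = sorted({word for doc in docs for word in doc})
    word_index = {word: i for i, word in enumerate(vocab)}
    # Document vectors start small and random; word output vectors start at zero.
    doc_vecs = [[(rng.random() - 0.5) / dim for _ in range(dim)] for _ in docs]
    out_vecs = [[0.0] * dim for _ in vocab]

    for _epoch in range(epochs):
        for d, doc in enumerate(docs):
            for word in doc:
                target = word_index[word]
                # One positive sample plus a few negatives drawn uniformly
                # (a simplification; real implementations sample by frequency).
                samples = [(target, 1.0)]
                for _n in range(negatives):
                    neg = rng.randrange(len(vocab))
                    if neg != target:
                        samples.append((neg, 0.0))
                grad_doc = [0.0] * dim
                for idx, label in samples:
                    score = sum(a * b for a, b in zip(doc_vecs[d], out_vecs[idx]))
                    score = max(-20.0, min(20.0, score))  # keep exp() in range
                    pred = 1.0 / (1.0 + math.exp(-score))
                    g = lr * (label - pred)  # gradient of the log-likelihood
                    for k in range(dim):
                        grad_doc[k] += g * out_vecs[idx][k]
                        out_vecs[idx][k] += g * doc_vecs[d][k]
                # Apply the accumulated update to the document vector.
                for k in range(dim):
                    doc_vecs[d][k] += grad_doc[k]
    return doc_vecs


docs = [
    "the cat sat on the mat".split(),
    "a cat lay on the rug".split(),
    "stock prices fell sharply today".split(),
]
vectors = train_pv_dbow(docs, dim=8)
print(len(vectors), len(vectors[0]))
```

Each entry of `vectors` is a fixed-length embedding for the corresponding document, whatever its length in words. For real work, the gensim class mentioned above is the thing to use; the sketch only shows the shape of the idea.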
#text-mining cc: @lewer @lazuly