Skip to content

Original doc2vec implementation by Tomas Mikolov

Notifications You must be signed in to change notification settings

yiqiang-zhao/doc2vec

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

doc2vec

Original doc2vec implementation by Tomas Mikolov from this Google Groups discussion. Just keeping this here for backup.

Mikolov's words

I'm sending modified word2vec version that I wrote during the summer to help one intern with his project. It allows to train the sentence vectors, and the attached script runs it on IMDB. It also trains recurrent neural network language model to perform classification (another baseline, showing that generative models can work reasonably well for this task too, although the discriminative ones are obviously better). You can comment out this part of the script.

This combined with SVM classifier with bag of bigram features actually achieves state of the art on the IMDB dataset (though this is not in the attached script).

About

Original doc2vec implementation by Tomas Mikolov

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C 87.2%
  • Shell 12.8%