DocumentSimilarity

This repository contains the methods that were tried for measuring similarity between documents. The dataset used is MIMIC-III. The methods are:

  1. Doc2Vec using gensim
  2. TF-IDF
  3. LDA topic modelling using gensim
  4. Document similarity using Facebook's InferSent model
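
To illustrate the general idea behind the TF-IDF method, here is a minimal sketch that uses only the Python standard library. It is not the code from this repository: the function names (`tokenize`, `tfidf_vectors`, `cosine_similarity`), the smoothed IDF formula, and the three toy sentences are all assumptions made for the example, and no MIMIC-III data is used.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: the number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vec = {}
        for term, count in counts.items():
            tf = count / total
            # Smoothed IDF (an assumption of this sketch), so that terms
            # appearing in every document still get a nonzero weight.
            idf = math.log((1 + n_docs) / (1 + df[term])) + 1.0
            vec[term] = tf * idf
        vectors.append(vec)
    return vectors


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy documents, invented for illustration only.
docs = [
    "patient admitted with chest pain and shortness of breath",
    "patient reports chest pain and mild shortness of breath",
    "follow up visit for knee surgery recovery",
]

vectors = tfidf_vectors(docs)
sim_01 = cosine_similarity(vectors[0], vectors[1])
sim_02 = cosine_similarity(vectors[0], vectors[2])
print(f"doc0 vs doc1: {sim_01:.3f}")
print(f"doc0 vs doc2: {sim_02:.3f}")
```

The first two toy documents share most of their terms and so score higher than the first and third, which share none. On real clinical notes a proper tokenizer and stop-word handling would likely be needed.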