This repository contains the methods that have been tried for measuring similarity between documents. The dataset used is MIMIC-III. The methods are:
- Doc2vec using gensim
- TF-IDF
- LDA topic modelling using gensim
- Document similarity using Facebook's InferSent model
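
As a rough illustration of the TF-IDF approach from the list above, here is a minimal, dependency-free sketch. It is not the code used in this repository: the function names (`tokenize`, `tfidf_vectors`, `cosine_similarity`), the smoothed IDF formula, and the three example sentences are all assumptions made for the example, and the sentences are made up rather than taken from MIMIC-III.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase and split on whitespace (hypothetical helper);
    # real clinical notes would need more careful cleaning.
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vec = {}
        for term, count in counts.items():
            tf = count / total
            # Smoothed IDF (an assumed variant; other formulas are common).
            idf = math.log((1 + n_docs) / (1 + df[term])) + 1.0
            vec[term] = tf * idf
        vectors.append(vec)
    return vectors


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Made-up example sentences, not MIMIC-III records.
docs = [
    "patient admitted with chest pain and shortness of breath",
    "patient reports chest pain and mild shortness of breath",
    "follow up visit for knee surgery recovery",
]
vectors = tfidf_vectors(docs)
sim_01 = cosine_similarity(vectors[0], vectors[1])
sim_02 = cosine_similarity(vectors[0], vectors[2])
print(f"doc0 vs doc1: {sim_01:.3f}")
print(f"doc0 vs doc2: {sim_02:.3f}")
```

The first two sentences share most of their terms, so their score is higher than the score for the first and third, which share none. The other methods in the list follow the same pattern of turning each document into a vector and comparing vectors, but they learn or infer the vectors instead of counting terms.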