-
Notifications
You must be signed in to change notification settings - Fork 0
Calculate TF-IDF cosine similarity of files
License
lincerely/tfidf
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
# TFIDF Given a list of filenames, calculate TF-IDF cosine similarity matrix for all entries. ## Reference - https://www.sejuku.net/blog/26420 - https://atmarkit.itmedia.co.jp/ait/articles/2112/23/news028.html ## License The stemming code is from https://tartarus.org/martin/PorterStemmer/, by Martine Porter. > "The software is completely free for any purpose, unless notes at the head of the program text indicates otherwise (which is rare)..." For the rest of the code, see the license attached.
About
Calculate TF-IDF cosine similarity of files