A program to search through the Cranfield documents and rank them accordingly based on your search query.
Uses a Vector Space Model with TF-IDF weighting.
- run jar with the command
java -jar cranfieldTFIDF.jar file size
file
: cran.1400.all or similarly formatted file
size
: number of documents within the file - Enter your query, and then how many relevant documents you wish to see.
Documents will be listed in descending order, with the highest score being the most relevant