-
Navigate into the source directory
-
To run basic searching, run the following command:
python3 step4_offline_driver.py -
To run term proximity searching, run the following command:
python3 step5_offline_driver.py -
To run relevance feedback searching, run the following command:
python3 step6_offline_driver.py
Instructions for submitting queries are printed to the terminal at runtime.
The Online Version of This Search Engine Is Available Here.
-
Place a folder of HTML documents in the file_cache/unprocessed/ directory.
-
Let the name of this folder be customFolder.
-
Navigate into the source directory.
-
Open a Python terminal:
python -
Run the follwing commands:
import docproc, indexer dp = docproc.DocProcessor() dp.runDocProc("../file_cache/unprocessed/customFolder") iic = indexer.InvertedIndex() iic.createInvertedIndex("../file_cache/processed/customFolder")The new inverted index will be written as data directory as customFolder_index.txt