this is one of my college projects from information retrieval coursework. What I built here is simple search engine that able to search from inverted index that built from news data that crawled from detik.
- install the environment with command:
conda env create -f environment.yml --name some_env_name
- activate the virtual environment
- change directory to /web_app
- run the web server:
uvicon main:app --reload
- go to 127.0.0.1/site
- install and activate the enviroment, see the step from above steps
- run crawling.py (this only crawl the news from the first page of indeks berita detik, modification if you want to get more data)
- run inverted_index.py