A Flask-based search engine which allows a user to search for UHC plans and corresponding file URLs provided on https://transparency-in-coverage.uhc.com.
It has three modules
- Scraper - Scrapes JSON files' URLs from UHC website.
- JSON Downloader - Downloads, formats and merges all the JSON files.
- Flask Application - A search engine to query the centralized JSON files database.
- Python 3.9.x
- ChromeDriver 106.0.5249.61+ (https://chromedriver.chromium.org/getting-started)
Clone the repository.
git clone https://github.com/ahmadjanan/uhc-search-engine.git
- Install requirements
pip install -r requirements.txt
- Scrape and extract JSON URLs from webite.
python scraper.py
- Download JSON files and populate backend database.
python json_downloader.py
- Run server.
python server.py