An offline UHC search engine which can sync with the UHC website to download latest data.

ahmadjanan/uhc-search-engine

UHC Search Engine


A Flask-based search engine which allows a user to search for UHC plans and corresponding file URLs provided on https://transparency-in-coverage.uhc.com.

It has three modules:

  1. Scraper - Scrapes the JSON file URLs from the UHC website.
  2. JSON Downloader - Downloads, formats and merges all the JSON files.
  3. Flask Application - A search engine to query the centralized JSON files database.
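
As a rough illustration of what the last two modules do, the sketch below merges several JSON documents into one list and runs a case-insensitive substring search over it. The record fields (`plan_name`, `url`) and the function names `merge_records` and `search_plans` are assumptions made for this example; the repository's actual data layout and search logic may differ.

```python
import json
from typing import Iterable

# Hypothetical record shape, assumed for illustration only:
# {"plan_name": "<plan name>", "url": "<file URL>"}


def merge_records(json_documents: Iterable[str]) -> list:
    """Parse each JSON document (a list of records) and merge them,
    keeping only the first record seen for each URL."""
    merged, seen_urls = [], set()
    for document in json_documents:
        for record in json.loads(document):
            if record["url"] not in seen_urls:
                seen_urls.add(record["url"])
                merged.append(record)
    return merged


def search_plans(records: list, query: str) -> list:
    """Return the records whose plan name contains the query, ignoring case."""
    needle = query.strip().lower()
    return [r for r in records if needle in r["plan_name"].lower()]
```

A Flask view could call something like `search_plans` with the user's query and render the matching plan names and URLs.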

Table of contents


  1. Requirements
  2. Directory Structure
  3. Setup Project

1. Requirements


  1. Python 3.9.x
  2. ChromeDriver 106.0.5249.61+ (https://chromedriver.chromium.org/getting-started)
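
A quick way to check both requirements before setup might look like the sketch below. It assumes the driver is installed as an executable named `chromedriver` on `PATH`, and it does not verify the ChromeDriver version; the helper names are hypothetical.

```python
import shutil
import sys


def python_version_ok(version_info=sys.version_info) -> bool:
    """Return True for Python 3.9.x, the version listed above."""
    return tuple(version_info[:2]) == (3, 9)


def find_chromedriver():
    """Return the path of a `chromedriver` executable on PATH, or None.

    Assumption: the driver is installed under the name `chromedriver`.
    """
    return shutil.which("chromedriver")


if __name__ == "__main__":
    print("Python 3.9.x:", python_version_ok())
    print("chromedriver:", find_chromedriver() or "not found on PATH")
```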

2. Directory Structure


Clone the repository.

    git clone https://github.com/ahmadjanan/uhc-search-engine.git

3. Setup Project


  • Install requirements
    pip install -r requirements.txt
    
  • Scrape and extract JSON URLs from the website.
    python scraper.py  
    
  • Download JSON files and populate the backend database.
    python json_downloader.py
    
  • Run server.
    python server.py  
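
The three script steps above can also be chained from one small driver, sketched below. It assumes it is run from the repository root with the requirements already installed; `PIPELINE`, `build_commands`, and `run_pipeline` are hypothetical names, not part of the repository.

```python
import subprocess
import sys

# Script names are taken from the setup steps; the order matters because
# each step depends on the output of the previous one.
PIPELINE = ["scraper.py", "json_downloader.py", "server.py"]


def build_commands(python=sys.executable):
    """Return one command per step, using the current interpreter by default."""
    return [[python, script] for script in PIPELINE]


def run_pipeline(runner=subprocess.run):
    """Run each step in order and stop at the first failure."""
    for command in build_commands():
        # check=True makes subprocess.run raise CalledProcessError
        # when a script exits with a non-zero status.
        runner(command, check=True)
```

Calling `run_pipeline()` would block on the last step for as long as the server keeps running.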
    
