text_analysis_ocr_service

This repository contains the code for the OCR API service, built on a Flask backend. It is used in conjunction with the Streamlit Information Extraction application (https://github.com/mohanbing/st_doc_ext).

Create and activate a venv

python -m venv <name_of_the_env>
source <name_of_the_env>/bin/activate

Install all requirements with pip

pip install -r requirements.txt

Create a .env file to specify the MongoDB credentials

MONGODB_USER = <user_name>
MONGODB_PWD = <password>
MONGODB_CLUSTER = <cluster_name>
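As a rough illustration of how these three values might be turned into a connection string, here is a minimal sketch using only the standard library. The helper names `parse_env` and `build_mongo_uri` are hypothetical, and the `mongodb+srv://...mongodb.net/` host layout is an assumption about an Atlas-style cluster; the actual connection logic lives in db.py and may differ.

```python
from urllib.parse import quote_plus

REQUIRED_KEYS = ("MONGODB_USER", "MONGODB_PWD", "MONGODB_CLUSTER")


def parse_env(text):
    """Parse KEY = VALUE lines (hypothetical helper, not part of this repo)."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blanks, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Tolerate spaces around "=" and optional surrounding quotes.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def build_mongo_uri(env):
    """Assemble a connection URI; the Atlas-style host layout is an assumption."""
    missing = [key for key in REQUIRED_KEYS if not env.get(key)]
    if missing:
        raise KeyError("missing settings: " + ", ".join(missing))
    # Percent-encode credentials so characters such as "@" or "/" stay valid.
    user = quote_plus(env["MONGODB_USER"])
    pwd = quote_plus(env["MONGODB_PWD"])
    return f"mongodb+srv://{user}:{pwd}@{env['MONGODB_CLUSTER']}.mongodb.net/"
```

With a .env file in the working directory, this could be used as `build_mongo_uri(parse_env(open(".env").read()))`. Percent-encoding the user name and password matters because special characters in credentials would otherwise break the URI.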

See the db.py file for the implementation of MongoDB caching and for the required database and collection names.
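For readers unfamiliar with the idea, caching OCR results generally means looking up a stored result before running OCR again. The sketch below shows one common pattern, assuming results are keyed by a SHA-256 hash of the file bytes. The `OcrCache` class is hypothetical and uses a plain dict as a stand-in for a MongoDB collection so it runs without a database; it is not the implementation in db.py.

```python
import hashlib


class OcrCache:
    """Hypothetical result cache keyed by the SHA-256 of the input bytes."""

    def __init__(self, store=None):
        # A dict stands in for a MongoDB collection so the sketch runs offline.
        self.store = {} if store is None else store

    @staticmethod
    def key_for(data: bytes) -> str:
        # Identical file contents always map to the same cache key.
        return hashlib.sha256(data).hexdigest()

    def get_or_compute(self, data: bytes, run_ocr):
        """Return the cached text, or call run_ocr(data) once and store it."""
        key = self.key_for(data)
        if key in self.store:
            return self.store[key]
        text = run_ocr(data)
        self.store[key] = text
        return text
```

Here `run_ocr` is any callable that takes the file bytes and returns the extracted text. Keying on a content hash rather than a file name means a re-uploaded document hits the cache even if it was renamed.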