Dot-matrix text recognition in Python via PyTesseract (based on Tesseract)
Runs Tesseract OCR on an image file to scan strings. Finds strings matching a common pattern and uses regex to filter results. Stores results in an Excel spreadsheet.
- To install dependencies:
pip3 install -r requirements.txt - Run
python3 script.py, enter inputs for image file (eg:test.jpg) to be scanned and a common pattern string.