The script extracts text from images using Tesseract OCR and saves it to a text file.
This has been written in Python and uses the pytesseract
and Pillow
libraries.
I created this to extract texts from my Reddit screenshots :P
- Tesseract OCR:
- Install Tesseract OCR and its language data files.
- Python Libraries:
pytesseract
: A Python wrapper for Tesseract OCR.Pillow
: Python Imaging Library which will be used for opening and manipulating images.
sudo pacman -S tesseract tesseract-data-eng
sudo apt-get install tesseract-ocr
brew install tesseract
pip install pytesseract pillow
mkdir image
python3 script.py