Here is a look at the application's graphical user interface:
This project is a Zonal Optical Character Recognition (OCR) application developed using Python's Tkinter library. It is designed to extract text from specific zones or areas within a document or image, offering a user-friendly interface to define these zones and retrieve text efficiently.
This is no where close to complete yet. There will be many updates to come. The goal is to create an easy way for users to create document extraction ML models for their own use.
- User-Friendly Interface: Built with Tkinter.
- Zone Selection: Users can select specific areas in a document for text extraction.
- OCR Functionality: Utilizes advanced OCR technology to accurately extract text from the selected zones.
- Document Support: Compatible with various document and image formats.
- Real-Time Preview: Offers a real-time preview of selected zones and extracted text.
- Python >=3.9
- Tkinter
- Clone the repository:
git clone https://github.com/ThinkOrFaust/QuickZonalOCR.git
- Install required dependencies:
pip install -r requirements.txt
- Run the application:
python gui.py
Details to come soon.
Contributions to improve the application are welcome. Please follow these steps:
- Fork the repository.
- Create a new branch (
git checkout -b feature-branch
). - Make your changes and commit them (
git commit -am 'Add some feature'
). - Push to the branch (
git push origin feature-branch
). - Create a new Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
- PaddleOCR
- CustomTkinter by TomSchimansky
- Tesseract