OCRmyPDF Qt GUI Client

A user-friendly Qt-based graphical interface for OCRmyPDF, making it easy to add OCR text layers to scanned PDF documents.

Features

Modern Qt Interface: Clean, responsive GUI built with PyQt5
Drag & Drop Support: Simply drag PDF files into the application window
Batch Processing: Process multiple PDF files simultaneously
Multi-language OCR: Support for 100+ languages via Tesseract OCR
Advanced Settings: Full access to OCRmyPDF's powerful features
Real-time Progress: Monitor processing with detailed progress information
Cross-platform: Works on Windows, macOS, and Linux

Installation

System Requirements

Minimum Requirements:

Operating System: Windows 10, macOS 10.14, or Linux with Qt5 support
Architecture: 64-bit system (strongly recommended)
Python: 3.10 or newer (64-bit recommended)
RAM: 2 GB (4 GB recommended for large files)
Storage: 500 MB free space (1 GB recommended for temporary files)

Required System Dependencies

IMPORTANT: These system dependencies must be installed BEFORE running the application:

1. Python 3.10+

Ubuntu/Debian: sudo apt update && sudo apt install python3 python3-pip python3-venv
Fedora/RHEL: sudo dnf install python3 python3-pip
macOS: brew install python3 or download from python.org
Windows: Download from python.org (make sure to check "Add to PATH")

2. Tesseract OCR 4.1.1+ (Required)

Ubuntu/Debian: sudo apt install tesseract-ocr tesseract-ocr-osd
Fedora/RHEL: sudo dnf install tesseract tesseract-osd
macOS: brew install tesseract
Windows: Download the installer from the UB-Mannheim Tesseract page

3. Ghostscript 9.54+ (Required)

Ubuntu/Debian: sudo apt install ghostscript
Fedora/RHEL: sudo dnf install ghostscript
macOS: brew install ghostscript
Windows: Download from the Ghostscript downloads page

4. Additional Language Support (Optional but Recommended)

For better OCR accuracy in different languages:

Turkish Language Pack:

Ubuntu/Debian: sudo apt install tesseract-ocr-tur
Fedora/RHEL: sudo dnf install tesseract-langpack-tur
macOS: brew install tesseract-lang
Windows: Download the language data files from the tessdata repository

Other Languages (Ubuntu/Debian package / Fedora/RHEL package):

German: tesseract-ocr-deu / tesseract-langpack-deu
French: tesseract-ocr-fra / tesseract-langpack-fra
Spanish: tesseract-ocr-spa / tesseract-langpack-spa
Russian: tesseract-ocr-rus / tesseract-langpack-rus
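To confirm that a newly installed language pack is actually visible to Tesseract, the output of tesseract --list-langs can be checked from Python. The following is a minimal sketch, not code from this project: the function names parse_list_langs and installed_languages are hypothetical, and it assumes the Tesseract 4+ behavior of printing a header line followed by one language code per line on standard output.

```python
import shutil
import subprocess


def parse_list_langs(output: str) -> set[str]:
    """Extract language codes from the text printed by `tesseract --list-langs`."""
    langs = set()
    for line in output.splitlines():
        line = line.strip()
        # Skip blank lines and the "List of available languages ..." header.
        if not line or line.lower().startswith("list of available languages"):
            continue
        langs.add(line)
    return langs


def installed_languages() -> set[str]:
    """Ask the local Tesseract binary which languages it can see (empty set if absent)."""
    exe = shutil.which("tesseract")
    if exe is None:
        return set()
    proc = subprocess.run(
        [exe, "--list-langs"], capture_output=True, text=True, check=False
    )
    return parse_list_langs(proc.stdout)
```

For example, "tur" in installed_languages() should be true once the Turkish pack is installed and Tesseract is on the PATH.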

5. Optional Performance Enhancements

jbig2enc (for better PDF compression):
- Ubuntu/Debian: sudo apt install jbig2enc
- macOS: brew install jbig2enc
pngquant (for PNG optimization):
- Ubuntu/Debian: sudo apt install pngquant
- macOS: brew install pngquant
unpaper (for additional cleaning options):
- Ubuntu/Debian: sudo apt install unpaper
- macOS: brew install unpaper

Verification Commands

After installing dependencies, verify they are working:

# Check Python version (should be 3.10+)
python3 --version

# Check Tesseract installation
tesseract --version
tesseract --list-langs

# Check Ghostscript installation (on Windows the console executable is gswin64c)
gs --version
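The same checks can be scripted. The sketch below is an illustration under stated assumptions rather than the application's own startup check: missing_dependencies and REQUIRED_TOOLS are hypothetical names, and it only tests that the executables are on the PATH, not that their versions meet the 4.1.1 and 9.54 minimums.

```python
import shutil
import sys

# Executable names to look for; Ghostscript's console binary is named
# differently on Windows (gswin64c / gswin32c) than on Linux and macOS (gs).
REQUIRED_TOOLS = {
    "Tesseract OCR": ("tesseract",),
    "Ghostscript": ("gs", "gswin64c", "gswin32c"),
}


def missing_dependencies(which=shutil.which, version_info=sys.version_info):
    """Return human-readable names of requirements that are not satisfied."""
    problems = []
    if tuple(version_info[:2]) < (3, 10):
        problems.append("Python 3.10 or newer")
    for label, candidates in REQUIRED_TOOLS.items():
        # A tool counts as present if any of its candidate names resolves.
        if not any(which(name) for name in candidates):
            problems.append(label)
    return problems


if __name__ == "__main__":
    missing = missing_dependencies()
    print("All dependencies found" if not missing else f"Missing: {', '.join(missing)}")
```

The which and version_info parameters exist only so the function can be exercised without touching the real system.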

Quick Start

Clone or download this repository

Run the application using the startup script (Linux/macOS):

./run_app.sh

Or manually:

python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt
python main.py

Usage

Launch the Application

# From command line
ocrmypdf-gui

# Or run directly
python main.py

Basic Workflow

Add Files: Drag PDF files into the window or use "Add Files" button
Configure Settings: Choose OCR language and processing options
Start Processing: Click "Start Processing" to begin OCR
Monitor Progress: Watch real-time progress in the progress dialog
Access Results: Find processed files in the output directory
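The workflow above amounts to running one OCR job per file and reporting progress as each job finishes. A minimal sketch of that pattern with the standard library follows. It is not the application's actual implementation: run_batch, process_one, and on_progress are hypothetical names, and process_one stands in for whatever function performs OCR on a single file.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from pathlib import Path


def run_batch(files, process_one, on_progress, max_workers=2):
    """Run process_one(path) for every file and report progress as jobs finish.

    Returns a dict mapping each input path to its result, or to the exception
    raised for that file, so one bad PDF does not abort the whole batch.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(process_one, Path(f)): Path(f) for f in files}
        for done, future in enumerate(as_completed(futures), start=1):
            path = futures[future]
            try:
                results[path] = future.result()
            except Exception as exc:  # keep going; record the failure
                results[path] = exc
            # Called in the thread that invoked run_batch, not in a worker.
            on_progress(done, len(futures), path)
    return results
```

In a Qt application the progress callback would typically emit a signal rather than touch widgets directly, since run_batch itself would normally run outside the GUI thread.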

Advanced Configuration

Access advanced settings through Edit → Settings:

OCR Languages: Select single or multiple OCR languages
Page Processing: Enable auto-rotation, deskewing, and cleaning
Image Quality: Adjust DPI and compression settings
PDF Output: Choose PDF/A formats and optimization levels
Performance: Configure CPU threads and memory usage
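Most of these settings correspond to options of the ocrmypdf command-line tool. The sketch below shows one way such a mapping could look. The OcrSettings class and build_command function are hypothetical and are not taken from this project's source; the flags themselves (--language, --rotate-pages, --deskew, --clean, --output-type, --optimize, --jobs) are standard OCRmyPDF options.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class OcrSettings:
    """Hypothetical container for the options exposed in the settings dialog."""
    languages: List[str] = field(default_factory=lambda: ["eng"])
    rotate_pages: bool = False
    deskew: bool = False
    clean: bool = False          # --clean requires unpaper to be installed
    output_type: str = "pdfa-2"  # e.g. pdf, pdfa-1, pdfa-2, pdfa-3
    optimize: int = 1            # 0 (off) to 3 (most aggressive)
    jobs: Optional[int] = None   # None lets OCRmyPDF choose


def build_command(settings: OcrSettings, input_pdf: str, output_pdf: str) -> List[str]:
    """Translate settings into an argument list suitable for subprocess.run."""
    # Multiple languages are joined with "+", e.g. "eng+tur".
    cmd = ["ocrmypdf", "--language", "+".join(settings.languages)]
    if settings.rotate_pages:
        cmd.append("--rotate-pages")
    if settings.deskew:
        cmd.append("--deskew")
    if settings.clean:
        cmd.append("--clean")
    cmd += ["--output-type", settings.output_type]
    cmd += ["--optimize", str(settings.optimize)]
    if settings.jobs:
        cmd += ["--jobs", str(settings.jobs)]
    cmd += [input_pdf, output_pdf]
    return cmd
```

Building the argument list as a pure function keeps the mapping easy to test without running OCR; lowering jobs is also the simplest way to reduce memory use on large files.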

System Requirements

Minimum Requirements

OS: Windows 10, macOS 10.14, or Linux with Qt5 support
RAM: 2 GB (4 GB recommended for large files)
Storage: 500 MB free space
Python: 3.10 or newer

Recommended Requirements

RAM: 4 GB or more for processing large PDF files
CPU: Multi-core processor for parallel processing
Storage: 1 GB free space for temporary files

Supported File Formats

Input

PDF files (including scanned PDFs)
Password-protected PDFs (with manual password entry)

Output

Standard PDF with OCR text layer
PDF/A-1b, PDF/A-2b, PDF/A-3b (archival formats)

Configuration

The application stores settings in platform-specific locations:

Windows: %APPDATA%\OCRmyPDF-GUI\OCRmyPDF-GUI.ini
macOS: ~/Library/Preferences/com.OCRmyPDF-GUI.OCRmyPDF-GUI.plist
Linux: ~/.config/OCRmyPDF-GUI/OCRmyPDF-GUI.conf
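When scripting backups or resets of the settings file, the table above can be expressed as a small lookup. This is only a sketch that mirrors the listed paths: settings_file is a hypothetical helper, the application itself presumably resolves these locations through Qt, and the Linux branch ignores XDG_CONFIG_HOME for simplicity.

```python
import os
import sys
from pathlib import PurePosixPath, PureWindowsPath

APP = "OCRmyPDF-GUI"


def settings_file(platform=sys.platform, env=os.environ, home="~"):
    """Return the documented settings path for the given platform as a string."""
    if platform.startswith("win"):
        # Falls back to the literal placeholder if APPDATA is not set.
        base = PureWindowsPath(env.get("APPDATA", "%APPDATA%"))
        return str(base / APP / f"{APP}.ini")
    if platform == "darwin":
        return str(PurePosixPath(home) / "Library" / "Preferences" / f"com.{APP}.{APP}.plist")
    # Everything else is treated as Linux.
    return str(PurePosixPath(home) / ".config" / APP / f"{APP}.conf")
```

Pass home=os.path.expanduser("~") to get an absolute path on the current machine.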

Troubleshooting

Common Issues

"OCRmyPDF not found" error

Install OCRmyPDF into the same Python environment that runs the application:

pip install ocrmypdf

"Tesseract not found" error

Ensure Tesseract is installed and added to system PATH
Check installation: tesseract --version

"Ghostscript not found" error

Install Ghostscript for your platform
Check installation: gs --version (on Windows: gswin64c --version)

Memory errors with large files

Reduce the number of parallel jobs in settings
Increase system memory or use smaller files

Enable Debug Logging

Run with verbose logging to diagnose issues:

python main.py --verbose

Logs are saved to ocr_gui.log in the application directory.
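For readers curious how a --verbose flag is commonly wired to a log file, here is a minimal sketch using only the standard library. It is an assumption about the general pattern, not a copy of main.py: parse_args and configure_logging are hypothetical names, and only the flag name and the ocr_gui.log file name come from this README.

```python
import argparse
import logging


def parse_args(argv=None):
    """Parse the command line; --verbose switches the log level to DEBUG."""
    parser = argparse.ArgumentParser(prog="main.py")
    parser.add_argument(
        "--verbose", action="store_true", help="write DEBUG-level messages to the log"
    )
    return parser.parse_args(argv)


def configure_logging(verbose, log_path="ocr_gui.log"):
    """Attach a file handler to the root logger and return the chosen level."""
    level = logging.DEBUG if verbose else logging.INFO
    handler = logging.FileHandler(log_path, encoding="utf-8")
    handler.setFormatter(
        logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")
    )
    root = logging.getLogger()
    root.setLevel(level)
    root.addHandler(handler)
    return level


if __name__ == "__main__":
    args = parse_args()
    configure_logging(args.verbose)
    logging.getLogger(__name__).debug("debug logging enabled")
```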

Development

Setup Development Environment

git clone https://github.com/tw4/OCRmyPDF-Qt-GUI-Client.git
cd OCRmyPDF-Qt-GUI-Client
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt
pip install -e ".[dev]"  # quotes keep zsh from treating the brackets as a glob

Run Tests

pytest tests/

Code Style

This project uses Black for code formatting:

black .

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create a feature branch (git checkout -b feature/AmazingFeature)
Make your changes
Add tests for new functionality
Ensure all tests pass (pytest)
Format code with Black (black .)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

License and Legal Information

This project is licensed under the Mozilla Public License 2.0 - see the LICENSE file for full details.

Third-Party Components

This application uses several third-party components, each with their own licenses:

OCRmyPDF: Licensed under Mozilla Public License 2.0
PyQt5: Licensed under GPL v3 / Commercial License
Tesseract OCR: Licensed under Apache License 2.0
Ghostscript: Licensed under AGPL v3 / Commercial License
Pillow: Licensed under PIL Software License

Attribution Requirements

As required by the Mozilla Public License 2.0:

This application is a derivative work that provides a graphical user interface for OCRmyPDF functionality
Source code is available and governed by the MPL 2.0 license terms
Recipients are informed that the source code is available under these license terms
All original license notices have been preserved

Commercial Use

This GUI application: Free for commercial use under MPL 2.0
OCRmyPDF: Free for commercial use under MPL 2.0
PyQt5: Available under GPL v3; a commercial license is needed if you cannot comply with the GPL (for example, when distributing a closed-source application)
Ghostscript: Available under AGPL v3; a commercial license is needed if you cannot comply with the AGPL

Please review the individual license terms for each component before commercial deployment.

Acknowledgments

OCRmyPDF - The OCR tool that powers this GUI
Tesseract OCR - Open source OCR engine, originally developed at Hewlett-Packard and later sponsored by Google
PyQt5 - The cross-platform GUI framework
Ghostscript - PostScript and PDF interpreter
The OCRmyPDF Community - For developing and maintaining the excellent OCR processing library

Support

Documentation: Read the Docs
Issues: GitHub Issues
Discussions: GitHub Discussions

Changelog

Version 1.0.0 (2025-08-25)

Initial release
Basic OCR functionality with GUI
Drag & drop support
Multi-language OCR support
Batch processing capabilities
Advanced settings dialog
Cross-platform support