ocr-pipeline

Here are 5 public repositories matching this topic...

ALucek / seb-ocr

vLLM Processing for Unstructured Historical Documents

ocr large-language-model vision-language-model ocr-pipeline

Updated Jun 22, 2025
Python

PRADUMAN-KR / OCR_model-HugginFace

Optical Character Recognition, OCR pipeline, Arabic OCR, Deep Learning OCR, Computer Vision text extraction, Text recognition system, AI document processing, Multilingual OCR, Transformer OCR, OCR benchmarking, Bounding box detection, Ground truth evaluation.

opencv paddlepaddle paddleocr hugginface arabic-ocr ai-document-processing ocr-pipeline deep-learning-ocr computer-vision-text-extraction paddleocr-v5

Updated May 20, 2026
Python

jcaperella29 / Document_cleaning_CLI

🧠 AI-powered pipeline for cleaning scanned documents. Removes noise, enhances text, auto-tunes model weights, and returns OCR-optimized PDFs via CLI or cloud API.

python ocr computer-vision deep-learning rest-api image-processing scanned-documents batch-processing denoising cli-tool document-processing pytesseract image-enhancement fastapi cloud-run document-ai auto-tune ocr-pipelines ocr-pipeline

Updated May 15, 2026
MATLAB

anshwysmcbel2710 / ocr-pdf-text-extraction-service

Serverless OCR & PDF Text Extraction microservice for Personal AI Factory v1. Built with TypeScript and Vercel Serverless Functions, using pdf-parse and node-fetch for high-performance parsing of machine-readable PDFs. Extracts clean text from textual PDFs and exposes an HTTP API that returns structured JSON output for downstream n8n.

Updated Jan 4, 2026
TypeScript

Not-Buddy / HackerXAPI

High-performance RAG API with AI, multi-format docs, Gemini integration, security, CLI.

scalability async-programming parallel-processing document-processing gemini-api pdf-processing vector-database ai-ml-integration ocr-pipeline batch-operations llm-intelligence tokio-runtime smart-context-filtering chunking-strategy prompt-injection-sanitization

Updated May 20, 2026
Rust
