Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 80+ languages.
Updated Sep 19, 2025 - Python
All-in-One Development Tool based on PaddlePaddle
A high-quality PDF-to-Markdown tool based on large language model visual recognition.