A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Updated Apr 9, 2025 - C++
Local adaptive image binarization
A fast and accurate command line tool for extracting text from PDF files.
CVL/READ Modules including Basic Layout Analysis and Writer Identification/Retrieval