A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
-
Updated
Dec 27, 2024 - C++
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format
Add a description, image, and links to the document-recognition topic page so that developers can more easily learn about it.
To associate your repository with the document-recognition topic, visit your repo's landing page and select "manage topics."