latin-books

A budding repository of OCR transcriptions of older books, hand-tuned and intended for later contribution to ongoing projects. Currently Latin texts only!

Initial OCR done by Tesseract using gImageReader: https://github.com/manisandro/gImageReader

Hand corrections and scripted fixes for common issues (formatting, split lines, etc.).
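
As an illustration of the kind of scripted fix meant here, the following is a minimal Python sketch for the split-lines case. It is not the script used in this repository: the function name `join_split_lines` and its rules are assumptions made for the example. It rejoins words hyphenated across a line break and unwraps the remaining line breaks inside each paragraph, treating blank lines as paragraph separators.

```python
import re


def join_split_lines(text: str) -> str:
    """Rejoin hyphen-split words and unwrap lines within each paragraph.

    Hypothetical helper for illustration; paragraphs are assumed to be
    separated by one or more blank lines.
    """
    # Normalise Windows line endings so the patterns below only see "\n".
    text = text.replace("\r\n", "\n")

    # Split into paragraphs on blank (or whitespace-only) lines.
    paragraphs = re.split(r"\n\s*\n", text.strip())

    fixed = []
    for para in paragraphs:
        # A hyphen at the end of a line followed by a lowercase ASCII letter
        # is treated as a word split across lines: drop hyphen and break.
        para = re.sub(r"(\w)-\n\s*(?=[a-z])", r"\1", para)
        # Any remaining line break inside the paragraph becomes one space.
        para = re.sub(r"\s*\n\s*", " ", para)
        # Collapse runs of spaces or tabs left over from the OCR layout.
        para = re.sub(r"[ \t]{2,}", " ", para)
        fixed.append(para.strip())

    return "\n\n".join(fixed)


if __name__ == "__main__":
    sample = (
        "Gallia est omnis di-\n"
        "visa in partes tres,\n"
        "quarum unam incolunt Belgae.\n"
        "\n"
        "Hi omnes lingua inter se\n"
        "differunt."
    )
    print(join_split_lines(sample))
```

The lowercase check is a deliberate simplification: a hyphen followed by a capital letter is left alone, and words continuing with a non-ASCII letter (for example a ligature such as æ) are not rejoined, so output of this kind still needs hand proofing.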

Proofreading is always welcome. Please open a pull request for any typos or incorrect words you find.

More to come!