What's Changed
- Added `scribe` CLI command
  - If `scribe.js` is installed globally (`npm i -g scribe.js-ocr`), the `scribe` command can be used to process documents from the command line.
    - For example, `scribe recognize analyst_report.png` runs OCR on an image and saves the result as a PDF.
  - This feature is still experimental, and command/argument names and features may change without warning.
- Added new intermediate data format `.scribe` for storing and loading document data.
  - Given OCR is computationally expensive, it is often desirable to save results for later use without losing data.
  - By saving results to `.scribe` files, results can be re-loaded later (e.g. to export with slightly different settings).
    - While several other output formats can be re-loaded later (notably `.hocr` and `.pdf`), only `.scribe` can be re-loaded without any data being lost in the export/import process.
  - `.scribe` files only contain the text layer; they do not contain embedded images or PDF files.
  - `.scribe` files can be loaded alongside image/PDF files to restore both image and text data.
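
As a rough illustration of the `scribe` CLI usage described above, the sketch below wraps the `scribe recognize <file>` form from these notes in a small batch helper. The function name `scribe_batch` and the `DRY_RUN` variable are hypothetical and not part of scribe.js; the helper assumes `scribe.js-ocr` has already been installed globally, and because the CLI is experimental the command name may change.

```shell
#!/bin/sh
# Hypothetical helper (not part of scribe.js): runs `scribe recognize`
# on each file passed as an argument.
# If DRY_RUN is set, or the `scribe` command is not on PATH, the command
# line is printed instead of being executed.
scribe_batch() {
  for f in "$@"; do
    if [ -z "${DRY_RUN:-}" ] && command -v scribe >/dev/null 2>&1; then
      scribe recognize "$f"
    else
      printf 'scribe recognize %s\n' "$f"
    fi
  done
}
```

For example, `scribe_batch analyst_report.png` would run OCR on that one image, while setting `DRY_RUN=1` first only prints the commands that would be run.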
Full Changelog: v0.7.4...v0.8.0