🛠️ Built with open-sourced Tesseract OCR Engine (WebAssembly JavaScript port). An offline in-browser setup.
A Fusion of OCR Technology (Tesseract.js) & Web Speech API 🗣💬. Non-mobile responsive 📱🚫.
- Image-to-Text: Input image file for Text Extraction with Tesseract-OCR
- PDF-to-Image-to-Text: Input PDF File to render each page image for Text Extraction with PDF.js
- Text-to-Speech: Voice generation with Web API SpeechSynthesis
- Export extracted text to local .txt file.
Article One :: Build A Text-To-Speech App Using Client-Side JavaScript
Article Two :: Build An Image & PDF Text Extraction Tool with Tesseract OCR Using Client-side JavaScript
— Join me on 📝 Medium at ~ ξ(🎀˶❛◡❛) @geek-cc
🌮 Please buy me a Taco! 😋
- © Copyright - This work is licensed under Apache License
- Original library at Tesseract.js v2 and its co-creator/maintainer Jerome Wu