A lightweight Chrome extension to capture any area of a webpage and extract text using Tesseract OCR.
- Area Selection: Draw a resizable selection box anywhere on the webpage.
- Screenshot Capture: Capture only the selected portion of the page.
- Text Recognition: Extract text from the selected screenshot using Tesseract.js.
- On-Demand Activation: Activate only when the extension icon is clicked.
- Device Independent: Supports high-DPI screens by considering device pixel ratio.
- Clone or download this repository.
- Open Chrome and go to
chrome://extensions/. - Enable Developer mode.
- Click Load unpacked and select the project folder.
- Click the extension icon in the Chrome toolbar.
- Draw a selection box on the page.
- Wait a few seconds for OCR to process the selection.
- Check the recognized text in the browser console.
- Chrome Extension Manifest V3
- Tesseract.js for OCR
- JavaScript & HTML5 Canvas
- OCR runs directly in the content script without using web workers to ensure compatibility in Chrome extensions.
tesseract.min.jsmust be included in the extension folder.
MIT License
