A privacy-first, open-source OCR tool powered by Gemini AI. It scans identity documents such as Aadhaar cards, PAN cards, and passports and extracts structured data from them securely and accurately.
✅ Built with: Next.js, Tailwind CSS, ShadCN UI, Gemini API, REST API
- 🧠 AI-powered OCR & smart document data extraction
- 📸 Upload unlimited documents (JPG, PNG, WebP supported)
- 🔐 No personal data stored or shared — secure & private
- 🧩 100% open source — fork, improve, self-host
- 💡 Developer-friendly REST API integration
- ⚡ Built using modern and scalable web technologies
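
The supported upload formats listed above (JPG, PNG, WebP) could be enforced with a small MIME-type check before a file is sent for processing. This is a minimal sketch, not code taken from this repository; the names `SUPPORTED_IMAGE_TYPES` and `isSupportedImageType` are hypothetical.

```typescript
// Hypothetical helper, not part of the Ocr-Ai source.
// MIME types for the three formats the README lists: JPG, PNG, WebP.
const SUPPORTED_IMAGE_TYPES: readonly string[] = [
  "image/jpeg",
  "image/png",
  "image/webp",
];

function isSupportedImageType(mimeType: string): boolean {
  // Drop any parameters (e.g. "; charset=binary") and normalize case.
  const base = mimeType.split(";")[0].trim().toLowerCase();
  return SUPPORTED_IMAGE_TYPES.indexOf(base) !== -1;
}
```

In a browser upload handler, the value passed in would usually be `file.type` from a `File` object.
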
| Layer | Tech Stack |
|---|---|
| Frontend | Next.js, Tailwind CSS, ShadCN UI |
| Backend | Node.js REST API |
| AI Service | Gemini AI (Google Generative AI) |
| Deployment | Vercel |
git clone https://github.com/WizardGeeky/Ocr-Ai.git
cd Ocr-Ai
npm install

Create a .env.local file in the root directory:
GEMINI_API_KEY=your_gemini_api_key
⚠️ Your API key is required to access Gemini AI for document processing.
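
Because the key is required, it can help to fail fast with a clear message when it is missing. The sketch below is an assumption about how such a check might look, not the project's actual code; `requireGeminiKey` is a hypothetical name, and only the variable name `GEMINI_API_KEY` comes from this README.

```typescript
// Hypothetical helper, not part of the Ocr-Ai source.
// Takes the environment as a parameter so it can be tested without Node globals.
function requireGeminiKey(env: Record<string, string | undefined>): string {
  const key = env["GEMINI_API_KEY"];
  // Treat an unset, blank, or still-placeholder value as missing.
  if (key === undefined || key.trim() === "" || key === "your_gemini_api_key") {
    throw new Error("GEMINI_API_KEY is missing; set it in .env.local");
  }
  return key;
}
```

In server-side code this would typically be called as `requireGeminiKey(process.env)`; Next.js loads `.env.local` into `process.env` automatically.
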
npm run dev

Visit: http://localhost:3000
This tool can scan and extract structured data from:
- ✅ Aadhaar Cards
- ✅ PAN Cards
- ✅ Indian Passports
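
Extracted values for these documents can be sanity-checked against their well-known formats: a PAN is five letters, four digits, and one letter, and an Aadhaar number is twelve digits. The following is an illustrative sketch only; this README does not describe the tool's output fields, so the function names are hypothetical and the checks cover format, not authenticity.

```typescript
// Hypothetical validators, not part of the Ocr-Ai source.
// They check format only and cannot tell whether a number was actually issued.

function looksLikePan(value: string): boolean {
  // PAN format: 5 letters, 4 digits, 1 letter (e.g. "ABCDE1234F").
  return /^[A-Z]{5}[0-9]{4}[A-Z]$/.test(value.trim().toUpperCase());
}

function looksLikeAadhaar(value: string): boolean {
  // Aadhaar numbers are 12 digits, often printed in groups of four.
  const digits = value.replace(/[\s-]/g, "");
  return /^[0-9]{12}$/.test(digits);
}
```

A failed check is a useful signal to ask the user for a clearer image rather than returning a possibly misread value.
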
Your privacy is a top priority:
- ❌ No personal data is stored or logged
- 🔒 No data is shared with third parties
- 💾 All AI processing is done in-memory only
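
In practice, in-memory handling usually means encoding the uploaded bytes straight into the request payload instead of writing them to disk. The sketch below shows one way that could look. It is an assumption, not this project's implementation: `toBase64` and `toInlineImagePart` are hypothetical names, and the `inlineData` shape is modeled on how the Google Generative AI JavaScript SDK accepts inline images.

```typescript
// Hypothetical helpers, not part of the Ocr-Ai source.
const B64 = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";

// Dependency-free base64 encoder; in Node,
// Buffer.from(bytes).toString("base64") gives the same result.
function toBase64(bytes: Uint8Array): string {
  let out = "";
  for (let i = 0; i < bytes.length; i += 3) {
    const has1 = i + 1 < bytes.length;
    const has2 = i + 2 < bytes.length;
    const b0 = bytes[i];
    const b1 = has1 ? bytes[i + 1] : 0;
    const b2 = has2 ? bytes[i + 2] : 0;
    out += B64.charAt(b0 >> 2);
    out += B64.charAt(((b0 & 3) << 4) | (b1 >> 4));
    // Pad with "=" when the final group has fewer than three bytes.
    out += has1 ? B64.charAt(((b1 & 15) << 2) | (b2 >> 6)) : "=";
    out += has2 ? B64.charAt(b2 & 63) : "=";
  }
  return out;
}

interface InlineImagePart {
  inlineData: { data: string; mimeType: string };
}

// Builds the request part entirely in memory; nothing touches the filesystem.
function toInlineImagePart(bytes: Uint8Array, mimeType: string): InlineImagePart {
  return { inlineData: { data: toBase64(bytes), mimeType };
}
```

The resulting object lives only as long as the request that created it, so no copy of the document remains once the response has been returned.
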
Contributions are welcome and appreciated!
Feel free to:
- Fork the repository
- Create a new feature or bugfix branch
- Submit a Pull Request
- Open an issue for ideas or bugs
This project is licensed under the MIT License.
Built with ❤️ to make AI more accessible, secure, and ethical.