Apply different text recognition services to images of handwritten documents.
-
Updated
Dec 26, 2022 - Python
Apply different text recognition services to images of handwritten documents.
A back-end service that tags images automatically using Google-Vision and translate tags so you can search these images by tags in two languages.
🖼️ ➡️ 📜
A simple to use python script for Automatic License Plate Recognition using Google Cloud Vision API.
📹🧐 Add face, gender and age detection annotations to a video
Rich tagging in the Terminal via Google Vision API
Using Google Vision API this project will output the most frequent objects that show up inside a given video along with the adult likelihood ratings of the content. This was part of the development of the clickbait detection chrome extension tool that was undertaken at SLO Hacks.
This is a Django application which uses Google's Vision API to analyze images and show results on an web browser.
Solves reCAPTCHA image challenges
OCR + transliteration on arabic scanned images
Recognize fruit with Python and Google vision AI
Exact Sciences extract text from forms
A question answering system developed in python focused on solving trivia questions that apply techniques of natural language processing and web scraping.
Send multiple parallel requests to Google Vision API end point with Python
This repository contains a Python script that automates the process of searching and booking trains on the IRCTC website using Selenium and Google Cloud Vision API. The script helps users find available trains, choose a train, provide passenger details, and complete the booking process.
Extract text from PDFs using Google Vision API. This script converts PDF pages to images, preprocesses them for OCR accuracy, and uses Google Vision API for text extraction. It supports parallel processing for efficiency and saves extracted text in a structured format for each PDF.
Proxy application to the Google Vision API for the Android application
Tools to perform Optical Character Recognition (OCR) and OCR+translation on images
WorkFlow - App that performs actions based on a Flow-chart
A wake-word activated voice assistant that can recognize content from webcam captures, screenshots, and clipboard. Built with Llama 3 via Groq.
Add a description, image, and links to the google-vision-api topic page so that developers can more easily learn about it.
To associate your repository with the google-vision-api topic, visit your repo's landing page and select "manage topics."