AI-Powered-Image-Information-Extraction-with-GeminiAI

Overview

This project utilizes the GeminiAI Generative Model by GOOGLE to extract information from images based on user input. The code is designed to take an image and a user prompt, then generate relevant information using the GeminiAI API.

Getting Started

Prerequisites

Before running the code, ensure you have the required libraries installed. You can install them using:

pip install -r requirements.txt

Setting up Environment Variables

Create a .env file in the project directory and add your Google API key:

GOOGLE_API_KEY=your_api_key_here

Running the Application

Run the main.py file:

streamlit run app.py

Usage

Enter a prompt in the provided text input.
Upload an image (supported formats: jpg, jpeg, png).
Click the "RUN" button to generate information based on the input and image.

Model Information

The code uses the GeminiAI Generative Model named 'gemini-pro-vision' for image analysis.

Dependencies

Python 3.10
Streamlit
Pillow (PIL)
GeminiAI

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI-Powered-Image-Information-Extraction-with-GeminiAI

Overview

Getting Started

Prerequisites

Setting up Environment Variables

Running the Application

Usage

Model Information

Dependencies

Output

About

Releases

Packages

Languages

NDSUKESH/AI-Powered-Image-Information-Extraction-with-GeminiAI

Folders and files

Latest commit

History

Repository files navigation

AI-Powered-Image-Information-Extraction-with-GeminiAI

Overview

Getting Started

Prerequisites

Setting up Environment Variables

Running the Application

Usage

Model Information

Dependencies

Output

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages