This project is a simple web application that allows users to upload an image and ask natural language questions about it. The system uses OpenRouter's GPT-4o Vision model to generate intelligent answers based on the visual content.
- Upload any medical image through the web interface
- Ask natural language questions about the image
- Get accurate visual answers powered by GPT-4o (Vision)
- Display the image along with the question and model's answer
- Frontend: HTML(via Flask)
- Backend: Python (Flask framework)
- Model API: OpenRouter GPT-4o Vision
- Others:
requests,Pillow,matplotlib,os,base64
-
Python 3.10+
-
Install required libraries:
pip install flask requests matplotlib pillow
-
Clone or download this repository
-
Add your image(s) to the
static/uploads/folder, or upload them via the web UI -
Set your API key:
- Open
vqa_api.py - Replace
"THE-API-KEY"with your actual OpenRouter API key
- Open
-
Run the Flask app:
cd "Medical-Image-Analysis-System" python app.py
-
Visit the app in your browser:
Open browser at http://localhost