#

vqa-dataset

Here are 36 public repositories matching this topic...

MuhammadShavaiz / DL-Visual-Question-Answering

The Visual Question Answering (VQA) project features a model with a simple GUI that handles both images and videos. It uses OpenAI's CLIP for encoding images and questions and GPT-2 for decoding embeddings to answer questions based on the VQA Version 2 dataset, which includes 265,016 images with multiple questions and answers.

python pytorch openai clip vqa-dataset gpt-2

Updated Jun 24, 2024
Jupyter Notebook

jiayi-wei / vqa-tf2

tensorflow vqa san vqa-dataset stacked-attention-networks tensorflow2 visual-q

Updated May 26, 2020
Python

nishitmehta1 / Deep-Image-Understanding-Visual-Question-Answering

python machine-learning computer-vision deep vqa deeplearning cnn-model vqa-dataset

Updated Dec 15, 2018
Python

juletx / egunean-behin-vqa

Egunean Behin Visual Question Answering Dataset

qa vqa question-answering visual-question-answering vqa-dataset visual-question-generation egunean-behin

Updated Mar 31, 2022
Jupyter Notebook

dinesh-kumar-mr / MediVQA

Part of our final year project work involving complex NLP tasks along with experimentation on various datasets and different LLMs

vqa medical-application vqa-dataset vqa-med-2018 llms llms-benchmarking

Updated Jan 12, 2024
HTML

AnshDesai / visual-question-answering

Deep Learning Web app that responds to any question about an image.

nlp deep-learning spacy vqa vgg16 vqa-dataset

Updated May 12, 2020
Python

chandrakanthm / visual-question-generator

tensorflow vqa vqa-dataset natural-questions

Updated Apr 21, 2017
Python

cserajdeep / Visual-Question-Answering-VQA

Visual Question Answering (VQA)

python flask computer-vision tensorflow keras vqa vqa-dataset

Updated Oct 2, 2021
Python

radonys / CFB-VQA

VQA Challenge - hosted on Hasura using Flask

deep-neural-networks lstm vqa vgg16 keras-models hasura keras-tensorflow hackathon-project vqa-dataset doselect

Updated Apr 30, 2018
Python

abdur75648 / MedicalGPT

Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)

medical-imaging vqa llama vqa-dataset medical-dataset vicuna llm medical-report-generation llms chatgpt minigpt4 multimodal-llm medicalgpt chatgpt4o xraygpt

Updated Jun 24, 2024
Python

shivam1423 / VQA

Visual Question Answer (VQA) software! Powered by Flask, this project seamlessly combines images and questions to generate accurate responses. Explore the world of interactive visual understanding with ease.

python flask jupyter-notebook html-css-javascript vqa-dataset

Updated Jun 2, 2023
HTML

thatAverageGuy / EarlyFusion-on-EasyVQA

Streamlit app for demonstrating multi-modal(vision+language) modelling in Pytorch.

transformers pytorch visual-question-answering vqa-dataset multimodal-deep-learning streamlit early-fusion

Updated Aug 22, 2022
Python

google-research-datasets / maverics

MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering (VQA).

evaluation vqa vqa-dataset multimodal data-creation maverics vq2a

Updated Feb 18, 2023

rentainhe / TRAR-Feature-Extraction

Grid features extraction for ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"

pytorch vqa extract-features visual-question-answering vqa-dataset vqav2 iccv2021

Updated Oct 10, 2021
Python

CAMMA-public / SSG-VQA

SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos providing diverse, geometrically grounded, unbiased and surgical action-oriented queries generated using scene graphs.

scene-graph vqa-dataset surgical-data-science

Updated Aug 29, 2024
Python

ghazaleh-mahmoodi / lxmert_compression

B.Sc. Final Project: LXMERT Model Compression for Visual Question Answering.

python deep-learning pytorch vqa pruning visual-question-answering vqa-dataset

Updated Nov 22, 2023
Python

yanx27 / CLEVR3D

CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation

point-cloud scene-graph vqa-dataset scene-understanding vqa-3d

Updated Feb 2, 2024
Python

manoja328 / vqatools

API for VQA , visual 7w dataset

vqa visual7w vqa-dataset

Updated Aug 29, 2017
Jupyter Notebook

VibhuJawa / vqa-2018

This repo implements attention networks for visual question answering

pytorch attention-model vqa-dataset

Updated Dec 23, 2018
Python

gutbash / lmm-graph-vision

How well do the GPT-4V, Gemini Pro Vision, and Claude 3 Opus models perform zero-shot vision tasks on data structures?

data-structures openai vqa visual-question-answering vqa-dataset google-generative-ai gpt-4v gpt-4-vision-preview gemini-pro-vision claude-3

Updated Jun 13, 2024
Python

Improve this page

Add a description, image, and links to the vqa-dataset topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vqa-dataset topic, visit your repo's landing page and select "manage topics."