-
Updated
May 26, 2020 - Python
vqa-dataset
Here are 20 public repositories matching this topic...
-
Updated
Dec 15, 2018 - Python
Deep Learning Web app that responds to any question about an image.
-
Updated
May 12, 2020 - Python
-
Updated
Apr 21, 2017 - Python
Visual Question Answering (VQA)
-
Updated
Oct 2, 2021 - Python
VQA Challenge - hosted on Hasura using Flask
-
Updated
Apr 30, 2018 - Python
Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)
-
Updated
Jun 24, 2024 - Python
Streamlit app for demonstrating multi-modal(vision+language) modelling in Pytorch.
-
Updated
Aug 22, 2022 - Python
Grid features extraction for ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
-
Updated
Oct 10, 2021 - Python
SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos providing diverse, geometrically grounded, unbiased and surgical action-oriented queries generated using scene graphs.
-
Updated
Aug 29, 2024 - Python
B.Sc. Final Project: LXMERT Model Compression for Visual Question Answering.
-
Updated
Nov 22, 2023 - Python
CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation
-
Updated
Feb 2, 2024 - Python
This repo implements attention networks for visual question answering
-
Updated
Dec 23, 2018 - Python
How well do the GPT-4V, Gemini Pro Vision, and Claude 3 Opus models perform zero-shot vision tasks on data structures?
-
Updated
Jun 13, 2024 - Python
Investigation on VQA dataset. TensorFlow is utilized for the implementation of a solution based on CNN and RNN architectures plus some ideas such as Attention and Positional features.
-
Updated
Aug 5, 2020 - Python
Counterfactual Reasoning VQA Dataset
-
Updated
Nov 23, 2023 - Python
VQA-Med 2021
-
Updated
Jul 11, 2022 - Python
Multi-page document understanding and VQA using OCR-free method
-
Updated
May 8, 2023 - Python
The Easy Visual Question Answering dataset.
-
Updated
Oct 3, 2023 - Python
A resource list and performance benchmark for blind video quality assessment (BVQA) models on user-generated content (UGC) datasets. [IEEE TIP'2021] "UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content", Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
-
Updated
Apr 12, 2022 - Python
Improve this page
Add a description, image, and links to the vqa-dataset topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the vqa-dataset topic, visit your repo's landing page and select "manage topics."