The Multimodal Model for Vietnamese Visual Question Answering (ViVQA)
Updated Jul 29, 2024 - Python
ViAG: A novel framework for fine-tuning answer generation models utilizing encoder-decoder and decoder-only Transformer architectures
A pipeline from data construction to model deployment for the question generation task using BARTPho.