Meshed-Memory Transformer for Image Captioning. CVPR 2020
-
Updated
Dec 21, 2022 - Python
Meshed-Memory Transformer for Image Captioning. CVPR 2020
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Official Repository of OmniCaptioner
An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"
[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
[T-PAMI 2024] & [CVPR 2023] Vote2Cap-DETR; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning methods
CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Computer Vision Playground ⚡️
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
PyTorch library for Visual-Semantic tasks
To ease the driver to identify the Traffic Signs and also for the efficient working of Self-Driving Cars.
[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)
A 100% free & open-source AI Content Automation Tool that writes scripts, generates voiceovers, creates videos, and uploads them automatically — hands-free YouTube growth powered by AI.
[BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization
oCaption: Leveraging OpenAI's GPT-4 Vision for Advanced Image Captioning
Aid for blinds. This AI will describe the surrounding, it will tell who is in front of him (if that person is a known person to AI using Facial Recognition) and it will also help him to know what is written (Optical Character Recognition)
A real-time subtitle generator, based on whisper.
Microsoft COCO: Common Objects in Context for huggingface datasets
Add a description, image, and links to the caption-generation topic page so that developers can more easily learn about it.
To associate your repository with the caption-generation topic, visit your repo's landing page and select "manage topics."