OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
-
Updated
Aug 14, 2024 - Python
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
An open-source toolbox for action understanding based on PyTorch
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Code & Models for Temporal Segment Networks (TSN) in ECCV 2016
Temporal Segment Networks (TSN) in PyTorch
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
temporal action detection with SSN
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
[CVPRW'24] SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)
(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
Add a description, image, and links to the video-understanding topic page so that developers can more easily learn about it.
To associate your repository with the video-understanding topic, visit your repo's landing page and select "manage topics."