Stars
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
AlphaPose implementation in PyTorch along with the pre-trained weights
Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System
Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"
Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding”
This repository contains an official PyTorch implementation of Position-aware Location Regression Network (PLRN) for temporal video grounding, which is presented in the paper Position-aware Locatio…
VLG-Net: Video-Language Graph Matching Networks for Video Grounding
Official implementation for Hierarchical Deep Residual Reasoning for Temporal Moment Localization
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
Implementation code and instructions for the proposed work "Cross-modal Dynamic Networks for Video Moment Retrieval with Text Query" (CDN).
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
Summary of the papers related to video moment retrieval / video grounding / video moment localization ...
Convolutional neural network model for video classification trained on the Kinetics dataset.
Action recognition models in PyTorch
Train an I3D model on UCF101 or HMDB51 with TensorFlow
GitHub repository for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"
Cross-Modal Hashing for Efficiently Retrieving Moments in Videos
Reproduces the results of Adversarial Cross-Modal Retrieval (ACMR)
Deep Supervised Cross-modal Retrieval (CVPR 2019, PyTorch Code)
Repository of proposal-free temporal moment localization work
Code for "Uncovering Hidden Challenges in Query-Based Video Moment Retrieval"
Dense Regression Network for Video Grounding (CVPR2020)
Temporal Moment (Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval