A simple motion extraction for video inspired from posy
-
Updated
Aug 13, 2024 - Python
A simple motion extraction for video inspired from posy
Video feature extractor in PyTorch.
Repository with code to extract different features from video and images.
deep_video_extraction is a powerful repository designed to extract deep feature representations from video inputs using pre-trained models. With support for both visual and aural features from videos. Additionally, you can process audio separately by converting it into spectrograms.
Motion energy features from video
Feature Extractor module for videos using the PySlowFast framework
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Add a description, image, and links to the video-features topic page so that developers can more easily learn about it.
To associate your repository with the video-features topic, visit your repo's landing page and select "manage topics."