Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
-
Updated
Oct 26, 2024 - Python
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Pytorch port of Google Research's VGGish model used for extracting audio features.
Audio classification with VGGish as feature extractor in TensorFlow
Machine learning model for bird songs recognition
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
🏆 🏆 Top-1 Submission to CORSMAL Challenge 2020 (at ICPR). The winning solution for the CORSMAL Challenge (on Intelligent Sensing Summer School 2020)
My music tech thesis prototype as well as recent class project
Query service to serve the JibJib TensorFlow model
Re-Implementation of Google Research's VGGish model used for extracting audio features using Pytorch with GPU support.
This is where I store files and documents relating to my graduation project
Add a description, image, and links to the vggish topic page so that developers can more easily learn about it.
To associate your repository with the vggish topic, visit your repo's landing page and select "manage topics."