This is where I store files and documents relating to my graduation project
Updated Jan 10, 2020 - Python
PyTorch re-implementation of Google Research's VGGish model for extracting audio features, with GPU support.
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
My music tech thesis prototype, as well as a recent class project
Query service to serve the JibJib TensorFlow model
🏆 🏆 Top-1 submission to the CORSMAL Challenge 2020 (at ICPR). The winning solution for the CORSMAL Challenge (at Intelligent Sensing Summer School 2020)
Machine learning model for bird song recognition
Audio classification with VGGish as feature extractor in TensorFlow
PyTorch port of Google Research's VGGish model used for extracting audio features.
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.