Audio processing by using pytorch 1D convolution network
-
Updated
Feb 13, 2024 - Python
Audio processing by using pytorch 1D convolution network
kapre: Keras Audio Preprocessors
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
A packaged convolutional voice activity detector for noisy environments.
Perform three types of feature extraction: STFT, MFCC and MelSpectrogram. Apply CNN/VGG with or without RNN architecture. Able to achieve 95% accuracy.
Learnable STRF, from Riad et al. 2021 JASA
An example repository to analyze cough audio data using transfer learning
This repository is mainly about the classification of music genres of Kaggle music dataset in various forms of data preparation
There are already many projects underway to extensively monitor birds by continuously recording natural soundscapes over long periods. However, as many living and nonliving things make noise, the analysis of these datasets is often done manually by domain experts. These analyses are painstakingly slow, and results are often incomplete.
Using Deep Convolutional Generative Adversarial Networks to generate new spoken digits.
Fall 2021 Introduction to Deep Learning - Homework 3 Part 2 (RNN-based phoneme recognition)
Add a description, image, and links to the melspectrogram topic page so that developers can more easily learn about it.
To associate your repository with the melspectrogram topic, visit your repo's landing page and select "manage topics."