🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
-
Updated
Jun 6, 2024
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
A collection of datasets for the purpose of emotion recognition/detection in speech.
open-source audio datasets
Python library for handling audio datasets.
A library built for easier audio self-supervised training, downstream tasks evaluation
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
This package aims at simplifying the download of the AudioSet dataset.
A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code
KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a list of YouTube playlists or YouTube channels, KATube will generate dataset with audios and texts.
Download speech datasets (English and non-English) for Automatic Speech Recognition
[v.1.0] Lingualibre Languages Gallery in VueJS.
Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.
top dataset for voice conversion models
This repository contains the resources our team used through the course of the CLEF competition.
Add a description, image, and links to the audio-datasets topic page so that developers can more easily learn about it.
To associate your repository with the audio-datasets topic, visit your repo's landing page and select "manage topics."