A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
-
Updated
Sep 22, 2024
A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
Simple script that creates a speech dataset quickly
Persian spoken digit recognition
top dataset for voice conversion models
Deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.
393-Hours-Korean-Children-Speech-Data-by-Mobile-Phone
Corpus, dataset of speech recording in 50 languages
2-People-New-Zealand-English-Average-Tone-Speech-Synthesis-Corpus
Download speech datasets (English and non-English) for Automatic Speech Recognition
ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎷️ The audio:speeches category for AI2001, containing speech datasets
A simple CNN-LSTM deep neural model using Tensorflow to classify emotions from a speech dataset
Voice activity detection and speaker gender segmentation audiovisual corpus
A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.
Numpy-librosa implementation of Speech dataset pipeline
Construct a speech dataset and implement an algorithm for trigger word detection (sometimes also called keyword detection, or wakeword detection).
Add a description, image, and links to the speech-dataset topic page so that developers can more easily learn about it.
To associate your repository with the speech-dataset topic, visit your repo's landing page and select "manage topics."