#

speech-dataset

Here are 23 public repositories matching this topic...

MahtaFetrat / GPTInformal-Persian-Speech-Dataset

A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject

text-to-speech tts speech-synthesis persian data-collection data-preprocessing speech-processing forced-alignment speech-dataset speech-corpus dataset-preparation persian-speech tts-dataset text-to-speech-dataset mana-tts speech-data-collection manatts

Updated Sep 22, 2024

PanosAntoniadis / fast-recorder

Simple script that creates a speech dataset quickly

recorder speech-to-text sphinx-4 speech-dataset

Updated Jul 13, 2019
Python

Ralireza / PSDR

Persian spoken digit recognition

speech-recognition persian speech-recognizer speech-analysis speech-dataset persian-speech-recognition persian-spoken-digit persian-dataset

Updated Jul 28, 2019
Python

nafiuny / voice_conversion_dataset

top dataset for voice conversion models

python text-to-speech tts dataset speech-to-text datasets pyth voice-conversion vc speech-dataset audio-datasets voice-dataset voice-datasets audio-dataset tts-dataset vc-dataset

Updated Oct 28, 2023

petrichorwq / DECRO-dataset

Deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.

speech-dataset deepfake-detection

Updated Sep 14, 2023

Nexdata-AI / 393-Hours-Korean-Children-Speech-Data-by-Mobile-Phone

393-Hours-Korean-Children-Speech-Data-by-Mobile-Phone

children speech-recognition korean speech-dataset children-speech-recognition

Updated Aug 8, 2024

cyrta / 50languages

Corpus, dataset of speech recording in 50 languages

corpus speech speech-dataset

Updated Mar 23, 2018
PHP

Nexdata-AI / 2-People-New-Zealand-English-Average-Tone-Speech-Synthesis-Corpus

2-People-New-Zealand-English-Average-Tone-Speech-Synthesis-Corpus

text-to-speech speech-synthesis speech-analysis speech-dataset

Updated Aug 8, 2024

Rumeysakeskin / Speech-Datasets-for-ASR

Download speech datasets (English and non-English) for Automatic Speech Recognition

speech-synthesis speech-recognition speech-to-text speech-processing asr speech-dataset audio-datasets voice-datasets common-voice-dataset voxforge-dataset

Updated Jan 22, 2023
Jupyter Notebook

MahtaFetrat / ManaTTS-Persian-Speech-Dataset

ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

text-to-speech tts speech-synthesis persian data-collection data-preprocessing speech-processing forced-alignment speech-dataset speech-corpus dataset-preparation persian-speech tts-dataset text-to-speech-dataset mana-tts speech-data-collection

Updated Sep 13, 2024
Jupyter Notebook

MahtaFetrat / VirgoolInformal-Speech-Dataset

A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.

tts persian speech-processing asr forced-alignment speech-dataset persian-speech-recognition asr-evaluation persian-speech-dataset persian-text-to-speech speech-data-collection persian-speech-corpus

Updated Sep 13, 2024
Jupyter Notebook

ruslan-corpus / ruslan-corpus.github.io

text-to-speech tts russian speech-dataset speech-corpus

Updated Aug 29, 2019
HTML

mborsdorf / TargetLanguageExtraction

audio multilingual python deep-learning matlab pytorch speech-processing audio-processing source-separation speech-separation speech-dataset auditory-attention speech-corpus speaker-extraction speech-database

Updated Feb 8, 2022

mborsdorf / GlobalPhoneMS_Scripts

multilingual python deep-learning matlab speech-separation speech-dataset auditory-attention

Updated Sep 6, 2021
MATLAB

AI2001_Category-Audio-SC-Speeches

seanpm2001 / AI2001_Category-Audio-SC-Speeches

🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎷️ The audio:speeches category for AI2001, containing speech datasets

gplv3 dataset r-language md txt gpl3 speech-dataset audio-dataset rmarkdown-language ai2001 ai-2001 ai2001-dataset ai-2001-dataset ai2001-development ai-2001-development speech-audio-dataset

Updated Mar 17, 2023
R

KanishkNavale / Speech-Emotion-Recognition

A simple CNN-LSTM deep neural model using Tensorflow to classify emotions from a speech dataset

deep-learning tensorflow cnn lstm speech-emotion-recognition speech-dataset

Updated Jun 1, 2022
Jupyter Notebook

ina-foss / InaGVAD

Voice activity detection and speaker gender segmentation audiovisual corpus

radio benchmark corpus tv dataset gender audio-segmentation voice-activity-detection gender-prediction speech-dataset gender-bias speech-activity-detection speaker-gender speech-corpus audio-dataset audiovisual-dataset acoustic-diversity gender-representation

Updated Jun 6, 2024
Jupyter Notebook

gauthelo / kallaama-speech-dataset

A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.

natural-language-processing agriculture speech-processing speech-dataset senegal-language

Updated Apr 29, 2024

revsic / speechset

Numpy-librosa implementation of Speech dataset pipeline

preprocessor tts vocoder speech-dataset

Updated Jan 18, 2023
Python

manankshastri / Trigger-Word-Detection

Construct a speech dataset and implement an algorithm for trigger word detection (sometimes also called keyword detection, or wakeword detection).

python deep-learning rnn gated-recurrent-units speech-dataset trigger-word-detection

Updated Apr 14, 2019
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-dataset topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-dataset topic, visit your repo's landing page and select "manage topics."