UniSpeech - Large Scale Self-Supervised Learning for Speech
-
Updated
Apr 5, 2024 - Python
UniSpeech - Large Scale Self-Supervised Learning for Speech
The dataset of Speech Recognition
This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).
Repository for "LLM-based speaker diarization correction: A generalizable approach" paper
A demo to show Speech Diarization (seperating audio of different speaker) and converting them to text using Google Cloud Speech API.
Speech transcription and speech diarization
Add a description, image, and links to the speech-diarization topic page so that developers can more easily learn about it.
To associate your repository with the speech-diarization topic, visit your repo's landing page and select "manage topics."