Stars
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Chatbot system for a final year project, written in Python using the Natural Language Toolkit (NLTK) and machine learning. Easy to understand and implement.
Audio Codec Speech processing Universal PERformance Benchmark
Awesome speech/audio LLMs, representation learning, and codec models
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
The official repository of Dynamic-SUPERB.
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Awesome Learn From Model Beyond Fine-Tuning: A Survey
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Using Pre-trained SSL Transformer Models for Speaker Verification
eurecom-asp / spk_anon_nac_lm
Forked from m-pana/spk_anon_nac_lm
Author's code of "Speaker anonymization using neural audio codec language models" (ICASSP 2024).
SpeechGPT Series: Speech Large Language Models
Audio generation using diffusion models, in PyTorch.
Awesome Incremental Learning
A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024
《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》
The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"
A curated list of awesome voice conversion projects and communities.
Joint training of a speaker encoder with a consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
VMamba: Visual State Space Models; code is based on Mamba