Skip to content
View CaptainPrice2023's full-sized avatar

Highlights

  • Pro

Block or report CaptainPrice2023

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,434 1,084 Updated Dec 27, 2024

Chatbot system for Final Year Project. Chatbot made in Python using Natural Language Toolkit especially Machine Learning. Easy to Understand and Implement.

Jupyter Notebook 168 31 Updated Aug 12, 2022

Audio Codec Speech processing Universal PERformance Benchmark

Python 236 22 Updated Nov 1, 2024

Awesome speech/audio LLMs, representation learning, and codec models

796 48 Updated Dec 21, 2024

Survey on speech generation work.

15 Updated Nov 26, 2023

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,466 266 Updated Nov 8, 2024

Target Speaker Extraction Toolkit

Python 137 16 Updated Nov 6, 2024

The official repository of Dynamic-SUPERB.

Python 167 89 Updated Nov 11, 2024

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

627 35 Updated Aug 3, 2024
Python 92 10 Updated Aug 26, 2024

X-LoRA: Mixture of LoRA Experts

Python 192 10 Updated Aug 4, 2024

Update ASR paper everyday

Python 92 6 Updated Dec 27, 2024

Awesome Learn From Model Beyond Fine-Tuning: A Survey

53 2 Updated Dec 16, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,496 798 Updated Dec 26, 2024

Using Pre-trained SSL Transformer Models for Speaker Verification

Python 6 Updated Sep 22, 2024

Author's code of "Speaker anonymization using neural audio codec language models" (ICASSP 2024).

Python 2 1 Updated Aug 8, 2024

SpeechGPT Series: Speech Large Language Models

Python 1,322 87 Updated Jul 22, 2024

Audio generation using diffusion models, in PyTorch.

Python 1,993 168 Updated Jun 12, 2023

Awesome Incremental Learning

3,876 579 Updated Oct 22, 2024

A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024

13 Updated Jul 27, 2024

《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》

74 5 Updated Jun 9, 2023

The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 127 6 Updated Nov 14, 2024

The Open Source Code of UniAudio

Python 532 32 Updated Jul 22, 2024

A curated list of awesome voice conversion, projects and communities.

212 13 Updated Nov 19, 2024

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python 139 22 Updated Oct 16, 2023
Python 34 2 Updated Apr 15, 2024

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,383 113 Updated Dec 24, 2024

语音方向实验室/公司/资源/实习等,欢迎推荐或自荐

534 64 Updated Nov 13, 2024

[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"

Python 251 3 Updated Dec 23, 2024

VMamba: Visual State Space Models,code is based on mamba

Python 2,306 152 Updated Oct 28, 2024
Next