Stars
C++ classes for designing high-order Butterworth IIR & equalization filters
Biquad filter implementation in C using PortAudio
A Collection of Useful C++ Classes for Digital Signal Processing
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Headers, libs, and scripts for Max external development
Faster Whisper transcription with CTranslate2
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
Port of OpenAI's Whisper model in C/C++
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
A fully working PyTorch implementation of NaturalSpeech (Tan et al., 2022)
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Best-practice TTS based on BERT and VITS, with some of Microsoft's NaturalSpeech features; supports ONNX streaming output.
Implementation of NaturalSpeech 2, a zero-shot speech and singing synthesizer, in PyTorch
CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone
ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
Muzic: Music Understanding and Generation with Artificial Intelligence
Bark Voice Cloning and Voice Cloning for Chinese Speech
Finetune VITS and MMS using HuggingFace's tools
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.