State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
-
Updated
Nov 13, 2025 - Python
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Audio Codec Speech processing Universal PERformance Benchmark
Uniswap Universal Router SDK - Decode and Encode Transactions - Uniswap V2, V3 & V4
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
[CVPR'19, ICLR'20] A Python toolbox for modeling and optimization of photo acquisition & distribution pipelines (camera ISP, compression, forensics, manipulation detection)
A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.
Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities
Trainging, inference, and testing of the SAC speech codec model.
Official implementation of the "Efficient Video Compression via Content-Adaptive Super-Resolution" paper in Tensorflow.
Python SCALE-Codec
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
A collections of audio codecs with a standardized API
A neural speech codec based on discrete WavLM representations
Add a description, image, and links to the codec topic page so that developers can more easily learn about it.
To associate your repository with the codec topic, visit your repo's landing page and select "manage topics."