State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
-
Updated
Jul 11, 2024 - Python
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Audio Codec Speech processing Universal PERformance Benchmark
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Uniswap Universal Router SDK - Decode and Encode Transactions - Uniswap V2, V3 & V4
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
[CVPR'19, ICLR'20] A Python toolbox for modeling and optimization of photo acquisition & distribution pipelines (camera ISP, compression, forensics, manipulation detection)
A low-bitrate single-codebook 16 kHz speech codec based on focal modulation
Official implementation of the "Efficient Video Compression via Content-Adaptive Super-Resolution" paper in Tensorflow.
Python SCALE-Codec
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
Higher-Order Ambisonics Codec for Spatial Audio
A neural speech codec based on discrete WavLM representations
FFmpeg GUI with AviSynth support for deinterlacing and profile configuration that can be used in frameserving. Created as a replacement for inflexible batch files that do not allow multiple encodings at the same time easily and make configuration complicated.
🖼️ Let's Make a Binary Face
Add a description, image, and links to the codec topic page so that developers can more easily learn about it.
To associate your repository with the codec topic, visit your repo's landing page and select "manage topics."