Starred repositories
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.
Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.
An unofficial implementation of DeepVQE proposed by Microsoft Corp.
A collection of small Python games made by me using the pygame and tkinter libraries
MATLAB script of Independent Low-Rank Matrix Analysis (ILRMA)
End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation
#1 Locally hosted web application that allows you to perform various operations on PDF files
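A visual no-code/code-free web crawler/spider. 易采集: a visual browser automation testing, data collection, and web crawling tool that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service wrapping system for web applications.
The most concise PyTorch implementation of the Transformer model, with detailed comments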
Source code for a THD implementation based on Matlab.
Acoustic Echo Cancellation with Neural Kalman Filtering
A generative speech model for daily dialogue.
Foundational Models for State-of-the-Art Speech and Text Translation
Some traditional algorithms for sound source localization (DOA estimation) of speech signals
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
Open-Sora: Democratizing Efficient Video Production for All
Generative Models by Stability AI
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
SpeechGPT Series: Speech Large Language Models
Open source short video automatic generation tool
Production first, nn-based on-device signal processing toolkit.