Skip to content
View codezjx's full-sized avatar
😱
Overtime
😱
Overtime

Block or report codezjx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

asr

11 repositories

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 9,816 1,107 Updated Jan 16, 2026

Whisper realtime streaming for long speech-to-text transcription and translation

Python 3,519 411 Updated Nov 12, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 93,346 11,662 Updated Dec 15, 2025

Port of OpenAI's Whisper model in C/C++

C++ 45,884 5,117 Updated Jan 16, 2026

Open source real-time translation app for Android that runs locally

C++ 9,556 857 Updated Jan 15, 2026

Efficient Inference of Transformer models

C++ 478 44 Updated Aug 7, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,736 1,170 Updated Nov 14, 2024

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,305 5,365 Updated Sep 22, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 19,138 2,142 Updated Jan 12, 2026

Multilingual Voice Understanding Model

Python 7,378 684 Updated Dec 30, 2025

SOTA Open Source TTS

Python 24,633 2,045 Updated Jan 8, 2026