Skip to content
View imvladikon's full-sized avatar

Highlights

  • Pro

Block or report imvladikon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

speech recognition

5 repositories

Official code for Wav2Seq

Python 96 12 Updated Jul 19, 2022

Robust Speech Recognition via Large-Scale Weak Supervision

Python 82,091 9,899 Updated May 13, 2025

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,352 128 Updated Apr 24, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,861 327 Updated Jan 8, 2025

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 765 83 Updated Apr 1, 2025