Skip to content
View imvladikon's full-sized avatar

Highlights

  • Pro

Block or report imvladikon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

speech recognition

5 repositories

Official code for Wav2Seq

Python 96 12 Updated Jul 19, 2022

Robust Speech Recognition via Large-Scale Weak Supervision

Python 79,736 9,583 Updated Jan 4, 2025

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,334 123 Updated Apr 24, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,824 321 Updated Jan 8, 2025

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 730 80 Updated Apr 1, 2025