Skip to content
View imvladikon's full-sized avatar

Highlights

  • Pro

Block or report imvladikon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

speech recognition

5 repositories

Official code for Wav2Seq

Python 96 12 Updated Jul 19, 2022

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,585 8,796 Updated Dec 1, 2024

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,250 119 Updated Apr 24, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,670 303 Updated Oct 28, 2024

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 520 51 Updated Dec 11, 2024