Vladimir Gurevich (imvladikon)

Stars

5 repositories

Official code for Wav2Seq

Python · 96 stars · 12 forks · Updated Jul 19, 2022

Robust Speech Recognition via Large-Scale Weak Supervision

Python · 82,091 stars · 9,899 forks · Updated May 13, 2025

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python · 1,352 stars · 128 forks · Updated Apr 24, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python · 3,861 stars · 327 forks · Updated Jan 8, 2025

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python · 765 stars · 83 forks · Updated Apr 1, 2025