Open source inference code for Rev's model
-
Updated
Oct 28, 2024 - Python
Open source inference code for Rev's model
SOVA ASR (Automatic Speech Recognition)
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Summarization, topic generation using GPT3
An implementation of RNN-Transducer loss in TF-2.0.
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js.
fine-tune Wav2vec2. an ASR model released by Facebook
An end to end ASR Transformer model training repo
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.
Audio to Audio (Whisper+ChatGPT+Bark)
QuartzNet implementation for Automatic Speech Recognition task
Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding tools/apps.
Indonesian Speech Dataset
Kunming Dialect Speech Dataset
Add a description, image, and links to the asr-model topic page so that developers can more easily learn about it.
To associate your repository with the asr-model topic, visit your repo's landing page and select "manage topics."