Starred repositories
Clone a voice in 5 seconds to generate arbitrary speech in real-time
TensorFlow code and pre-trained models for BERT
Code for the paper "Language Models are Unsupervised Multitask Learners"
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Manipulate audio with a simple and easy high-level interface
Speech recognition module for Python, supporting several engines and APIs, online and offline.
PyTorch implementation of convolutional neural network visualization techniques
Speech-to-Text-WaveNet: End-to-end sentence-level English speech recognition based on DeepMind's WaveNet and TensorFlow
Deep neural networks for voice conversion (voice style transfer) in TensorFlow
A dark style sheet for QtWidgets applications
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
Speech Recognition using DeepSpeech2.
Chinese speech recognition; Mandarin Automatic Speech Recognition
For running psychology and neuroscience experiments
Command line utility for forced alignment using Kaldi
A collection of links and notes on forced alignment tools
Generating Images from Captions with Attention
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
[NeurIPS'19 Oral] CORnet: Modeling the Neural Mechanisms of Core Object Recognition
Generate cochleagrams natively in Python. Ported from Josh McDermott's MATLAB code.
Auralisation of learned features in CNN (for audio)