Starred repositories
Data and analysis for supervised/unsupervised phonetic adaptation experiment
For running psychology and neuroscience experiments
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Command line utility for forced alignment using Kaldi
Speech Recognition using DeepSpeech2.
Audio and Music Analysis and Synthesis in Python
Generate cochleagrams natively in Python. Ported from Josh McDermott's MATLAB code.
Manipulate audio with a simple and easy high level interface
[NeurIPS'19 Oral] CORnet: Modeling the Neural Mechanisms of Core Object Recognition
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Dropbox Uploader is a BASH script which can be used to upload, download, list or delete files from Dropbox, an online file sharing, synchronization and backup service.
A repository of definition files for bootstrapping Singularity containers around the software applications, frameworks, and libraries you need to run on high-performance computing systems.
中文语音识别; Mandarin Automatic Speech Recognition;
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
TensorFlow code and pre-trained models for BERT
Code for the paper "Language Models are Unsupervised Multitask Learners"
Generating Images from Captions with Attention
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow