Automatic Lip reading Model using 3D CNN and GRU
-
Updated
Jul 29, 2024 - Python
Automatic Lip reading Model using 3D CNN and GRU
An open-source library for recognition of speech commands in the user dictionary using audiovisual data of the speaker
This model seeks to decipher sequences of lip movements captured in video frames and translate them into meaningful spoken language or phonetic representations.
Inspired by the LipNet Paper at https://arxiv.org/abs/1611.01599 and the Visual Speech Recognition for Multiple Languages in the Wild paper at https://arxiv.org/pdf/2202.13084v2.pdf
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Code repo for NTUA DSML MSc thesis
The official implementation of OpenSR (ACL2023 Oral)
Deep Visual Speech Recognition in arabic words
Deep Visual Speech Recognition in arabic words
EMOLIPS: TWO-LEVEL APPROACH FOR LIP-READING EMOTIONAL SPEECH
A novel lipreading system that improves on the task of speaker-independent word recognition by decoupling motion and content dynamics.
Tracking mouth movement(Lips)
In this repository, I try to use k2, icefall and Lhotse for lip reading. I will modify it for the lip reading task. Many different lip-reading datasets should be added. -_-
An multi modal automated proctor for online exams
A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.
Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition"
My experiments in lip reading using deep learning with the LRW dataset
Speaker-Independent Speech Recognition using Visual Features
Repository for the paper "Lip Reading in unconstrained driving scenario with Greek words"
Our project's source code and documentation as part of the requirements for Graduation Project-2 (CCEN481) in Computer Engineering Program at Cairo University Faculty of Engineering
Add a description, image, and links to the lip-reading topic page so that developers can more easily learn about it.
To associate your repository with the lip-reading topic, visit your repo's landing page and select "manage topics."