Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.
-
Updated
Aug 17, 2020 - C++
Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.
Audio-driven facial animation generator with BiLSTM used for transcribing the speech and web interface displaying the avatar and the animation
Speaker-Independent Speech Recognition using Visual Features
Add a description, image, and links to the visual-speech topic page so that developers can more easily learn about it.
To associate your repository with the visual-speech topic, visit your repo's landing page and select "manage topics."