Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
audio-visual-speech-recognition interpretability visual-speech-recognition lipreading robust-asr parameter-efficient
-
Updated
Feb 24, 2025 - Python