List of papers and resources for multimodal transformers
-
Multimodal Transformer for Unaligned Multimodal Language Sequences, ACL 2019, https://github.com/yaohungt/Multimodal-Transformer
-
SELF-SUPERVISED LEARNING WITH CROSS-MODAL TRANSFORMERS FOR EMOTION RECOGNITION, SLT 2020
-
Multimodal Transformer Fusion for Continuous Emotion Recognition, ICASSP 2020
-
Multimodal transformer models https://github.com/georgian-io/Multimodal-Toolkit
-
Low Rank Fusion based Transformers for Multimodal Sequences, ACL 2020
-
Modulated Fusion using Transformer for Linguistic-Acoustic EmotionRecognition, ACL 2020, https://github.com/jbdel/modulated_fusion_transformer
-
Attending to Emotional Narratives, ACII 2019, https://github.com/frankaging/ACII2019-transformer
-
VATT: Transformers for Multimodal Self-Supervised Learningfrom Raw Video, Audio and Text, https://arxiv.org/pdf/2104.11178.pdf
-
Multimodal Cross-and Self-Attention Network for Speech Emotion Recognition, ICASSP 2021