[AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, Jin Tang
clip
vehicle-tracking
vehicle-detection
prior
mae
pre-training
vehicle-perceptron
large-modal
large-langauge-model
vehicle-segmentation
vehicle-attribute-recognition
vehicle-fine-grained-classification
vision-text-contrastive
-
Updated
Jul 29, 2024 - Python