[Preprint] TRACE: Temporal Grounding Video LLM via Casual Event Modeling
dense-video-captioning
video-highlight-detection
multimodal-large-language-models
video-large-language-models
video-temporal-grounding
-
Updated
Nov 8, 2024 - Python