Dataset | Language | # Hate | # Non-Hate | Resource |
---|---|---|---|---|
HateMM | English | 431 | 652 | BitChute |

Dataset | Language | # Hateful | # Offensive | # Normal | Resource |
---|---|---|---|---|---|
MultiHateClip | English | 82 | 256 | 662 | YouTube |
MultiHateClip | Chinese | 128 | 194 | 678 | Bilibili |

"Is there any hateful content in this video? Respond 'Yes' or 'No' and explain why."

MLLMs | Accuracy | Precision | Recall | F1 |
---|---|---|---|---|
Closed-source | | | | |
Gemini-1.5-pro | 0.64380 | 0.42741 | 0.94642 | 0.58889 |
Azure AI Video Indexer | | | | |
Open-source | | | | |
VideoChat2 | | | | |
VideoLLaMA2 (30Frames) | 0.62442 | 0.54811 | 0.30536 | 0.39221 |
VideoLLaMA2-AV | 0.47166 | 0.40212 | 0.75622 | 0.52504 |
LLaVA-Next-Video (Image-Text, 24Frames) | 0.55863 | 0.46252 | 0.67285 | 0.54820 |
LLaVA-OneVision (Image-Text, 24Frames) | 0.65836 | 0.80198 | 0.18794 | 0.30451 |
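
The prompt above asks for a leading 'Yes' or 'No', and the table reports accuracy, precision, recall, and F1. The following is a minimal sketch of how such replies could be turned into binary predictions and scored. It is not the authors' evaluation code: the function names `parse_verdict`, `binary_metrics`, and `f1_from_precision_recall` are hypothetical, the sample replies and labels are made up for illustration, and it assumes that "hateful" is the positive class and that a reply is only usable when it starts with 'Yes' or 'No'.

```python
import re
from typing import Dict, List, Optional


def parse_verdict(response: str) -> Optional[bool]:
    """Map a free-text reply to True (hateful), False (not hateful), or None.

    Assumption: the verdict is the first word of the reply, possibly wrapped
    in quotes or other punctuation. Anything else is treated as unparseable.
    """
    match = re.match(r"\W*(yes|no)\b", response.strip(), flags=re.IGNORECASE)
    if match is None:
        return None
    return match.group(1).lower() == "yes"


def binary_metrics(y_true: List[bool], y_pred: List[bool]) -> Dict[str, float]:
    """Accuracy, precision, recall, and F1 with True as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t and p)
    fp = sum(1 for t, p in pairs if not t and p)
    fn = sum(1 for t, p in pairs if t and not p)
    tn = sum(1 for t, p in pairs if not t and not p)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs) if pairs else 0.0,
        "precision": precision,
        "recall": recall,
        "f1": f1_from_precision_recall(precision, recall),
    }


def f1_from_precision_recall(precision: float, recall: float) -> float:
    """F1 is the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Illustrative replies and gold labels (made up, not taken from any dataset).
replies = [
    "Yes, the speaker uses slurs against a group.",
    "No. The clip is a cooking tutorial.",
    "'Yes' because of the on-screen text.",
    "No, nothing hateful is shown.",
]
gold = [True, False, False, True]

# Drop unparseable replies; how to handle them is a design choice.
scored = [(g, parse_verdict(r)) for g, r in zip(gold, replies)]
scored = [(g, p) for g, p in scored if p is not None]
print(binary_metrics([g for g, _ in scored], [p for _, p in scored]))
```

The F1 helper can also serve as a consistency check on the table: plugging in the reported precision and recall for a row should reproduce its F1 value up to rounding. Whether unparseable replies are dropped, as here, or counted as errors changes the reported numbers, so that choice should be stated alongside any results.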