Here is some relevant work:
- “BAH Dataset for Ambivalence/Hesitancy Recognition in Videos for Behavioural Change”. [arXiv][Code][Dataset Download][Page]
- “PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI Localization”. [arXiv][Code]
- “CoLo-CAM: Class Activation Mapping for Object Co-Localization in Weakly-Labeled Unconstrained Videos”. [arXiv][Code]
- “TeD-Loc: Text Distillation for Weakly Supervised Object Localization”. [arXiv][Code]
- “A Realistic Protocol for Evaluation of Weakly Supervised Object Localization”. [arXiv][Code]
- “Textualized and Feature-based Models for Compound Multimodal Emotion Recognition in the Wild”. [arXiv][Code text-based][Code feature-based]
- “Joint Multimodal Transformer for Dimensional Emotional Recognition in the Wild”. [arXiv][Code]
- “SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution”. [arXiv][Code][Download Dataset][Page][Hugging Face Spaces]