Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)
sound-detection sound-localization audio-visual-learning seldnet yolov5 seld yolov8 dcase2023 detic audio-visual-seld
-
Updated
Mar 21, 2025 - Python