Song identification combining landmark audio fingerprinting with CNN/AST/MERT embeddings trained with a contrastive InfoNCE loss; benchmarked on Jamendo, with strong robustness to noise.
deep-learning transformers music-information-retrieval shazam mir audio-fingerprinting audio-search contrastive-learning song-identification audio-retrieval audio-embeddings
Updated Aug 13, 2025 - Python