AI · Bullish · arXiv – CS AI · 5h ago · 7/10
FIGMA: Towards FIne-Grained Music retrievAl
Researchers introduce FIGMA, a multi-view contrastive learning architecture that significantly improves music retrieval from fine-grained musical attributes such as tempo, key, and chord progression. The work addresses a fundamental limitation of existing CLAP-based models, which fail to process detailed musical descriptions. It achieves up to a 73.3% relative improvement and contributes a new 380K music-caption dataset (FGMCaps).