MusTBENCH: Benchmarking and Advancing Temporal Grounding in Music LLMs
Researchers introduce MusTBENCH, a benchmark for evaluating temporal grounding capabilities in Large Audio-Language Models (LALMs) for music understanding, and propose MusT, an optimization framework that significantly improves model performance on time-sensitive musical tasks like instrument entries and rhythmic transitions.