AI · Neutral · arXiv – CS AI · 6h ago · 6/10
DeRA-MOS: Optimizing Text-to-Music Evaluation via Decoupled Listwise Ranking and Modality Alignment
Researchers introduce DeRA-MOS, a framework for evaluating text-to-music generation systems that replaces traditional point-wise regression with decoupled listwise ranking and modality alignment. The approach is reported to significantly improve the accuracy of automatic scores for both musical quality and text alignment, reducing reliance on expensive human evaluation.
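To make the contrast between point-wise regression and listwise ranking concrete, here is a minimal sketch in plain Python. It is not the paper's method: the listwise objective shown is a generic ListNet-style top-one cross-entropy chosen for illustration, the function names are hypothetical, and the scores are made-up values. The decoupling and modality-alignment components are not sketched here.

```python
import math

def pointwise_mse(pred, target):
    """Point-wise regression: each clip's predicted score is fit on its own."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def listwise_loss(pred, target):
    """ListNet-style loss (an assumption, not taken from the paper):
    cross-entropy between the top-one distributions induced by the
    target scores and by the predicted scores over the same list."""
    p_target = softmax(target)
    p_pred = softmax(pred)
    return -sum(t * math.log(p) for t, p in zip(p_target, p_pred))

# Illustrative (made-up) human ratings for three clips from one prompt.
human_mos = [4.2, 3.1, 2.5]

# Predictions with the correct ordering but a constant offset.
shifted = [s - 1.0 for s in human_mos]

# Predictions with the ordering reversed.
reversed_order = list(reversed(human_mos))

print("pointwise, shifted: ", pointwise_mse(shifted, human_mos))
print("listwise, shifted:  ", listwise_loss(shifted, human_mos))
print("listwise, exact:    ", listwise_loss(human_mos, human_mos))
print("listwise, reversed: ", listwise_loss(reversed_order, human_mos))
```

In this sketch the point-wise loss penalizes the constant offset even though the ranking is correct, while the listwise loss is unchanged by the offset and rises only when the relative ordering of the clips changes.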