AI · Neutral · arXiv – CS AI · 10h ago · 6/10
LLM-Based Multi-Reference Evaluation for Efficient and Robust Assessment of Phrase Break Annotations
Researchers propose LLM-Based Multi-Reference Evaluation (LMRE), a new method for assessing phrase break annotations in speech that acknowledges multiple valid phrasings rather than assuming a single correct interpretation. Tested on 1,356 Korean annotations, LMRE demonstrates stronger alignment with human judgment than traditional single-reference approaches, suggesting large language models can effectively evaluate prosodic speech characteristics at scale.
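To make the single-reference versus multi-reference contrast concrete, here is a minimal sketch in Python. It is not the LMRE method itself: the summary does not say how LMRE obtains its references or computes its scores. The function names `break_f1` and `multi_reference_score`, the representation of breaks as sets of word indices, and the choice of best-match F1 are all assumptions made for illustration.

```python
def break_f1(predicted, reference):
    """F1 between two sets of phrase break positions.

    Each position is the index of a word after which a break is placed
    (a hypothetical representation chosen for this sketch).
    """
    predicted, reference = set(predicted), set(reference)
    if not predicted and not reference:
        return 1.0  # both agree that there are no breaks
    overlap = len(predicted & reference)
    if overlap == 0:
        return 0.0
    precision = overlap / len(predicted)
    recall = overlap / len(reference)
    return 2 * precision * recall / (precision + recall)


def multi_reference_score(predicted, references):
    """Score an annotation by its best match among several valid references."""
    return max(break_f1(predicted, ref) for ref in references)


# Two equally acceptable phrasings of the same sentence (illustrative data only).
references = [{2, 5}, {3, 5}]
annotation = {3, 5}

single = break_f1(annotation, references[0])           # judged against one reference
multi = multi_reference_score(annotation, references)  # judged against all references
print(single, multi)
```

Under these assumptions, an annotation that matches the second valid phrasing is penalized when only the first reference is consulted, but receives full credit once every acceptable phrasing is considered.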