AI · Neutral · arXiv – CS AI · 11h ago · 6/10
WASIL: In-the-Wild Arabic Spoken Interactions with LLMs
Researchers released WASIL, a dataset of 8,529 Arabic spoken interactions with LLMs that includes audio, transcriptions, and user feedback, to address how automatic speech recognition (ASR) errors degrade voice assistant performance. The dataset includes a 2,000-turn test set covering Modern Standard Arabic and four dialects, with annotations that distinguish genuinely unanswerable requests from ASR-induced failures, enabling more accurate evaluation of voice AI systems.