AINeutralarXiv โ CS AI ยท 7h ago6/10
๐ง
Beyond MCQ: An Open-Ended Arabic Cultural QA Benchmark with Dialect Variants
Researchers have created the first comprehensive Arabic Cultural QA benchmark that translates questions across Modern Standard Arabic and regional dialects, converting multiple-choice questions into open-ended formats. Testing reveals that large language models significantly underperform on dialectal content and struggle with open-ended Arabic questions, highlighting critical gaps in culturally grounded language understanding.