y0news
#safety-evaluation (1 article)
AI · Neutral · arXiv – CS AI · 16h ago · 6/10

SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models

Researchers introduce SalamahBench, the first comprehensive safety benchmark for Arabic language models, and use it to evaluate five state-of-the-art models on 8,170 prompts spanning 12 safety categories. The study finds significant safety vulnerabilities in current Arabic models, with safety alignment varying substantially across harm domains.