AINeutralarXiv โ CS AI ยท 16h ago6/10
๐ง
SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models
Researchers introduce SalamaBench, the first comprehensive safety benchmark for Arabic Language Models, evaluating 5 state-of-the-art models across 8,170 prompts in 12 safety categories. The study reveals significant safety vulnerabilities in current Arabic AI models, with substantial variation in safety alignment across different harm domains.