y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#safety-evaluation News & Analysis

2 articles tagged with #safety-evaluation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles
AINeutralarXiv โ€“ CS AI ยท Mar 66/10
๐Ÿง 

SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models

Researchers introduce SalamaBench, the first comprehensive safety benchmark for Arabic Language Models, evaluating 5 state-of-the-art models across 8,170 prompts in 12 safety categories. The study reveals significant safety vulnerabilities in current Arabic AI models, with substantial variation in safety alignment across different harm domains.

AINeutralOpenAI News ยท Jan 316/104
๐Ÿง 

OpenAI o3-mini System Card

OpenAI has released a system card detailing the safety work conducted for its new o3-mini model. The report covers safety evaluations, external red teaming exercises, and assessments under OpenAI's Preparedness Framework to ensure responsible deployment.