AIBearisharXiv โ CS AI ยท 4h ago4
๐ง
ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI
Researchers have developed ForesightSafety Bench, a comprehensive AI safety evaluation framework covering 94 risk dimensions across 7 fundamental safety pillars. The benchmark evaluation of over 20 advanced large language models revealed widespread safety vulnerabilities, particularly in autonomous AI agents, AI4Science, and catastrophic risk scenarios.