y0news

#safety-benchmark News & Analysis

2 articles tagged with #safety-benchmark. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Neutral · arXiv – CS AI · Jun 10 · 7/10

PreAct-Bench: Benchmarking Predictive Monitoring in LLMs

Researchers introduce PreAct-Bench, a benchmark for evaluating LLMs' ability to predict unethical behavior from partial action trajectories before harmful actions occur. The study reveals that predictive monitoring remains a significant challenge even for advanced models, highlighting a critical gap in proactive AI safety mechanisms.

AI · Neutral · arXiv – CS AI · Mar 17 · 7/10

From Evaluation to Defense: Advancing Safety in Video Large Language Models

Researchers introduce VideoSafetyEval, a benchmark revealing that video-based large language models show 34.2% worse safety performance than image-based models. They also develop VideoSafety-R1, a dual-stage framework that achieves a 71.1% improvement in safety through alarm token-guided fine-tuning and safety-guided reinforcement learning.