y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#safety-benchmark News & Analysis

1 article tagged with #safety-benchmark. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

1 articles
AINeutralarXiv โ€“ CS AI ยท Mar 177/10
๐Ÿง 

From Evaluation to Defense: Advancing Safety in Video Large Language Models

Researchers introduced VideoSafetyEval, a benchmark revealing that video-based large language models have 34.2% worse safety performance than image-based models. They developed VideoSafety-R1, a dual-stage framework that achieves 71.1% improvement in safety through alarm token-guided fine-tuning and safety-guided reinforcement learning.