y0news
#safety-benchmark
1 article
AI · Neutral · arXiv – CS AI · 10h ago · 7/10

From Evaluation to Defense: Advancing Safety in Video Large Language Models

Researchers introduced VideoSafetyEval, a benchmark showing that video-based large language models perform 34.2% worse on safety than image-based models. They also developed VideoSafety-R1, a dual-stage framework that combines alarm token-guided fine-tuning with safety-guided reinforcement learning and achieves a 71.1% improvement in safety.