AI · Neutral · arXiv CS AI · 10h ago · 7/10
From Evaluation to Defense: Advancing Safety in Video Large Language Models
Researchers introduced VideoSafetyEval, a benchmark showing that video large language models perform 34.2% worse on safety than image-based models. They also developed VideoSafety-R1, a dual-stage framework that combines alarm token-guided fine-tuning with safety-guided reinforcement learning and achieves a 71.1% improvement in safety.