#mllm-safety News & Analysis

3 articles tagged with #mllm-safety. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles

AIBearisharXiv – CS AI · Jun 27/10

🧠

PaSBench-Video: A Streaming Video Benchmark for Proactive Safety Warning

Researchers introduce PaSBench-Video, a 740-video benchmark designed to evaluate multimodal large language models' ability to issue timely safety warnings in streaming video scenarios. Testing 13 MLLMs reveals that no model exceeds 20% accuracy on strict metrics, with models struggling to distinguish emerging hazards from routine activities, particularly in driving scenarios where safe and dangerous scenes appear visually similar.

AINeutralarXiv – CS AI · Jun 126/10

🧠

MLUBench: A Benchmark for Lifelong Unlearning Evaluation in MLLMs

Researchers introduce MLUBench, a large-scale benchmark for evaluating lifelong unlearning in multimodal large language models (MLLMs), revealing that existing methods suffer from cumulative degradation. The study identifies a unique challenge in MLLM unlearning: removing data from one modality can damage the model's multimodal alignment, and proposes LUMoE as a solution to mitigate this degradation.

AINeutralarXiv – CS AI · May 296/10

🧠

Towards Localized and Disentangled Knowledge Editing for Multimodal Large Language Models

Researchers propose LDKE, a new framework for editing knowledge in Multimodal Large Language Models that addresses two critical failure modes: causal misalignment (edits confined to specific samples) and feature entanglement (unintended alterations to related information). The method uses localized layer identification and input disentanglement to enable precise, generalized edits while preserving unrelated knowledge.