AIBearisharXiv – CS AI · 9h ago7/10
🧠
MCBench: A Multicontext Safety Assessment Benchmark for Omni Large Language Models
Researchers introduced MCBench, a new safety benchmark for multimodal AI systems that process vision, audio, and text simultaneously. Testing revealed that advanced language models struggle to integrate information across different modalities for safety-critical decisions, particularly with subtle risks lacking obvious visual or acoustic cues.