AINeutralarXiv – CS AI · Mar 95/10
🧠
VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models
Researchers introduce VLM-RobustBench, a comprehensive benchmark testing vision-language models across 133 corrupted image settings. The study reveals that current VLMs are semantically strong but spatially fragile, with low-severity spatial distortions often causing more performance degradation than visually severe photometric corruptions.