AINeutralarXiv โ CS AI ยท 15h ago5/10
๐ง
VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models
Researchers introduce VLM-RobustBench, a comprehensive benchmark testing vision-language models across 133 corrupted image settings. The study reveals that current VLMs are semantically strong but spatially fragile, with low-severity spatial distortions often causing more performance degradation than visually severe photometric corruptions.