AI · Neutral · arXiv – CS AI · 6h ago · 6/10
Towards Annotation-Free Validation of MLLMs: A Vision-Language Logical Consistency Metric
Researchers propose the Vision-Language Logical Consistency Metric (VL-LCM), an evaluation framework for multimodal large language models (MLLMs) that assesses logical coherence without requiring ground-truth annotations. Tests of 11 MLLMs on benchmarks including MMMU and NaturalBench show that accuracy has improved significantly while logical consistency lags substantially, suggesting that current models make confident but logically inconsistent predictions.