AINeutralarXiv โ CS AI ยท 17h ago6/10
๐ง
VisioMath: Benchmarking Figure-based Mathematical Reasoning in LMMs
Researchers introduced VisioMath, a new benchmark with 1,800 K-12 math problems designed to test Large Multimodal Models' ability to distinguish between visually similar diagrams. The study reveals that current state-of-the-art models struggle with fine-grained visual reasoning, often relying on shallow positional heuristics rather than proper image-text alignment.