AI · Neutral · arXiv CS AI · 5h ago
SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition
Researchers introduce SpatialBench, a comprehensive benchmark for evaluating spatial cognition in multimodal large language models (MLLMs). The benchmark reveals that while MLLMs excel at perceptual grounding, they struggle with symbolic reasoning, causal inference, and planning compared with humans, who demonstrate more goal-directed spatial abstraction.