AINeutralarXiv โ CS AI ยท 7h ago6/10
๐ง
ReactBench: A Benchmark for Topological Reasoning in MLLMs on Chemical Reaction Diagrams
Researchers introduce ReactBench, a benchmark that exposes critical limitations in multimodal large language models' ability to reason about complex topological structures in chemical reaction diagrams. Testing 17 MLLMs reveals a 30%+ performance gap between simple anchor-based tasks and sophisticated structural reasoning tasks, indicating that visual reasoning capabilities remain fundamentally constrained despite strong semantic recognition abilities.