TabularMath: Understanding Math Reasoning over Tables with Large Language Models
Researchers introduce TabularMath, a benchmark and neuro-symbolic framework for evaluating large language models' mathematical reasoning over tabular data. The study finds that LLMs struggle with complex tables, low-quality data, and inconsistent information. These are critical limitations for real-world business intelligence applications, which demand reliable numerical reasoning.