arXiv · CS AI · 4d ago
DAG-Math: Graph-of-Thought Guided Mathematical Reasoning in LLMs
Researchers introduce DAG-Math, a framework for evaluating mathematical reasoning in large language models (LLMs) that models Chain-of-Thought (CoT) as a rule-based process over directed acyclic graphs (DAGs). The framework includes a 'logical closeness' metric that reveals significant differences in reasoning quality between LLM families, even when their final-answer accuracy appears comparable.