AI · Bullish · arXiv – CS AI · 14h ago · 7/10
TRACE: Toulmin-based Reasoning Assessment through Constructive Elements for LLM CoT Evaluation
Researchers introduce TRACE, a metric that evaluates the reasoning quality of large language models' Chain-of-Thought outputs by analyzing their argument structure rather than only their final answers. The method combines Toulmin's argumentation theory with metacognitive frameworks. It correlates strongly with benchmark accuracy and improves reinforcement learning performance.