AINeutralarXiv – CS AI · 5h ago6/10
🧠
How Language Models Fail: Token-Level Signatures of Committed and Persistent Reasoning Failures
Researchers have identified two distinct failure modes in large language model reasoning: committed failures where models lock onto incorrect paths early, and persistent uncertainty failures where doubt accumulates throughout reasoning. The framework, validated across 23 model-dataset configurations, provides diagnostic signatures for detecting reasoning failures and offers practical implications for improving self-consistency methods.