AIBearisharXiv – CS AI · 7h ago7/10
🧠
Thinking Past the Answer: Evaluating Harmful Overthinking in Large Reasoning Models
Researchers demonstrate that Large Reasoning Models (LRMs) frequently 'overthink' problems after reaching correct answers, with continued reasoning degrading accuracy by up to 21%. The study introduces a protocol to measure reasoning sufficiency and reveals that harmful overthinking—where additional reasoning destabilizes correct solutions—represents a broader reliability risk affecting both multimodal and language-only models.