AIBearisharXiv – CS AI · 8h ago6/10
🧠
Coherence Under Commitment: Probing Generalization and Vacuous Memorization in LLM Logical Reasoning
Researchers introduce Coherence Under Commitment (CUC), a new evaluation framework that exposes a critical flaw in LLM logical reasoning: models can achieve coherence by refusing to make decisions rather than reasoning correctly. Testing on small language models reveals a stark trade-off where more decisive models contradict themselves frequently, while conservative models abstain from answering.