AI · Neutral · arXiv – CS AI · 10h ago · 7/10
A Verifiable Search Is Not a Learnable Chain-of-Thought
Researchers demonstrate that language models cannot reliably learn certain types of algorithmic reasoning—specifically backtracking search procedures—through chain-of-thought fine-tuning, regardless of model size or training method. While models perform individual computational steps correctly, they fail to chain those steps into valid forward derivations when the task requires combinatorial search over unstructured information.
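To make the distinction in the summary concrete, here is a minimal sketch of a backtracking search whose answer is cheap to verify. The task (subset sum over positive integers) and the names `verify` and `backtrack` are illustrative assumptions, not taken from the paper. The sketch shows how the search trace includes dead ends, while the final answer is a short selection that a one-pass check can confirm.

```python
from typing import List, Optional, Tuple


def verify(numbers: List[int], target: int, chosen: List[int]) -> bool:
    """Check a proposed answer in one pass: distinct indices that sum to target."""
    return len(set(chosen)) == len(chosen) and sum(numbers[i] for i in chosen) == target


def backtrack(numbers: List[int], target: int) -> Tuple[Optional[List[int]], List[str]]:
    """Depth-first search with backtracking; assumes all numbers are positive.

    Returns the chosen indices (or None) plus a trace of every visited state,
    including the dead ends that the final answer does not mention.
    """
    trace: List[str] = []

    def search(start: int, remaining: int, chosen: List[int]) -> Optional[List[int]]:
        trace.append(f"start={start} remaining={remaining} chosen={chosen}")
        if remaining == 0:
            return list(chosen)
        for i in range(start, len(numbers)):
            if numbers[i] <= remaining:  # prune branches that overshoot
                chosen.append(i)
                found = search(i + 1, remaining - numbers[i], chosen)
                if found is not None:
                    return found
                chosen.pop()  # dead end: undo the choice and try the next one
        return None

    return search(0, target, []), trace


if __name__ == "__main__":
    numbers = [8, 6, 7, 5, 3]  # hypothetical example input
    solution, trace = backtrack(numbers, 15)
    print("solution indices:", solution)
    print("verified:", solution is not None and verify(numbers, 15, solution))
    print("states visited:", len(trace))
```

In this toy setting each visited state is a simple local step and the final check is trivial, yet the order in which states are explored and abandoned is not visible from the answer alone. That ordering is the part the summary says models fail to chain together.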