AI · Neutral · arXiv – CS AI · 15h ago · 6/10
How Chain-of-Thought Works? Tracing Information Flow from Decoding, Projection, and Activation
Researchers have developed a mechanistic interpretability framework that traces information flow in reverse, across decoding, projection, and activation, to understand how AI models reason under Chain-of-Thought (CoT) prompting. The study finds that CoT acts as a decoding-space pruner, using answer templates to guide outputs, and that it modulates neurons in a task-dependent way: activation decreases in open-domain tasks but increases in closed-domain ones.