Latent Planning Emerges with Scale
Researchers demonstrate that large language models develop internal planning representations, enabling them to implicitly plan future outputs without verbalizing them. A study of Qwen3 models (0.6B–14B parameters) reveals mechanistic evidence of this latent planning: features in the models' hidden states both predict and shape upcoming token generation, and the strength of these planning signals grows consistently with model size.
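To make the "predict and shape" claim concrete, below is a minimal sketch of the kind of linear-probe analysis commonly used to produce such mechanistic evidence; it is not the study's actual method. The synthetic hidden states, the dimensions (`d_model`, `n_samples`), the constructed `plan_direction`, and the steering step are all illustrative assumptions, written in plain PyTorch so the example runs standalone.

```python
# Illustrative sketch, not the paper's method: train a linear probe on
# hidden states at position t to predict a property of a token emitted
# several steps later. All data here is synthetic.
import torch
import torch.nn as nn

torch.manual_seed(0)

d_model = 256     # hidden size (assumed)
n_samples = 4096  # number of (hidden_state, future_label) pairs (assumed)

# Synthetic stand-in for hidden states; a real experiment would cache
# these from the model's residual stream with forward hooks.
plan_direction = torch.randn(d_model) / d_model**0.5
hidden = torch.randn(n_samples, d_model)
# Label: a binary property of the "planned" future token, made linearly
# recoverable by construction for this toy example.
labels = (hidden @ plan_direction > 0).float()

probe = nn.Linear(d_model, 1)
opt = torch.optim.Adam(probe.parameters(), lr=1e-2)
loss_fn = nn.BCEWithLogitsLoss()

# Fit the probe: high accuracy means the future-token property is
# linearly decodable from the current hidden state ("predict").
for step in range(200):
    opt.zero_grad()
    loss = loss_fn(probe(hidden).squeeze(-1), labels)
    loss.backward()
    opt.step()

with torch.no_grad():
    acc = ((probe(hidden).squeeze(-1) > 0).float() == labels).float().mean()
print(f"probe accuracy: {acc:.3f}")

# "Shape" sketch: adding the unit-normalized probe direction to a hidden
# state is the usual causal test that the feature steers later output.
with torch.no_grad():
    direction = probe.weight.squeeze(0)
    direction = direction / direction.norm()
    steered = hidden[0] + 4.0 * direction  # scale factor is arbitrary
```

In practice the probe would be trained on hidden states cached during generation, and causal interventions like the steering step above are what separate features that merely correlate with future tokens from features that actually shape them.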