AIBullisharXiv – CS AI · 18h ago6/10
🧠
Why Limit the Residual Stream to Layers and Not Tokens? Persistent Memory for Continuous Latent Reasoning
Researchers propose AGCLR, a new method that enhances large language models' reasoning capabilities by introducing persistent memory across reasoning steps. The approach addresses a fundamental limitation in continuous latent reasoning where intermediate facts are lost as models explore deeper reasoning paths, demonstrating consistent improvements on mathematical and multi-hop reasoning benchmarks.