Path Channels and Plan Extension Kernels: a Mechanistic Description of Planning in a Sokoban RNN
Researchers reverse-engineered a Sokoban-playing RNN trained with model-free reinforcement learning and found that it represents plans in specialized hidden-state channels: path channels encode the direction of planned moves, and plan extension kernels act as a learned transition model that extends those paths. The findings show that a neural network can develop an interpretable planning algorithm without explicit supervision of its planning, with path channels and extension kernels working together to implement bidirectional search and backtracking.