🧠 AI⚪ NeutralImportance 7/10

Transformers converge to invariant algorithmic cores

arXiv – CS AI|Joshua S. Schiffman|February 27, 2026 at 05:00 AM|5 views

🤖AI Summary

Researchers have discovered that transformer models, despite different training runs producing different weights, converge to the same compact 'algorithmic cores' - low-dimensional subspaces essential for task performance. The study shows these invariant structures persist across different scales and training runs, suggesting transformer computations are organized around shared algorithmic patterns rather than implementation-specific details.

Key Takeaways

→Independently trained transformers learn different weights but converge to identical algorithmic cores necessary for task performance.
→Markov-chain transformers embed 3D cores in orthogonal subspaces yet recover identical transition spectra.
→Modular-addition transformers discover compact cyclic operators during grokking that later expand during memorization-to-generalization transition.
→GPT-2 models control subject-verb agreement through a single axis that can invert grammatical number when flipped.
→Low-dimensional invariants persist across training runs and scales, suggesting shared computational structures in transformers.