AINeutralarXiv โ CS AI ยท 6h ago1
๐ง
A Gauge Theory of Superposition: Toward a Sheaf-Theoretic Atlas of Neural Representations
Researchers propose a new gauge-theoretic framework for understanding superposition in large language models, replacing traditional single-dictionary approaches with local semantic charts. The method introduces three measurable obstructions to interpretability and demonstrates results on Llama 3.2 3B model with various datasets.