AIBullisharXiv โ CS AI ยท 4h ago6/10
๐ง
Automated Attention Pattern Discovery at Scale in Large Language Models
Researchers developed AP-MAE, a vision transformer model that analyzes attention patterns in large language models at scale to improve interpretability. The system can predict code generation accuracy with 55-70% precision and enable targeted interventions that increase model accuracy by 13.6%.