arXiv – CS AI
Negligible in Size, Significant in Effect: On Scale Vectors in Large Language Models
Researchers show that scale vectors in large language models, although they account for a negligible share of model parameters, have a significant effect on optimization and training performance. Combining theoretical analysis with empirical validation on models ranging from 0.12B to 2B parameters, the study proposes three complementary improvements to scale vector design that improve training efficiency without adding computational overhead.
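
To give a rough sense of "negligible in size", the sketch below counts parameters for a hypothetical decoder-only transformer at roughly the 0.12B scale. It is an illustration, not the paper's setup: it assumes that "scale vectors" means the learnable gain vectors of the normalization layers (two per block plus one final norm), and the dimensions (`d_model=768`, `n_layers=12`, `vocab_size=50257`) and the function `count_params` are chosen here for the example. Biases and positional embeddings are ignored.

```python
def count_params(d_model: int, n_layers: int, vocab_size: int) -> tuple[int, int]:
    """Return (matrix_params, scale_params) for a simplified transformer.

    Assumptions (hypothetical, for illustration only):
    - each block has 4*d^2 attention weights and 8*d^2 MLP weights
    - each block has two normalization layers, each with one gain vector of size d
    - one extra normalization layer after the last block
    - a single tied embedding matrix of size vocab_size * d
    """
    matrix_params = n_layers * 12 * d_model**2 + vocab_size * d_model
    scale_params = (2 * n_layers + 1) * d_model
    return matrix_params, scale_params


# Roughly 0.12B-parameter configuration (assumed values, not taken from the paper).
matrix, scale = count_params(d_model=768, n_layers=12, vocab_size=50257)
fraction = scale / (matrix + scale)

print(f"matrix parameters: {matrix:,}")
print(f"scale parameters:  {scale:,}")
print(f"scale share:       {fraction:.4%}")
```

Under these assumptions the gain vectors make up well under a tenth of a percent of all parameters, which is the sense in which they are small relative to the weight matrices.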