AIBullisharXiv – CS AI · 8h ago6/10
🧠
DynMuon: A Dynamic Spectral Shaping View of Muon
Researchers propose DynMuon, an enhancement to the Muon optimizer used in large language model training that dynamically adjusts spectral shaping parameters throughout training. The method achieves lower validation loss and requires 10.6-26.5% fewer training steps than standard Muon by shifting from positive to mildly negative spectral exponents.
$UV