AIBullisharXiv โ CS AI ยท 2d ago7/10
๐ง
HTMuon: Improving Muon via Heavy-Tailed Spectral Correction
Researchers have developed HTMuon, an improved optimization algorithm for training large language models that builds upon the existing Muon optimizer. HTMuon addresses limitations in Muon's weight spectra by incorporating heavy-tailed spectral corrections, showing up to 0.98 perplexity reduction in LLaMA pretraining experiments.
๐ข Perplexity