HTMuon: Improving Muon via Heavy-Tailed Spectral Correction
Muon优化器新突破,基于重尾谱校正解决噪声方向过量问题,助力大模型高效训练
arXiv:2603.10067v2 Announce Type: replace-cross Abstract: Muon has recently shown promising results in LLM training. In this work, we study how to fur…