Rethinking Muon Beyond Pretraining: Spectral Failures and High-Pass Remedies for VLA and RLVR
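The Muon optimizer exhibits spectral failures in vision-language alignment and reinforcement-learning fine-tuning; the authors propose a high-pass filter remedy that changes how large-model training is understood.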
arXiv:2605.19282v1 Announce Type: new Abstract: Muon is a matrix-aware optimizer that leverages Newton-Schulz (NS) iterations to enforce spectral grad…
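As a rough illustration of the Newton-Schulz step the abstract mentions, here is a minimal sketch of the textbook cubic iteration, which pushes every singular value of a gradient matrix toward 1 without computing an SVD. This is an assumption-laden sketch, not the paper's method: the function name `newton_schulz_orthogonalize`, the step count, and the test matrix are all hypothetical, and practical Muon implementations typically use a tuned higher-order polynomial with only a few steps rather than this slowly converging variant.

```python
import numpy as np


def newton_schulz_orthogonalize(grad, steps=30, eps=1e-7):
    """Approximate the orthogonal polar factor of `grad` (hypothetical helper).

    Uses the classic cubic iteration X <- 1.5*X - 0.5*(X X^T) X, which drives
    each nonzero singular value toward 1 when it starts in (0, sqrt(3)).
    """
    # Dividing by the Frobenius norm puts all singular values in (0, 1].
    x = grad / (np.linalg.norm(grad) + eps)
    # Work on the wide orientation so that X X^T is the smaller Gram matrix.
    transposed = x.shape[0] > x.shape[1]
    if transposed:
        x = x.T
    for _ in range(steps):
        x = 1.5 * x - 0.5 * (x @ x.T) @ x
    return x.T if transposed else x


# Build a 4x6 test matrix with known singular values (illustrative only).
rng = np.random.default_rng(0)
u, _ = np.linalg.qr(rng.standard_normal((4, 4)))
v, _ = np.linalg.qr(rng.standard_normal((6, 6)))
singular_values = np.array([3.0, 2.0, 1.0, 0.5])
g = u @ np.diag(singular_values) @ v[:, :4].T

o = newton_schulz_orthogonalize(g)
# The exact polar factor keeps the singular vectors and sets every singular value to 1.
polar = u @ v[:, :4].T
```

In this sketch `o` should match `polar` to within numerical tolerance, meaning the update direction keeps the gradient's singular vectors while discarding the relative magnitudes of its singular values.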