High-dimensional Limit of SGD for Diagonal Linear Networks
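The limiting behavior of SGD on diagonal linear networks from a high-dimensional perspective; combines theoretical depth with cutting-edge relevance.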
arXiv:2605.17177v1 Announce Type: cross Abstract: Understanding the behavior of stochastic gradient methods is a central problem in modern machine lea…