Eigenvectors of Experts are Training-free Non-collapsing Routers
Routing collapse is avoided without any training: the eigenvectors of the experts give MoE an efficient, non-learned routing scheme.
arXiv:2605.30992v1 Announce Type: new Abstract: Sparse Mixture of Experts (SMoE) architectures improve the training efficiency of Large Language Model…