1
SHAPE: Coalition-Aware Expert Pruning for Sparse Mixture-of-Experts LLMs
稀疏MoE大模型部署新突破:引入联盟感知策略的专家剪枝方法
arXiv:2606.09886v1 Announce Type: cross Abstract: Sparse Mixture-of-Experts (MoE) large language models achieve strong quality with low per-token comp…