1
LEAP: Learnable End-to-End Adaptive Pruning of Large Language Models
提出LEAP可学习端到端自适应剪枝方法,在保持大语言模型性能的同时实现高效压缩
arXiv:2605.17289v1 Announce Type: new Abstract: Unstructured sparsity is now natively accelerated by recent GPU kernels and dataflow hardware, shiftin…