Niuge Picks (牛哥精选) · This Month


📝 Deep Tech · arXiv · Machine Learning · 2026-06-10

SpenseGPT: Practical One-shot Pruning Enabling Sparse and Dense GEMMs for LLM Inference

A practical one-shot pruning method that supports both sparse and dense GEMM operations, significantly reducing LLM inference cost.

arXiv:2606.10445v1. Abstract: Semi-structured 2:4 sparsity is widely supported by modern accelerators, providing up to a 2x theoreti…

Tags: spensegpt, one-shot pruning, LLM inference, sparse matrix, gemm
