1
Rethinking Output Alignment For 1-bit Post-Training Quantization of Large Language Models
1-bit量化大模型新思路,输出对齐策略再审视,助力低资源设备高效推理
arXiv:2512.21651v3 Announce Type: replace Abstract: Large Language Models (LLMs) deliver strong performance across a wide range of NLP tasks, but thei…