Trainable Smooth-Rotation Transforms with Learned Channel Scales for LLM Quantization
提出可训练的平滑旋转变换与学习通道缩放,为LLM量化精度提升提供新思路
arXiv:2606.09927v1 Announce Type: cross Abstract: Post-training quantization (PTQ) is one of the most practical ways to reduce the serving cost of Lar…