1
AAAC: Activation-Aware Adaptive Codebooks for 4-bit LLM Weight Quantization
提出AAAC方法,通过激活感知自适应码本,在保持4比特精度的同时进一步降低LLM权重量化误差
arXiv:2605.08692v2 Announce Type: replace Abstract: Post-training weight-only quantization to 4 bits is widely used to reduce the memory and compute c…