1
Fitting Is Not Enough: Smoothness in Extremely Quantized LLMs
LLM极端量化中平滑性比数值拟合更重要,揭示性能下降新成因。
arXiv:2605.08894v2 Announce Type: replace-cross Abstract: Large language models (LLMs) achieve strong performance but incur high deployment costs, mot…