1
LoopQ: Quantization for Recursive Transformers
循环语言模型量化面临三大挑战,首次系统性研究揭秘其脆弱性根源
arXiv:2605.16343v1 Announce Type: new Abstract: Looped language models (LoopLMs) improve parameter efficiency by recursively reusing Transformer block…
循环语言模型量化面临三大挑战,首次系统性研究揭秘其脆弱性根源
arXiv:2605.16343v1 Announce Type: new Abstract: Looped language models (LoopLMs) improve parameter efficiency by recursively reusing Transformer block…
1-bit量化大模型新思路,输出对齐策略再审视,助力低资源设备高效推理
arXiv:2512.21651v3 Announce Type: replace Abstract: Large Language Models (LLMs) deliver strong performance across a wide range of NLP tasks, but thei…