You Had One Job: Per-Task Quantization Using LLMs' Hidden Representations
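Uses LLMs' hidden representations to perform per-task quantization, substantially improving efficiency while preserving performance; a technical advance worth watching.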
arXiv:2511.06516v3 Announce Type: replace Abstract: Many LLM applications require only narrow capabilities, yet standard post-training quantization (P…