1
A Survey of On-Policy Distillation for Large Language Models
首份大模型在线策略蒸馏综述,系统梳理方法、挑战与未来方向,适合研究者深挖。
arXiv:2604.00626v3 Announce Type: replace Abstract: As Large Language Models (LLMs) continue to grow in both capability and cost, transferring frontie…