LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation
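Long-context LLMs can be easily converted into hybrid models without retraining; the LightTransfer method enables efficient adaptation. The paper was accepted to TMLR 2025.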
arXiv:2410.13846v3 Announce Type: replace-cross Abstract: Scaling language models to handle longer contexts introduces substantial memory challenges d…