1
A Data-Efficient Path to Multilingual LLMs: Language Expansion via Post-training PARAM$\Delta$ Integration into Upcycled MoE
提出一种数据高效的多语言LLM扩展方法,通过后训练将PARAMΔ集成到升级MoE中,仅需少量目标语言文本即可显著提升性能。
arXiv:2605.18083v1 Announce Type: new Abstract: Expanding Large Language Models~(LLMs) to new languages is a costly endeavor, demanding extensive Cont…