1
TeamTR: Trust-Region Fine-Tuning for Multi-Agent LLM Coordination
针对多智能体LLM协调中序列微调导致上下文分布偏移的缺陷,提出信任区域微调方法,有效提升团队协同表现。
arXiv:2605.15207v1 Announce Type: new Abstract: Multi-agent LLM systems have shown promise for complex reasoning, yet recent evaluations reveal they o…