What Makes Interaction Trajectories Effective for Training Terminal Agents?
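Challenges the assumption that stronger code agents make better teachers and examines what makes interaction trajectories effective for training terminal agents: the Terminal-Lego pipeline scalably disentangles the multiple factors involved.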
arXiv:2606.03461v1 Announce Type: new Abstract: Stronger code agents are commonly assumed to be superior teachers for post-training, yet this assumpti…