1
HAI-Eval: Measuring Human-AI Synergy in Collaborative Coding
新研究提出人机协同编码评估系统,填补现有测试与LLM基准的空白,聚焦真实协作场景。
arXiv:2512.04111v2 Announce Type: replace-cross Abstract: LLM-powered coding agents are reshaping the development paradigm. However, existing evaluati…