FormulaCode: Evaluating Agentic Optimization on Large Codebases
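How do LLM coding agents optimize large codebases? This paper introduces the FormulaCode benchmark, which evaluates holistic optimization ability in real-world settings, going beyond traditional synthetic tasks and binary signals.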
arXiv:2603.16011v2 Announce Type: replace-cross Abstract: Large language model (LLM) coding agents increasingly operate at the repository level, motiv…