SWE-Mutation: Can LLMs Generate Reliable Test Suites in Software Engineering?
A study of the reliability of LLM-generated test suites, revealing a new bottleneck in software engineering evaluation.
arXiv:2605.22175v1 Announce Type: cross Abstract: Evaluating software engineering capabilities has become a core component of modern large language mo…