1
PerfCodeBench: Benchmarking LLMs for System-Level High-Performance Code Optimization
最新基准测试揭示LLM在系统级高性能代码优化上的能力短板,推动更实用的性能评测标准
arXiv:2605.15222v1 Announce Type: cross Abstract: Large language models (LLMs) can often generate functionally correct code, but their ability to prod…