Towards Automated Kernel Generation in the Era of LLMs
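Frontier research on using LLMs to automatically generate GPU kernels and break through the performance bottlenecks of AI systems.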
arXiv:2601.15727v3 Announce Type: replace Abstract: The performance of modern AI systems is fundamentally constrained by the quality of their underlyi…
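The first generalization-aware benchmark for evaluating GPU kernel optimization agents, advancing the practical adoption of automated tuning.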
arXiv:2605.16819v1 Announce Type: cross Abstract: GPU kernel optimization is increasingly critical for efficient deep learning systems, but writing hi…