1
Enhancing the Code Reasoning Capabilities of LLMs via Consistency-based Reinforcement Learning
基于一致性强化学习的新方法,有效提升大模型代码推理能力。
arXiv:2605.17958v1 Announce Type: new Abstract: Code reasoning refers to the task of predicting the output of a program given its source code and spec…