1
Beyond the Bellman Recursion: A Pontryagin-Guided Framework for Non-Exponential Discounting
用庞特里亚金最优控制原理突破传统强化学习折扣框架,为认知与经济学中的非指数折扣提供新解法。
arXiv:2605.20996v1 Announce Type: new Abstract: Most value-based and actor--critic reinforcement learning methods rely on Bellman-style recursions, ye…