1
Reinforcement Learning with Action-Triggered Observations
强化学习新范式:动作触发部分可观测的MDP,理论推导贝尔曼方程
arXiv:2510.02149v2 Announce Type: replace Abstract: We introduce Action-Triggered Sporadically Traceable Markov Decision Processes (ATST-MDPs), a rein…