1
Target-Aligned Bellman Backup for Cross-domain Offline Reinforcement Learning
跨领域离线强化学习新方法,用对齐贝尔曼备份解决域间转移难题。
arXiv:2605.22376v1 Announce Type: new Abstract: Cross-domain offline reinforcement learning (CDRL) aims to improve policy learning in a target domain …