1
MATE: Solving Contextual Markov Decision Processes with Memory of Accumulated Transition Embeddings
本论文提出MATE方法,通过记忆累积转移嵌入解决上下文MDP中的长期依赖问题,为强化学习领域贡献新思路。
arXiv:2605.17431v1 Announce Type: new Abstract: We propose MATE, a simple yet effective memory architecture for solving Contextual Markov Decision Pro…