1
Best-of-Both-Worlds for Heavy-Tailed Markov Decision Processes
提出首个在重尾MDP上同时实现随机与对抗环境最优遗憾的BoBW算法,突破保守局限。
arXiv:2602.01295v3 Announce Type: replace Abstract: We investigate episodic Markov Decision Processes with heavy-tailed losses (HTMDPs). Existing appr…