1
Distributions as Actions: A Unified Framework for Diverse Action Spaces
提出将参数化动作分布视为动作的新型强化学习框架,统一离散、连续与混合动作空间,简化智能体设计。
arXiv:2506.16608v3 Announce Type: replace-cross Abstract: We introduce a novel reinforcement learning (RL) framework that treats parameterized action …