1
Collaborative Yet Personalized Policy Training: Single-Timescale Federated Actor-Critic
联邦强化学习新突破:单时间尺度actor-critic框架平衡协作与个性化,适合多智能体异构环境。
arXiv:2605.14423v1 Announce Type: cross Abstract: Despite the popularity of the actor-critic method and the practical needs of collaborative policy tr…