1
Deep Double Q-learning
经典Double Q-learning的深度强化学习新范式,通过完全解耦动作选择与评估彻底消除最大化偏差。
arXiv:2507.00275v2 Announce Type: replace-cross Abstract: Double Q-learning is a classical control algorithm that mitigates the maximization bias of Q…
经典Double Q-learning的深度强化学习新范式,通过完全解耦动作选择与评估彻底消除最大化偏差。
arXiv:2507.00275v2 Announce Type: replace-cross Abstract: Double Q-learning is a classical control algorithm that mitigates the maximization bias of Q…