Critic-Driven Voronoi-Quantization for Distilling Deep RL Policies to Explainable Models
介绍一种批评驱动Voronoi量化方法,实现深度强化学习策略向可解释模型的高效蒸馏,解决性能-可解释性权衡难题。
arXiv:2605.14897v1 Announce Type: cross Abstract: Despite many successful attempts at explaining Deep Reinforcement Learning policies using distillati…