1
Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning
让大模型在稀疏奖励环境中引导强化学习策略,通过不确定性估计提升决策可靠性,有代码可复现。
arXiv:2606.06673v1 Announce Type: new Abstract: Sparse rewards and heterogeneous task sequences remain persistent challenges in Reinforcement Learning…