1
Offline Contextual Bandits in the Presence of New Actions
新动作出现时,离线上下文bandit如何优化?这篇论文提出解决方案,提升推荐系统等场景的决策效果。
arXiv:2605.18509v1 Announce Type: new Abstract: Automated decision-making algorithms drive applications such as recommendation systems and search engi…