1
LoVeC: Reinforcement Learning for Better Verbalized Confidence in Long-Form Generations
用强化学习提升长文本生成中置信度表达,直击大模型幻觉难题。
arXiv:2505.23912v2 Announce Type: replace-cross Abstract: Hallucination remains a major challenge for the safe and trustworthy deployment of large lan…