牛哥精选 · 三个月

📋 全部 🤖 AI·大模型 ⚡ 效率工具 📝 深度技术 🚀 产品观察 💰 商业科技 🔓 开源项目 🎨 设计创意 📖 阅读推荐 🏷 资源合集 🌱 成长效率

🤖 AI·大模型 arXiv AI 2026-07-07

The Anatomy of Uncertainty in LLMs

系统解剖LLM中不确定性的来源与量化方法，为可信AI提供理论基础。

arXiv:2603.24967v2 Announce Type: replace Abstract: Understanding why a large language model (LLM) is uncertain about the response is important for th…

llm 不确定性模型校准可解释ai 置信度估计

🤖 AI·大模型 arXiv AI 2026-06-01

Shared Doubt: Zero-shot Cross-Lingual Confidence Estimation for Language Models

首次提出零样本跨语言置信度估计方法，无需目标语言标注即可评估模型预测的可靠性。

arXiv:2605.31220v1 Announce Type: cross Abstract: Confidence estimation (CE), i.e. quantifying the reliability of a model's prediction, has attracted …

零样本跨语言置信度估计语言模型共享不确定性

📝 深度技术 arXiv AI 2026-05-19

LoVeC: Reinforcement Learning for Better Verbalized Confidence in Long-Form Generations

用强化学习提升长文本生成中置信度表达，直击大模型幻觉难题。

arXiv:2505.23912v2 Announce Type: replace-cross Abstract: Hallucination remains a major challenge for the safe and trustworthy deployment of large lan…

强化学习置信度估计幻觉检测长文本生成大语言模型

📅 日期

2026-05-20 2026-05-19

🐂 牛哥精选

The Anatomy of Uncertainty in LLMs

Shared Doubt: Zero-shot Cross-Lingual Confidence Estimation for Language Models

LoVeC: Reinforcement Learning for Better Verbalized Confidence in Long-Form Generations

📅 日期