牛哥精选 · 本月

🤖 AI·大模型 Ars Technica 2026-05-29

LLMs believe false statements even after explicit warnings that they're false

研究揭示LLM在训练数据中植入错误信念后，即使明确警告也无法纠正，警示AI安全与事实性漏洞。

Fine-tuning tests show "bias ... toward confidently representing the claims as true."

llm 错误信念训练数据 ai安全可信度

2026-05-20 2026-05-19