牛哥精选 · 半年

1

🔗 链接工具 Hacker News Show 2026-06-25

Show HN: Screen recording your crappy startup

牛哥评测初创网站，屏幕录制展示真实使用体验，诚实点评助你快速迭代。

Hi HN. I've been recording various startups for YouTube after they ask for it and I review it and give my honest opinion and show the difficulties I h…

屏幕录制初创评测网站测试用户体验诚实反馈

2

🤖 AI·大模型 36氪 2026-06-22

广州市番禺协诚实业有限公司副总经理龙德洋：真正的城市服务，藏在分分秒秒的坚守里

“2026年，创投圈的浪潮再次翻涌：AI从技术概念走进产业深水区，硬科技创业从“小众赛道” 变成“主流共识”，年轻的创业者们正在用代码和双手，重新定义中国创新的未来坐标。每一年，由36氪 · 暗涌主办的WAVES大会，都是中国创投圈的年度风向标。今年的 WAVES 2026以“今年盛夏”为主题，…

广州市番禺协诚实业有限公司副总经理龙德洋真正的城市服

3

📝 深度技术 arXiv AI 2026-06-11

Goal-Autopilot: A Verifiable Anti-Fabrication Firewall for Unattended Long-Horizon Agents

论文提出一种可验证的反伪造防火墙，让LLM代理在无人监督时无法虚报成功，将诚实作为自主任务的首要度量。

arXiv:2606.11688v1 Announce Type: cross Abstract: Long-horizon LLM agents are not trusted to run unattended: with no human watching, they confidently …

llm代理反伪造防火墙可验证性长期任务诚实度量

4

🤖 AI·大模型 arXiv AI 2026-06-10

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

用强化学习让大模型更诚实，TruthRL方法提升LLM回答真实性，含代码开源

arXiv:2509.25760v2 Announce Type: replace-cross Abstract: While large language models (LLMs) have demonstrated strong performance on factoid question …

truthrl 强化学习 llm 诚实性幻觉

5

🤖 AI·大模型 arXiv NLP 2026-06-10

Parametric Knowledge is Not All You Need: Toward Honest Large Language Models via Retrieval of Pretraining Data

揭秘LLM「说谎」根源：论文提出用检索预训练数据替代纯参数知识，实现更诚实的AI输出。

arXiv:2601.21218v2 Announce Type: replace Abstract: Large language models (LLMs) are highly capable of answering questions, but they are often unaware…

大语言模型检索增强诚实性预训练数据知识幻觉

6

🤖 AI·大模型 Hacker News AI 2026-06-05

Lying is Best. The Most Honest AI Won Anyway.

诚实AI在博弈中获胜，揭秘为何“说谎更好”却“诚实更优”

Article URL: https://kradle.ai/research/four-bridges Comments URL: https://news.ycombinator.com/item?id=48402435 Points: 4 # Comments: 2

ai 诚实行为策略 grok 博弈

7

📝 深度技术 arXiv AI 2026-06-01

Used Car Salesbots? Honesty and Credulity of LLMs as Bargaining Agents under Partial Information

LLM在信息不对称的二手车议价中会如何表现？这篇论文揭示了AI谈判员诚实与轻信的双重面貌。

arXiv:2605.31445v1 Announce Type: cross Abstract: In this work we study agents in simulated bargaining scenarios, where a buyer and a seller communica…

llm 谈判代理信息不对称二手车议价诚实性

8

🤖 AI·大模型 The Verge 2026-05-28

Claude’s new model is more ‘honest’ when it messes up

Claude Opus 4.8主打“诚实”，模型会主动承认不确定性，减少胡编乱造，让AI更值得信任。

Anthropic is releasing Claude Opus 4.8 on Thursday, and the company is touting the model's "honesty." According to Anthropic, it trains "all [its] mod…

claude 坦白 ai模型 anthropic 诚实

9

🤖 AI·大模型 arXiv AI 2026-05-27

The AI Cognitive Trojan Horse: How Large Language Models May Bypass Human Epistemic Vigilance

探讨大型语言模型如何像认知特洛伊木马一样，绕开人类的认知警惕，揭示AI在信息传播中的潜在安全风险。

arXiv:2601.07085v2 Announce Type: replace-cross Abstract: Large language model (LLM)-based conversational AI systems present a challenge to human cogn…

llm 认知安全人类警惕性信息操控演化生物学

10

📝 深度技术 OpenAI 官方博客 2026-05-19

How confessions can keep language models honest

OpenAI秘密武器：让大模型学会“认错”，诚实度飙升的秘密在这篇研究里！

OpenAI researchers are testing “confessions,” a method that trains models to admit when they make mistakes or act undesirably, helping improve AI hone…

ai诚实模型对齐 openai研究错误检测透明度

11

📖 阅读推荐 UI设计参考 2026-05-19

A Thousand Ships

一次深夜约会中的坦诚告白，像一颗石子投入平静水面，激起关于爱、诚实与自由意志的涟漪。

“I have a partner and I live with him,” O said abruptly on our third date as we were in bed together. She was embarrassed to say it, her eyes impossib…

情感关系诚实约会内心独白个人叙事

🐂 牛哥精选