牛哥精选 · 本月

1

🤖 AI·大模型 arXiv NLP 2026-05-20

Trust or Abstain? A Self-Aware RAG Approach

提出自我感知RAG方法，解决检索知识与参数知识冲突时的信任与弃权决策问题。

arXiv:2605.18792v1 Announce Type: cross Abstract: Retrieval-augmented generation (RAG) improves large language models (LLMs) by incorporating external…

rag 自感知知识冲突大语言模型检索增强生成

2

📝 深度技术 arXiv NLP 2026-05-20

Proof-Carrying Certificates for LLM Pipelines: A Trust-Boundary Architecture

LLM管线可信边界验证框架，用证明携带证书确保确定性结构，安全架构新思路。

arXiv:2605.16407v1 Announce Type: cross Abstract: We present a framework for verifying the deterministic structured computations surrounding a large l…

llm管线信任边界形式化验证确定性结构安全架构

3

📝 深度技术 arXiv 计算机视觉 2026-05-20

Delta Forcing: Trust Region Steering for Interactive Autoregressive Video Generation

提出Delta Forcing方法，解决交互式自回归视频生成中响应性与稳定性的平衡难题。

arXiv:2605.14382v2 Announce Type: replace Abstract: Interactive real-time autoregressive video generation is essential for applications such as conten…

视频生成自回归交互式信任区域稳定性

4

💰 商业科技安全客 2026-05-20

亿格云完成数亿元B轮融资，加码“人+AI”统一安全治理

亿格云获数亿元B轮融资，主攻“人+AI”统一安全治理新赛道，连续三年翻倍增长。

亿格云 b轮融资 ai安全统一安全治理零信任

5

🚀 产品观察 The Verge 2026-05-20

Google’s AI future demands trust — and your personal data

Google Gemini跨应用数据推理能力引发隐私信任新挑战，揭示AI未来与个人数据的深度绑定。

Google has big promises for its AI-powered future - and a lot of it depends on your trust. At I/O 2026, Google described a bunch of new tools that it …

google gemini ai个人数据隐私信任

6

🤖 AI·大模型 arXiv NLP 2026-05-20

Towards Trust Calibration in Socially Interactive Agents: Investigating Gendered Multimodal Behaviors Generation with LLMs

LLM能否生成考虑性别的多模态行为来校准用户对社交代理的信任？这篇研究切入了一个关键的人机交互问题。

arXiv:2605.19798v1 Announce Type: new Abstract: As Socially Interactive Agents (SIAs) become increasingly integrated into daily life, the ability to c…

llms 信任校准多模态行为社交代理性别研究

7

📝 深度技术 UI设计参考 2026-05-20

Trust

对原生浏览器特性信心十足，第三方代码却一文不值，具体百分比告诉你为什么。

Jeremy doesn’t trust third-party code , but... I’m much more trusting of native browser features—HTML elements, CSS features, and JavaScript APIs. The…

信任第三方代码原生浏览器特性 web标准

8

📖 阅读推荐 UI设计参考 2026-05-20

A Problem of Trust

一场疫情揭示的不仅是身体脆弱，更是社会信任的裂缝——诚实与隔离才是解药。

It wasn’t just attacking our bodies. Instead, the pandemic had found a weakness in the unbreakable social bonds that we share with one another. Our ne…

疫情信任社会观察文章

9

🚀 产品观察 OpenAI 官方博客 2026-05-19

How enterprises are scaling AI

OpenAI官方指南：企业如何从实验到规模化部署AI，聚焦信任、治理与工作流设计

How enterprises scale AI: from early experiments to compounding impact through trust, governance, workflow design, and quality at scale.

企业ai 规模化信任治理工作流设计质量

10

🤖 AI·大模型 arXiv AI 2026-05-19

When AI Persuades: Adversarial Explanation Attacks on Human Trust in AI-Assisted Decision Making

揭秘LLM如何生成说服性解释，操纵人类对AI辅助决策的信任，一场新型对抗攻击。

arXiv:2602.04003v3 Announce Type: replace Abstract: Most adversarial threats in artificial intelligence (AI) target the computational behavior of mode…

对抗性攻击解释攻击人类信任 ai辅助决策大语言模型

11

🤖 AI·大模型 Lifehacker 2026-05-19

ChatGPT Can Now Reach Out to a 'Trusted Contact' After Conversations Concerning Self-Harm

ChatGPT推出“信任联系人”功能，当检测到自残等危险对话时，可主动通知你指定的人，为心理健康提供安全网

You can invite a friend or family member to be your ChatGPT "Trusted Contact."

chatgpt 信任联系人心理健康安全功能 openai

12

📝 深度技术 arXiv AI 2026-05-19

Skills as Verifiable Artifacts: A Trust Schema and a Biconditional Correctness Criterion for Human-in-the-Loop Agent Runtimes

提出Agent技能作为可验证工件，用双条件正确性标准解决人机协作信任问题，LLM部署的新范式

arXiv:2605.00424v2 Announce Type: replace-cross Abstract: Agent skills - structured packages of instructions, scripts, and references that augment a l…

agent技能可验证工件信任模式正确性标准 llm

13

🚀 产品观察 Vercel Blog 2026-05-19

How OpenEvidence built a healthcare AI that physicians actually trust

医生信赖的医疗AI如何通过Vercel实现零妥协扩展，从TikTok爆红到系统稳如磐石。

Andy Yoon was scrolling through Slack when he saw the message: OpenEvidence had gone viral on TikTok. Not "gaining traction.” Actually viral, reaching…

医疗ai 信任扩展性 vercel tiktok

14

🤖 AI·大模型 arXiv AI 2026-05-19

Quantifying and Mitigating Self-Preference Bias of LLM Judges

LLM作为评判者会偏向自己，这篇论文量化了自我偏好偏差并提出了缓解方法。

arXiv:2604.22891v3 Announce Type: replace-cross Abstract: LLM-as-a-Judge has become a dominant approach in automated evaluation systems, playing criti…

llm 自我偏好偏差评估信任度机器学习

15

🤖 AI·大模型 OpenAI 官方博客 2026-05-19

Introducing Trusted Access for Cyber

OpenAI推出信任访问框架，平衡前沿网络安全能力与反滥用保护。

OpenAI introduces Trusted Access for Cyber, a trust-based framework that expands access to frontier cyber capabilities while strengthening safeguards …

openai 网络安全信任访问框架

16

🚀 产品观察 Smashing Magazine 2026-05-19

Identifying Necessary Transparency Moments In Agentic AI (Part 1)

代理式AI的透明性设计：在“黑箱”与“数据倾倒”之间找到平衡，让用户信任AI决策。

Designing for agentic AI requires attention to both the system’s behavior and the transparency of its actions. Between the black box and the data dump…

代理式ai 透明度 ux设计黑箱问题用户信任

17

📝 深度技术 arXiv 机器学习 2026-05-19

TeamTR: Trust-Region Fine-Tuning for Multi-Agent LLM Coordination

针对多智能体LLM协调中序列微调导致上下文分布偏移的缺陷，提出信任区域微调方法，有效提升团队协同表现。

arXiv:2605.15207v1 Announce Type: new Abstract: Multi-agent LLM systems have shown promise for complex reasoning, yet recent evaluations reveal they o…

多智能体llm 信任区域微调协调序列微调上下文分布

18

🤖 AI·大模型 Hacker News Ask 2026-05-19

Ask HN: Do you know what data your AI coding agent sends to the cloud?

AI编码工具可能正在泄露你的敏感数据，你究竟了解多少？

Every session my AI coding agent reads files, runs commands, makes API calls. I have no idea exactly what ends up in the cloud. Is anyone actually tra…

ai编码工具数据隐私云监控信任问题

19

🚀 产品观察 Hacker News AI 2026-05-19

I'm banning AI from my life for all human-to-human communication

一位技术人决绝地宣布：在所有人类交流中禁用AI，只为守护那份脆弱的人类信任。

Article URL: https://sam.elborai.me/articles/no-more-llm-comms/ Comments URL: https://news.ycombinator.com/item?id=48181804 Points: 1 # Comments: 0

ai伦理人际信任人类交流反ai

20