Niu Ge's Picks · Past six months

1

🤖 AI · Large Models arXiv AI 2026-07-14

Reinforcement Learning with Verifiable Physics: Post-training LLMs with Continuous Rewards

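Uses physical laws to give LLMs a continuous reward signal, making reinforcement-learning post-training more interpretable and more efficient.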

arXiv:2607.10474v1 Announce Type: cross Abstract: Partial differential equations (PDEs) are foundational to modeling in science and engineering, but c…

reinforcement learning, verifiable physics, continuous rewards, llm post-training, physics-driven ai

2

📝 Deep Tech arXiv AI 2026-07-14

YUKTI: From Natural-Language Situations to Robust, Verifiable Decisions An Uncertainty-Typed Proposition IR, Assumption-Robust Pareto Frontiers, and a Regret Certificate

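Are language models unreliable decision-makers? YUKTI proposes a new framework that goes from natural language to robust, verifiable decisions, challenging the confidence trap of conventional single-objective optimization.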

arXiv:2607.09706v1 Announce Type: new Abstract: Language models turn a worded situation into a numeric plan, and the dominant pipelines (NL4Opt, OptiM…

natural language processing, decision robustness, language models, optimization, verifiability

3

🤖 AI · Large Models arXiv Machine Learning 2026-07-08

Strategic Bargaining in Multi-Buyer Markets: Reinforcement Learning from Verifiable Rewards for LLM Negotiations

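Combines reinforcement learning with verifiable rewards so that LLMs learn to negotiate strategically in multi-buyer markets: a new take on game theory.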

arXiv:2607.05863v1 Announce Type: new Abstract: Negotiation is a fundamental strategic interaction in management science, characterized by agents atte…

multi-buyer markets, strategic negotiation, verifiable rewards, reinforcement learning, large language models

4

🎨 Design Tools QbitAI 2026-07-06

OpenSquilla releases 0.5.0 Preview: multi-model ensemble tops both DRACO leaderboards, with the latest flagship Fable 5 among the compared models

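An open-source AI agent built on a multi-model ensemble, with smart routing to cut costs, self-organizing workflows, and verifiable coding; it tops the leaderboards in a comparison that includes the flagship Fable 5.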

release, multi-model ensemble, dual leaderboards, comparison list, latest flagship

5

🤖 AI · Large Models arXiv AI 2026-06-16

VeriGraph: Towards Verifiable Data-Analytic Agents

A new answer to the verification problem for LLM data agents: VeriGraph uses a graph structure to make analyses more trustworthy.

arXiv:2606.16603v1 Announce Type: cross Abstract: LLM-based agents have demonstrated strong capabilities in data-intensive analytical tasks, yet their…

verigraph, verifiable data agents, graph structure, large-model applications, data trustworthiness

6

📝 Deep Tech arXiv AI 2026-06-11

Goal-Autopilot: A Verifiable Anti-Fabrication Firewall for Unattended Long-Horizon Agents

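The paper proposes a verifiable anti-fabrication firewall that prevents unattended LLM agents from falsely reporting success, treating honesty as the primary metric for autonomous tasks.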

arXiv:2606.11688v1 Announce Type: cross Abstract: Long-horizon LLM agents are not trusted to run unattended: with no human watching, they confidently …

llm agents, anti-fabrication firewall, verifiability, long-horizon tasks, honesty metric

7

📝 Deep Tech Dev.to 2026-06-11

Message Franking: Reporting Abuse Without Breaking Encryption

A mechanism called message franking lets you report abuse in end-to-end encrypted chats without breaking the encryption.

Here's a problem that sounds impossible. A messaging app can't read your end-to-end encrypted messages — that's the whole point. But when someone rece…

message franking, verifiable abuse reporting, end-to-end encryption, facebook m, cryptography

8

🤖 AI · Large Models arXiv Machine Learning 2026-06-09

TinyJudge: Unverifiable Constraint Alignment via Lightweight Specialist Ensembles

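ACL 2026 paper: uses lightweight specialist ensembles to tackle alignment with unverifiable constraints in LLMs, improving safety and controllability.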

arXiv:2606.07520v1 Announce Type: cross Abstract: Instruction Following (IF) is a core capability of LLMs, requiring strict adherence to diverse const…

tinyjudge, constraint alignment, lightweight specialist ensembles, acl 2026, unverifiable constraints

9

🔓 Open Source Hacker News AI 2026-06-03

A cryptographically verifiable state-transition engine for AI systems

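An open-source state-transition engine for AI that uses cryptographic verification to keep systems transparent and trustworthy; suited to security-sensitive settings.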

Article URL: https://github.com/Ghoti6098/AgenticOS Comments URL: https://news.ycombinator.com/item?id=48378312 Points: 1 # Comments: 0

ai systems, state-transition engine, cryptographically verifiable, open source, security

10

🤖 AI · Large Models Hacker News Show 2026-06-02

Show HN: AERF, signed receipts for AI agent actions

Gives AI agent actions verifiable receipts: AERF uses Ed25519 signatures for trustworthy provenance.

Article URL: https://github.com/aerf-spec/aerf Comments URL: https://news.ycombinator.com/item?id=48369900 Points: 1 # Comments: 0

aerf, signed receipts, ai agents, ed25519, verifiability

11

🤖 AI · Large Models arXiv AI 2026-06-01

HERMES: Towards Efficient and Verifiable Mathematical Reasoning in LLMs

A new advance in LLM mathematical reasoning: HERMES makes the reasoning process both efficient and verifiable.

arXiv:2511.18760v2 Announce Type: replace Abstract: Informal mathematics has been central to modern large language model (LLM) reasoning, offering fle…

large language models, mathematical reasoning, verifiability, efficiency optimization, artificial intelligence

12

📝 Deep Tech arXiv Machine Learning 2026-05-29

RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains

Proposes RUBRIC-ARROW, which uses alternating pointwise rubric reward modeling to improve LLM post-training in non-verifiable domains.

arXiv:2605.29156v1 Announce Type: new Abstract: Pointwise reward modeling offers critical signals for LLM post-training, yet struggles with absolute s…

llm post-training, reward modeling, non-verifiable domains, rubric-arr, pointwise scoring

13

📝 Deep Tech arXiv AI 2026-05-28

Verifiable Process Rewards for Agentic Reasoning

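Proposes verifiable process rewards that make agentic reasoning more trustworthy and interpretable: a new direction for reinforcement learning.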

arXiv:2605.10325v2 Announce Type: replace Abstract: Reinforcement learning from verifiable rewards (RLVR) has improved the reasoning abilities of larg…

verifiable process rewards, agentic reasoning, reinforcement learning, reasoning reliability, reward modeling

14

📝 Deep Tech arXiv AI 2026-05-26

KYA: A Framework-Agnostic Trust Layer for Autonomous Systems with Verifiable Provenance and Hierarchical Policy Composition

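The open-source framework KYA gives autonomous systems a trust layer with verifiable provenance and supports hierarchical policy composition for framework-agnostic governance.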

arXiv:2605.25376v1 Announce Type: cross Abstract: Observability tells operators when an agent is slow. KYA tells operators when an agent is wrong, dri…

kya, trust layer, autonomous systems, verifiable provenance, hierarchical policies

15

📝 Deep Tech arXiv NLP 2026-05-22

From Reasoning Chains to Verifiable Subproblems: Curriculum Reinforcement Learning Enables Credit Assignment for LLM Reasoning

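Decomposes reasoning chains into verifiable subproblems and uses curriculum reinforcement learning to assign credit precisely in LLM reasoning, easing a training bottleneck.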

arXiv:2605.22074v1 Announce Type: cross Abstract: Reinforcement learning from verifiable rewards (RLVR) has shown strong promise for LLM reasoning, bu…

llm reasoning, reinforcement learning, credit assignment, curriculum learning, verifiable subproblems

16

🤖 AI · Large Models arXiv Machine Learning 2026-05-21

PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models

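A scalable, verifiable planning benchmark designed for evaluating and training LLMs, addressing their performance gap on complex planning tasks.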

arXiv:2605.20873v1 Announce Type: cross Abstract: Planning is a fundamental capability for large language models (LLMs) because such complex tasks req…

planningbe, large language models, planning data, evaluation, training

17

🤖 AI · Large Models arXiv NLP 2026-05-21

From Text to Voice: A Reproducible and Verifiable Framework for Evaluating Tool Calling LLM Agents

A reproducible, verifiable evaluation framework that makes performance comparisons of tool-calling LLM agents less ambiguous.

arXiv:2605.15104v2 Announce Type: replace Abstract: Voice agents increasingly require reliable tool use from speech, whereas prominent tool-calling be…

llm agent, tool calling, evaluation framework, voice interaction, reproducibility

18

📝 Deep Tech arXiv NLP 2026-05-21

InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling

The InternBootcamp technical report describes verifiable task scaling, a new approach that substantially boosts LLM reasoning ability.

arXiv:2508.08636v2 Announce Type: replace Abstract: Large language models (LLMs) have revolutionized artificial intelligence by enabling complex reaso…

llm reasoning, verifiable task scaling, internboot, technical report, reasoning enhancement

19

📝 Deep Tech arXiv Machine Learning 2026-05-21

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

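Reveals a rank-1 structure in the parameter trajectories of RLVR training: a very small amount of training is enough to extrapolate LLM reasoning ability, contrary to conventional expectations.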

arXiv:2605.21468v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has become a dominant paradigm for improving rea…

reinforcement learning, large language models, parameter trajectories, rank-1 extrapolation, verifiable rewards

20

📝 Deep Tech arXiv Computer Vision 2026-05-20

EgoCoT-Bench: Benchmarking Grounded and Verifiable Operation-Centric Chain of Thought Reasoning for MLLMs

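A new benchmark for evaluating operation-centric chain-of-thought reasoning in multimodal LLMs, with an emphasis on grounding and verifiability.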

arXiv:2605.19559v1 Announce Type: new Abstract: The rapid development of Multimodal Large Language Models (MLLMs) has led to growing interest in egoce…

egocot-ben, multimodal large language models, chain-of-thought reasoning, benchmarking, operation-centric reasoning
