牛哥精选 · 三个月

1

🤖 AI·大模型 arXiv 机器学习 2026-07-14

LLM-PDESR: Robust PDE Discovery via Subdomain Weighted Residuals and LLM-Guided Symbolic Hypothesis Generation

AI引导符号假设生成，子域加权残差法让噪声数据中偏微分方程发现更鲁棒精准

arXiv:2607.10546v1 Announce Type: new Abstract: Discovering governing partial differential equations (PDEs) from noisy observational data is a fundame…

llm pde发现符号回归子域加权残差科学机器学习

2

📝 深度技术 arXiv AI 2026-07-14

YUKTI: From Natural-Language Situations to Robust, Verifiable Decisions An Uncertainty-Typed Proposition IR, Assumption-Robust Pareto Frontiers, and a Regret Certificate

语言模型做决策不可靠？YUKTI提出从自然语言到鲁棒可验证决策的新框架，挑战传统单目标优化的置信度陷阱。

arXiv:2607.09706v1 Announce Type: new Abstract: Language models turn a worded situation into a numeric plan, and the dominant pipelines (NL4Opt, OptiM…

自然语言处理决策鲁棒性语言模型优化可验证性

3

📝 深度技术 arXiv NLP 2026-07-10

Can We Trust LLM's Logic? Quantifying Uncertainty, Coherence, and Robustness via a Graph-Based Framework

用图论框架量化大模型逻辑推理的可信度，为评估LLM可靠性提供新方法。

arXiv:2607.08017v1 Announce Type: new Abstract: Large-Language Models (LLMs) can be prone to flawed and unfaithful reasoning that decoding strategies …

llm 逻辑推理不确定性一致性鲁棒性

4

📝 深度技术 arXiv 机器学习 2026-07-09

PeTeR: Post-Training Robustification of Probabilistic Circuits

概率电路后训练鲁棒化新方法PeTeR，增强模型可靠性与鲁棒性，性能更优。

arXiv:2607.07671v1 Announce Type: new Abstract: Probabilistic circuits (PCs) can model complex joint distributions while supporting exact and efficien…

概率电路后训练鲁棒化机器学习可靠性模型鲁棒性

5

📝 深度技术 arXiv AI 2026-07-07

Programming over Thinking: Efficient and Robust Multi-Constraint Planning

提出一种"在思考之上编程"的新范式，高效稳健解决多约束规划难题，AI规划领域重要突破。

arXiv:2601.09097v3 Announce Type: replace Abstract: Multi-constraint planning involves identifying, evaluating, and refining candidate plans while sat…

多约束规划编程思考算法人工智能鲁棒性

6

📝 深度技术 arXiv NLP 2026-07-01

Truth or Sophistry? LoFa: A Benchmark for LLM Robustness Against Logical Fallacies

首个评估大模型逻辑谬误鲁棒性的基准，揭示LLM在诡辩面前的漏洞，被ACL 2026收录。

arXiv:2606.31039v1 Announce Type: new Abstract: Large Language Models (LLMs) exhibit strong semantic capabilities, yet their resilience to manipulativ…

逻辑谬误 llm鲁棒性基准测试 acl 2026 大模型评估

7

📝 深度技术 arXiv AI 2026-07-01

RoPoLL: Robust Panel of LLM Judges

新方法RoPoLL让LLM评判者更鲁棒，18倍参数优势却更精准，有效对抗偏见污染。

arXiv:2606.30931v1 Announce Type: new Abstract: The LLM Jury, a Panel of LLM Evaluators (PoLL) reporting consensus scores, has become a practical alte…

ropoll llm评判者鲁棒性偏见污染参数优势

8

📝 深度技术 arXiv NLP 2026-06-30

AURORA: Asymmetry and Update-Induced Rotation for Robust Hallucination Detection in Large Language Models

基于不对称性与更新诱导旋转的创新方法，有效提升大语言模型幻觉检测的鲁棒性。

arXiv:2606.29545v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across a wide range of natural …

大语言模型幻觉检测不对称性更新诱导旋转 aurora

9

🤖 AI·大模型 arXiv AI 2026-06-29

Low-Agreeableness Persona Conditioning for Safe LLM Fine-Tuning

提出用低宜人性人格条件化提升LLM微调安全性，平衡对齐与攻击鲁棒性。

arXiv:2606.27709v1 Announce Type: cross Abstract: Recent work has shown that fine-tuning large language models (LLMs) for social warmth degrades factu…

llm安全微调人格条件化大模型对齐宜人性

10

📝 深度技术 arXiv 机器学习 2026-06-29

Towards Reliable Recommender Systems for Rating Data

探索如何提升评分数据推荐系统的可靠性，提出基于数据特性的新方法

arXiv:2412.20802v3 Announce Type: replace-cross Abstract: Recommender systems are widely used in the digital landscape to match users with content fit…

推荐系统可靠性评分数据机器学习鲁棒性

11

📝 深度技术 arXiv 机器学习 2026-06-26

Equivariance and Augmentation for Bayesian Neural Networks

贝叶斯神经网络与等变性结合，数据增强理论新突破，提升模型鲁棒性

arXiv:2606.26273v1 Announce Type: new Abstract: Symmetries are important for many deep learning tasks, ranging from applications in the sciences to me…

贝叶斯神经网络等变性数据增强机器学习深度学习

12

📝 深度技术 arXiv AI 2026-06-26

GEOALIGN: Geometric Rollout Curation for Robust LLM Reinforcement Learning

ICML 2026录用论文，针对LLM强化学习中的rollout采样提出几何优化方法，提升模型鲁棒性。

arXiv:2606.26917v1 Announce Type: cross Abstract: Online reinforcement learning is widely used to align large language models (LLMs) with reward signa…

geoalign geometric llm 强化学习 icml 2026

13

📝 深度技术 arXiv AI 2026-06-25

Cliff Tokens: Identifying Single-Token Failure Triggers in LLM Mathematical Reasoning

本文发现LLM推理中的“悬崖词”——单个token即可导致数学运算失败，揭示模型脆弱性根源。

arXiv:2606.25524v1 Announce Type: new Abstract: Large language models (LLMs) reach high accuracy in mathematical reasoning, but individual traces on t…

llm 数学推理单token失败触发器推理错误

14

🔓 开源项目 Hacker News AI 2026-06-25

Show HN: MAVS-GC – An Open-Source Governance Architecture for AI Systems

开源AI治理架构MAVS-GC，通过分层评估专家系统，增强算法在不利场景下的稳定性与适应性。

Hey HN, For some period of the time, I have been working on an open source project called MAVS-GC (Multi Adaptive Vetting Systems-Governance Core). Th…

mavs-gc 开源 ai治理专家系统鲁棒性

15

📝 深度技术 arXiv NLP 2026-06-24

CORE-BREW: LLR-Based Soft Decoding for Robust Multi-Bit LLM Watermarking

突破性LLM水印方案CORE-BREW采用软解码，提升多比特水印的鲁棒性与误报控制

arXiv:2606.24163v1 Announce Type: cross Abstract: Reliable provenance for LLM outputs requires multi-bit watermarks that remain robust under editing w…

llm水印软解码多比特水印鲁棒性 ecc

16

🤖 AI·大模型 arXiv AI 2026-06-24

Tuning without Peeking: Provable Generalization Bounds and Robust LLM Post-Training

提出“无窥视调优”方法，为大模型后训练提供可证明的泛化界限与鲁棒性保障。

arXiv:2507.01752v4 Announce Type: replace-cross Abstract: Gradient-based optimization is the workhorse of deep learning, offering efficient and scalab…

llm 后训练泛化界鲁棒性可证明理论

17

📝 深度技术 arXiv AI 2026-06-23

Paraphrasing Attack Resilience of Various AI-Generated Text Detection Methods

探究多种AI生成文本检测方法在面对改写攻击时的鲁棒性，为内容安全提供新视角。

arXiv:2605.14240v1 Announce Type: cross Abstract: The recent large-scale emergence of LLMs has left an open space for dealing with their consequences,…

ai文本检测改写攻击鲁棒性机器学习自然语言处理

18

📝 深度技术 arXiv NLP 2026-06-19

NIM4-ASR: Towards Efficient, Robust, and Customizable Real-Time LLM-Based ASR

提出NIM4-ASR架构，实现高效、鲁棒且可定制的基于LLM的实时语音识别。

arXiv:2604.18105v2 Announce Type: replace-cross Abstract: Integrating large language models (LLMs) into automatic speech recognition (ASR) has become …

nim4-asr llm asr 实时语音识别效率

19

📝 深度技术 arXiv 计算机视觉 2026-06-19

How Fragile Are Training-Free AI-Generated Image Detectors? A Controlled Audit of Score Direction, Preprocessing, and Compression

揭秘无训练AI图像检测器的脆弱性，系统评估分数方向、预处理与压缩的三重影响。

arXiv:2606.20488v1 Announce Type: new Abstract: Training-free detectors of AI-generated images promise generator-agnostic deployment without classifie…

ai图像检测无训练方法脆弱性分析预处理影响压缩鲁棒性

20

📝 深度技术 arXiv 机器学习 2026-06-16

Training-Free Adversarial Robustness in Computational MRI

无需训练即可增强计算MRI的对抗鲁棒性，ICML 2026论文提出全新方法。

arXiv:2501.01908v4 Announce Type: replace-cross Abstract: Deep learning (DL) methods have become the state-of-the-art for reconstructing sub-sampled m…

计算mri 对抗鲁棒性无训练方法机器学习 icml 2026

🐂 牛哥精选