牛哥精选 · 三个月

📋 全部 🤖 AI·大模型 ⚡ 效率工具 📝 深度技术 🚀 产品观察 💰 商业科技 🔓 开源项目 🎨 设计创意 📖 阅读推荐 🏷 资源合集 🌱 成长效率

📝 深度技术 arXiv 机器学习 2026-06-16

Training-Free Adversarial Robustness in Computational MRI

无需训练即可增强计算MRI的对抗鲁棒性，ICML 2026论文提出全新方法。

arXiv:2501.01908v4 Announce Type: replace-cross Abstract: Deep learning (DL) methods have become the state-of-the-art for reconstructing sub-sampled m…

计算mri 对抗鲁棒性无训练方法机器学习 icml 2026

🤖 AI·大模型 arXiv 计算机视觉 2026-06-16

On the Adversarial Robustness of Multimodal LLM Judges

多模态大语言模型充当评判者是否可靠？本文揭示其应对对抗攻击时的脆弱性并提出提升鲁棒性的方向。

arXiv:2606.15608v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) are increasingly used as automated judges, e.g., for image qu…

多模态大语言模型对抗鲁棒性 ai安全评判模型

📝 深度技术 arXiv AI 2026-05-25

Test-Time Training Undermines Safety Guardrails

最新研究揭示：测试时训练（TTT）能绕过AI安全护栏，引发对模型防御机制的新思考。

arXiv:2605.22984v1 Announce Type: cross Abstract: Test-Time Training (TTT) is an emerging paradigm that enables models to adapt their parameters durin…

测试时训练安全护栏 ai安全大模型风险

🤖 AI·大模型 arXiv NLP 2026-05-21

Towards Context-Invariant Safety Alignment for Large Language Models

针对大模型安全对齐中上下文敏感漏洞，提出创新方法实现跨场景一致性防护。

arXiv:2605.20994v1 Announce Type: new Abstract: Preference-based post-training aligns LLMs with human intent, yet safety behavior often remains brittl…

大模型安全上下文不变性安全对齐 llm防御对抗鲁棒性

🤖 AI·大模型 arXiv AI 2026-05-19

GAMBIT: A Three-Mode Benchmark for Adversarial Robustness in Multi-Agent LLM Collectives

首个专注多智能体LLM集体对抗鲁棒性的三模式基准，揭示单一欺骗智能体如何突破现有防御。

arXiv:2605.09027v2 Announce Type: cross Abstract: In multi-agent systems (MAS), a single deceptive agent can nullify all gains of an agentic AI collec…

llm安全多智能体系统对抗鲁棒性基准测试 ai安全

📅 日期

2026-05-20 2026-05-19