牛哥精选 · This Month

📝 Deep Tech · Hacker News · LLM · 2026-05-24

Constraint Decay: The Fragility of LLM Agents in Back End Code Generation

A new arXiv study shows that LLM agents suffer from "constraint decay" in back-end code generation, an alarming sign of system fragility.

Article URL: https://arxiv.org/abs/2605.06445 · Comments URL: https://news.ycombinator.com/item?id=48256912 · Points: 4 · Comments: 0

llm, AI agents, constraint decay, code generation, back-end development

📝 Deep Tech · arXiv Machine Learning · 2026-05-23

On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs

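The first systematic study of robustness and chain-of-thought consistency in RL-finetuned VLMs, revealing the root causes of model fragility.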

arXiv:2602.12506v3 Announce Type: replace Abstract: Reinforcement learning (RL) finetuning has become a key technique for enhancing large language mod…

RL fine-tuning, vision-language models, robustness, chain-of-thought consistency, model fragility

📝 Deep Tech · arXiv Machine Learning · 2026-05-20

Alignment Dynamics in LLM Fine-Tuning

Why alignment is fragile in LLM fine-tuning: a unified view from parameter dynamics to output distributions.

arXiv:2605.18309v1 Announce Type: new Abstract: Although Large Language Models (LLMs) achieve strong alignment through supervised fine-tuning and rein…

llm, fine-tuning, alignment, machine learning, deep learning

📝 Deep Tech · arXiv Machine Learning · 2026-05-20

Adversarial Fragility and Language Vulnerability in Clinical AI: A Systematic Audit of Diagnostic Collapse Under Imperceptible Perturbations and Cross-Lingual Drift in Low-Resource Healthcare Settings

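Clinical AI systems risk diagnostic collapse under subtle perturbations and in multilingual settings; this systematic audit exposes the safety gaps.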

arXiv:2605.16993v1 Announce Type: cross Abstract: Current clinical artificial intelligence (AI) systems are evaluated almost exclusively on clean, sta…

clinical AI, adversarial fragility, language vulnerability, low-resource healthcare, diagnostic collapse

📝 Deep Tech · arXiv AI · 2026-05-19

On the Fragility of Data Attribution When Learning Is Distributed

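The fragility of data attribution in distributed learning: a single participant can manipulate attribution values and inflate them substantially, undermining the credibility of pricing and auditing.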

arXiv:2605.15520v1 Announce Type: cross Abstract: Data attribution has become an important component of pricing, auditing, and governance in machine l…

data attribution, distributed training, fragility, machine learning security, participant contribution manipulation

📝 Deep Tech · arXiv AI · 2026-05-19

Quantifying Cyber-Vulnerability in Power Electronics Systems via an Impedance-Based Attack Reachable Domain

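Proposes an impedance-based attack reachable domain as a new quantitative measure of cyber-vulnerability in power electronics systems.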

arXiv:2605.14502v1 Announce Type: cross Abstract: Power electronics systems are increasingly exposed to cyber threats due to their integration with di…

power electronics systems, cybersecurity, vulnerability quantification, impedance, attack reachable domain
