GAMBIT: A Three-Mode Benchmark for Adversarial Robustness in Multi-Agent LLM Collectives
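The first three-mode benchmark dedicated to adversarial robustness in multi-agent LLM collectives, showing how a single deceptive agent can break through existing defenses.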
arXiv:2605.09027v2 Announce Type: cross Abstract: In multi-agent systems (MAS), a single deceptive agent can nullify all gains of an agentic AI collec…