1
AgentForesight: Online Auditing for Early Failure Prediction in Multi-Agent Systems
多智能体系统一旦出现关键错误就会级联溃败。AgentForesight将故障诊断从事后归因推入在线审计——在轨迹执行中实时识别首个决定性错误,7B模型超越基线,为复杂Agent系统提供宝贵的安全护栏。
arXiv:2605.08715v2 Announce Type: replace-cross Abstract: LLM-based multi-agent systems are increasingly deployed on long-horizon tasks, but a single …