Insights Generator: Systematic Corpus-Level Trace Diagnostics for LLM Agents
Systematic diagnosis of LLM agent execution failures, scaling from single traces to the full corpus to uncover hidden error patterns.
arXiv:2605.21347v1 Announce Type: cross Abstract: Diagnosing failures in LLM agents remains largely manual. Practitioners inspect a small subset of ex…