1
Trust No Tool: Evaluating and Defending LLM Agents under Untrusted Tool Feedback
LLM Agent面临工具反馈不可信风险,论文提出评估框架与防御方法,为AI安全提供新思路。
arXiv:2605.17453v1 Announce Type: cross Abstract: Tool-using LLM agents increasingly rely on external tools to make consequential decisions, yet most …