Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory
提出Evo-Memory基准,全面评估LLM Agent通过自演化记忆进行测试时学习的能力。
arXiv:2511.20857v2 Announce Type: replace Abstract: Statefulness is essential for large language model (LLM) agents to perform long-term planning and …