EvoMemBench: Benchmarking Agent Memory from a Self-Evolving Perspective
A new benchmark for evaluating agent memory from the standpoint of self-evolution, offering standardized tests for agent memory research.
arXiv:2605.18421v1 Announce Type: cross Abstract: Recent benchmarks for Large Language Model (LLM) agents mainly evaluate reasoning, planning, and exe…