1
BESPOKE: Benchmark for Search-Augmented Large Language Model Personalization via Diagnostic Feedback
提出BESPOKE基准,通过诊断反馈评估搜索增强大语言模型的个性化能力,揭示同一查询不同意图的关键挑战。
arXiv:2509.21106v2 Announce Type: replace Abstract: Search-augmented large language models (LLMs) have advanced information-seeking tasks by integrati…