PolitNuggets: Benchmarking Agentic Discovery of Long-Tail Political Facts
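The PolitNuggets benchmark evaluates how well large models discover multilingual long-tail political facts, providing a new yardstick for agentic information synthesis.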
arXiv:2605.14002v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) embedded in agentic frameworks have transformed information retrieval fr…