ForeSci: Evaluating LLM Agents for Forward-Looking AI Research Judgment
The ForeSci benchmark is the first to systematically evaluate LLM agents on forward-looking judgment in AI research.
arXiv:2606.00644v1 Announce Type: new
Abstract: AI research often requires decisions before future evidence exists: which bottleneck to attack, which …