1
VideoGameBench: Can Vision-Language Models complete popular video games?
最新研究揭示视觉语言模型在真实视频游戏中的表现,挑战其感知与空间导航能力
arXiv:2505.18134v3 Announce Type: replace Abstract: Vision-language models (VLMs) have achieved strong results on coding and math benchmarks that are …