1
How Auxiliary Reasoning Unleashes GUI Grounding in VLMs
VLMs在GUI接地任务中隐藏巨大潜力,辅助推理能有效释放这一能力,突破当前优化瓶颈。
arXiv:2509.11548v2 Announce Type: replace Abstract: Graphical user interface (GUI) grounding is a fundamental task for building GUI agents. However, g…