1
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining
从视频中自动合成海量GUI交互轨迹,破解GUI Agent预训练数据稀缺难题,让智能体更好理解真实应用。
arXiv:2605.14747v1 Announce Type: cross Abstract: Recent advances in multimodal large language models have driven growing interest in graphical user i…