1
MMSkills: Towards Multimodal Skills for General Visual Agents
提出MMSkills框架,为通用视觉智能体注入可重用的多模态技能,超越传统文本或代码型技能库
arXiv:2605.13527v2 Announce Type: replace Abstract: Reusable skills have become a core substrate for improving agent capabilities, yet most existing s…