Show HN: SoMatic – Vision-based OS automation framework for AI agents
基于视觉的OS自动化框架,让AI代理精准操控电脑,突破多模态LLM定位瓶颈。
Hi HN, I'm Smyan and I enjoy building agents. Modern multimodal LLMs are great at vision and perception but are quite poor at localization. This natur…