1
Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
多模态大模型在空间智能上的突破,赋予AI更强的视觉感知与推理能力。
arXiv:2505.23747v2 Announce Type: replace-cross Abstract: Recent advancements in Multimodal Large Language Models (MLLMs) have significantly enhanced …