Beyond Localization: A Comprehensive Diagnosis of Perspective-Conditioned Spatial Reasoning in MLLMs from Omnidirectional Images
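A new diagnostic paper evaluates in depth the perspective-conditioned spatial reasoning abilities of multimodal large language models on omnidirectional images, going beyond the limits of traditional localization.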
arXiv:2605.12413v3 Announce Type: replace Abstract: Multimodal Large Language Models (MLLMs) show strong visual perception, yet remain limited in reas…