1
PinpointQA: A Dataset and Benchmark for Small Object-Centric Spatial Understanding in Indoor Videos
首个聚焦室内视频小物体空间理解的数据集与评测基准,直击多模态大模型在细小物体感知上的短板
arXiv:2604.08991v2 Announce Type: replace-cross Abstract: Small object-centric spatial understanding in indoor videos remains a significant challenge …