Descriptor: Distance-Annotated Traffic Perception Question Answering (DTPQA)
为自动驾驶场景设计带距离标注的交通感知问答数据集,评估VLM空间推理能力
arXiv:2511.13397v2 Announce Type: replace-cross Abstract: The remarkable progress of Vision-Language Models (VLMs) on a variety of tasks has raised in…