EgoCoT-Bench: Benchmarking Grounded and Verifiable Operation-Centric Chain of Thought Reasoning for MLLMs
评估多模态大模型操作中心链式思维推理能力的新基准,强调接地与可验证性。
arXiv:2605.19559v1 Announce Type: new Abstract: The rapid development of Multimodal Large Language Models (MLLMs) has led to growing interest in egoce…