VLM-AutoDrive: Post-Training Vision-Language Models for Safety-Critical Autonomous Driving Events
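This paper proposes a modular post-training framework that adapts vision-language models to anomaly detection for safety-critical events in autonomous driving. It combines metadata, LLM-generated descriptions, VQA, and CoT reasoning to improve accuracy.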
arXiv:2603.18178v2 Announce Type: replace Abstract: The rapid growth of ego-centric dashcam footage presents a major challenge for detecting safety-cr…