1
EntityBench: Towards Entity-Consistent Long-Range Multi-Shot Video Generation
140 集 2491 镜头,首个多镜头视频生成实体一致性基准,填补长序列评估空白。
arXiv:2605.15199v1 Announce Type: cross Abstract: Multi-shot video generation extends single-shot generation to coherent visual narratives, yet mainta…