1
DiLA: Disentangled Latent Action World Models
DiLA将潜在动作解耦为几何与纹理流,实现高保真视频预测,突破LAMs的抽象-保真权衡。
arXiv:2605.15725v1 Announce Type: cross Abstract: Latent Action Models (LAMs) enable the learning of world models from unlabeled video by inferring ab…