JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching
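Jointly generates facial motion and speech, using flow matching for synchronized audio-visual synthesis and moving beyond the limitations of modeling the two modalities separately.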
arXiv:2506.23552v2 Announce Type: replace Abstract: The intrinsic link between facial motion and speech is often overlooked in generative modeling, wh…