Archon: A Unified Multimodal Model for Holistic Digital Human Generation
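Introduces Archon, the first fully pretrained unified multimodal model, which integrates text, audio, motion, and vision to enable holistic digital human generation.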
arXiv:2605.30311v1 Announce Type: cross Abstract: Digital humans are fundamental to immersive interaction, yet creating a unified model for holistic m…