1
Let ViT Speak: Generative Language-Image Pre-training
ViT也能开口说话?全新生成式语言-图像预训练框架,让视觉与语言深度融合!
arXiv:2605.00809v2 Announce Type: replace Abstract: In this paper, we present \textbf{Gen}erative \textbf{L}anguage-\textbf{I}mage \textbf{P}re-traini…