1
Transformers for Learning on Noisy and Task-Level Manifolds: Approximation and Generalization Insights
从理论层面揭示Transformer在噪声与任务级流形上的学习能力,近似与泛化分析带来新洞察
arXiv:2505.03205v3 Announce Type: replace Abstract: Transformers serve as the foundational architecture for large language and video generation models…