1
ForMaT: Dataset for Visually-Grounded Multilingual PDF Translation
PDF翻译新突破!ForMaT数据集保留原文布局,支持15种语言对,精准处理嵌套表格等复杂结构。
arXiv:2605.15794v1 Announce Type: new Abstract: We present ForMaT (Format-Preserving Multilingual Translation), a parallel corpus of 3,956 PDFs across…