1
Flow-OPD: On-Policy Distillation for Flow Matching Models
提出Flow-OPD新方法,用同策略蒸馏解决流匹配模型在多任务对齐中的奖励稀疏和梯度干扰问题。
arXiv:2605.08063v3 Announce Type: replace-cross Abstract: Existing Flow Matching (FM) text-to-image models suffer from two critical bottlenecks under …