Flow-OPD: On-Policy Distillation for Flow Matching Models
Researchers introduce Flow-OPD, a post-training framework that applies on-policy distillation to Flow Matching text-to-image models, addressing reward sparsity and gradient interference problems. Built on Stable Diffusion 3.5 Medium, the method achieves significant performance gains—GenEval scores improve from 63 to 92 and OCR accuracy from 59 to 94—while maintaining image quality and surpassing individual teacher models.