Kuaishou Tech
Nov 25, 2025 · Artificial Intelligence
How Flow‑GRPO Boosts Image Generation Accuracy to 95% with Online Reinforcement Learning
Flow‑GRPO introduces online reinforcement learning into flow‑matching models by converting deterministic ODE sampling to stochastic SDE sampling and reducing denoising steps, raising SD‑3.5‑Medium's GenEval accuracy from 63% to 95%—surpassing GPT‑4o—and demonstrating strong gains in complex composition, text rendering, and human‑preference alignment across multiple generative tasks.
AI ResearchImage Generationdeep learning
0 likes · 8 min read
