AIWalker
Jan 21, 2025 · Artificial Intelligence
PKU Introduces Next Patch Prediction for Image Generation, Cutting Training Cost to ~0.6×
The paper proposes a Next Patch Prediction (NPP) paradigm that groups image tokens into high-density patches, so that autoregressive models predict the next patch instead of the next individual token. This reduces training cost to about 0.6× and improves ImageNet FID by up to 1.0 across models ranging from 100M to 1.4B parameters.
Autoregressive Models · FID improvement · LlamaGen
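The grouping step described in the summary can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes that a patch embedding is the average of the token embeddings in a p×p window, and that training pairs are formed by shifting the patch sequence by one. The function names, the 16×16 grid, and the embedding width are all hypothetical.

```python
import numpy as np

def group_tokens_into_patches(token_grid, patch_size):
    """Average each patch_size x patch_size window of token embeddings.

    token_grid: array of shape (H, W, D), one embedding per image token.
    Returns an array of shape ((H // p) * (W // p), D) in raster order.
    Averaging is an assumption made for this sketch.
    """
    h, w, d = token_grid.shape
    p = patch_size
    if h % p != 0 or w % p != 0:
        raise ValueError("grid height and width must be divisible by patch_size")
    # Split both spatial axes into (blocks, within-block) and average within blocks.
    blocks = token_grid.reshape(h // p, p, w // p, p, d)
    patches = blocks.mean(axis=(1, 3))
    return patches.reshape(-1, d)

def next_patch_pairs(patch_seq):
    """Shift by one: the model sees patch i and is trained to predict patch i + 1."""
    return patch_seq[:-1], patch_seq[1:]

rng = np.random.default_rng(0)
tokens_grid = rng.normal(size=(16, 16, 8))   # hypothetical 16x16 token grid, D = 8
patches = group_tokens_into_patches(tokens_grid, patch_size=2)
inputs, targets = next_patch_pairs(patches)

# The autoregressive sequence shrinks from 256 tokens to 64 patches.
print(tokens_grid.shape[0] * tokens_grid.shape[1], "->", patches.shape[0])
```

With a 2×2 patch the sequence the model has to process is four times shorter, which is the general source of the training savings; the roughly 0.6× figure is the article's reported result and is not derived from this sketch.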
