Machine Heart
Apr 24, 2026 · Artificial Intelligence
Generating High‑Resolution Images with Only 64 Tokens: How MacTok Overcomes Posterior Collapse
MacTok introduces semantic masking and dual‑space alignment to prevent posterior collapse in continuous image tokenizers, enabling high‑quality generation with just 64 to 128 tokens and achieving strong gFID scores on ImageNet at both 256×256 and 512×512 resolution.
Image Generation · ImageNet · MacTok
