Nvidia Redefines Text-to-Image Generation: Direct Latent‑to‑4K with Pixel Diffusion Decoder
Nvidia's Spatial Intelligence Lab introduces the Pixel Diffusion Decoder (PiD), a generative decoder that replaces the traditional decode‑plus‑super‑resolution pipeline, delivering 2K images in 210 ms on a GB200 GPU and up to 4K output with 3‑6× speedup while improving visual quality across multiple metrics.
