Janus-Pro Unveiled: A Unified Architecture for Multimodal Understanding and Generation
Janus-Pro, the open‑source successor to Janus, builds on its predecessor's decoupled visual encoding and scales up the training data to improve both multimodal understanding and text‑to‑image generation, achieving state‑of‑the‑art results on benchmarks such as GQA, GenEval and DPG‑Bench.
