Kuaishou Tech
Jul 22, 2025 · Artificial Intelligence
How Orthus Achieves Lossless Multimodal Generation with a Unified Autoregressive Transformer
Orthus, a unified multimodal model presented at ICML 2025, pairs an autoregressive Transformer backbone with separate language and diffusion heads to enable lossless interleaved image‑text generation, outperforming existing models on both understanding and generation benchmarks while remaining computationally efficient.
AI Research · Diffusion Models · Image Generation
11 min read
