Kuaishou Tech
Kuaishou Tech
Jul 22, 2025 · Artificial Intelligence

How Orthus Achieves Lossless Multimodal Generation with a Unified Autoregressive Transformer

Orthus, a new unified multimodal model presented at ICML 2025, leverages an autoregressive Transformer backbone with separate language and diffusion heads to enable lossless image‑text interleaved generation, outperforming existing models on both understanding and generation benchmarks while remaining computationally efficient.

AI researchDiffusion ModelsImage Generation
0 likes · 11 min read
How Orthus Achieves Lossless Multimodal Generation with a Unified Autoregressive Transformer