AIWalker
Feb 16, 2025 · Artificial Intelligence
VARGPT: A Unified Autoregressive Architecture for Multimodal Understanding and Generation
VARGPT is a novel multimodal large language model that unifies visual understanding and autoregressive image generation within a single architecture, extending LLaVA with next‑token and next‑scale prediction, trained through three staged data‑curated phases and achieving superior performance on numerous vision‑language benchmarks.
AI researchImage GenerationVARGPT
0 likes · 20 min read
