Old Zhang's AI Learning
Mar 2, 2026 · Artificial Intelligence
Qwen3.5 Small Models Unveiled: From 0.8B to 9B with Full Capabilities
The article introduces the newly released Qwen3.5 small model series (0.8B, 2B, 4B, 9B), explains their shared Gated Delta Networks architecture, early multimodal token fusion, 201‑language support and up to 1 million‑token context, and presents benchmark data that show the 9B model rivaling much larger LLMs, followed by practical guidance on model selection and deployment.
Gated Delta NetworksLocal Deploymentbenchmark
0 likes · 10 min read
