DataFunTalk
Jul 5, 2025 · Artificial Intelligence
DeepSeek R1T2 Chimera: Faster, High‑Performance LLM with Assembly of Experts
The DeepSeek R1T2 Chimera model, an open‑source LLM built with Assembly of Experts technology, delivers up to 200% faster inference than R1‑0528, surpasses R1 on GPQA‑Diamond and AIME‑24 benchmarks, and offers a 671‑billion‑parameter MoE architecture, though it lacks function‑calling support and trails the highest‑end R1‑0528 on the toughest tests.
AIAssembly of ExpertsDeepSeek
0 likes · 5 min read
