Tagged articles
1 articles
Page 1 of 1
Tencent Cloud Developer
Tencent Cloud Developer
Mar 19, 2025 · Artificial Intelligence

Inside Tencent Hunyuan Turbo S: Speed, Cost, and Hybrid Mamba Transformer Explained

Tencent's new Hunyuan Turbo S model combines a 44% faster response time, dramatically lower token costs, and a hybrid Mamba‑Transformer architecture that merges linear attention with full attention, offering insights into fast‑thinking versus slow‑thinking LLM designs, MoE scaling laws, low‑precision training effects, and long‑short chain fusion techniques.

AIArchitectureHybridMambaLLM
0 likes · 14 min read
Inside Tencent Hunyuan Turbo S: Speed, Cost, and Hybrid Mamba Transformer Explained