DeepSeek’s DSpark Boosts AI Inference Speed Up to 400% with Speculative Decoding
DeepSeek’s open‑source DSpark applies speculative decoding to its V4 Flash and Pro models, delivering 51%‑400% inference throughput gains that vary by task, while also supporting other models such as Gemma and Qwen, positioning it as a versatile, cross‑model acceleration solution.
