Tagged articles

Semi-Autoregressive Generation

PaperAgent
Jun 29, 2026 · Artificial Intelligence

DeepSeek Open-Sources DSpark: A New Speculative Decoding Framework for Large Language Models

DeepSeek releases DSpark, an open‑source speculative decoding system that combines semi‑autoregressive generation with confidence‑scheduled verification, delivering 60‑85% per‑user speed gains, lower latency, and higher acceptance rates than Eagle3 and DFlash across multiple LLM benchmarks.

Confidence Scheduling · LLM Inference · Semi-Autoregressive Generation
14 min read
Machine Heart
Jun 27, 2026 · Artificial Intelligence

DSpark in DeepSeek V4 Cuts LLM Inference Latency by Up to 85%

DeepSeek V4’s DSpark adds a speculative decoding framework that combines a lightweight draft model, semi‑autoregressive generation, and confidence‑scheduled verification, delivering 60‑85% faster inference for Qwen3 and Gemma models while providing an open‑source DeepSpec toolkit for training and evaluation.

Confidence-Scheduled Verification · DSpark · DeepSeek
7 min read