AI Engineering
Jun 30, 2026 · Artificial Intelligence
Running DeepSeek V4 on M5 Max: 5 tps Speedup Without Large Memory
Developer Anemll demonstrates that the DS4 IQ2_Q2 version of DeepSeek V4 on an Apple M5 Max gains a 5‑tps throughput boost, using SSD‑streamed MoE sidecar loading to run large models without requiring high memory, and provides full build and execution instructions.
AI inferenceApple SiliconDS4
0 likes · 8 min read
