Data Party THU
Sep 20, 2025 · Artificial Intelligence
How DeepSeek Trained an LLM for Just $294K – Inside the R1 Model
The article reports that DeepSeek's R1 large language model, detailed in a peer-reviewed Nature paper, was built for roughly $300K in total, about $294K of it for training, using Nvidia H800 chips and a pure reinforcement-learning approach. The model achieves competitive performance while remaining open-source.
DeepSeek · Nvidia H800 · Peer Review
9 min read
