Tagged articles
1 articles
Page 1 of 1
Data Party THU
Data Party THU
Sep 20, 2025 · Artificial Intelligence

How DeepSeek Trained a $30M LLM for Just $29.4K – Inside the R1 Model

The article reports that DeepSeek’s R1 large language model, detailed in a peer‑reviewed Nature paper, was built with roughly $300 k in total cost—about $29.4 k for training—using Nvidia H800 chips and novel pure reinforcement‑learning techniques, achieving competitive performance while remaining open‑source.

DeepSeekNvidia H800Peer Review
0 likes · 9 min read
How DeepSeek Trained a $30M LLM for Just $29.4K – Inside the R1 Model