Baidu Geek Talk
Oct 31, 2022 · Artificial Intelligence
PaddleBox: A GPU‑Based Ultra‑Large‑Scale Sparse DNN Training Framework
PaddleBox is Baidu’s GPU‑based framework for training ultra‑large‑scale sparse DNNs. It combines a three‑tier hierarchical parameter server (SSD, DRAM, HBM) with pipelined scheduling and multi‑machine, multi‑GPU communication, delivers 5–40× better cost‑performance than traditional CPU solutions, and powers Baidu’s advertising services.
GPU · Large-Scale Models · PaddleBox