Tagged articles
1 articles
Page 1 of 1
JD Cloud Developers
JD Cloud Developers
Mar 14, 2024 · Artificial Intelligence

How JD Retail Boosted Online Recommendation Inference with Distributed Heterogeneous Computing

This article details JD Retail's ad‑tech team's deep‑compute optimizations—including a distributed graph‑based heterogeneous framework, GPU‑focused inference engine enhancements, TensorBatch request aggregation, deep‑learning compiler bucket pre‑compilation, asynchronous compilation, and multi‑stream GPU processing—to overcome high‑concurrency, low‑latency online recommendation challenges.

Deep Learning CompilerGPU inferencedistributed computing
0 likes · 14 min read
How JD Retail Boosted Online Recommendation Inference with Distributed Heterogeneous Computing