JD Retail Technology
Jan 30, 2026 · Artificial Intelligence
How JD’s 9N‑LLM Engine Powers Scalable Generative Recommendation at Industrial Scale
The article details JD Retail’s 9N‑LLM unified training engine—supporting TensorFlow and PyTorch, GPU and NPU, and both traditional and generative recommendation scenarios—explaining its architecture, high‑throughput sample engine, distributed sparse embedding system, five‑stage pipeline, UniAttention accelerator, and reinforcement‑learning capabilities that together enable TB‑scale data, B‑scale dense parameters, and efficient RL training for real‑world recommendation services.
Distributed TrainingGPU/NPUUniAttention
0 likes · 26 min read
