JD Cloud Developers
JD Cloud Developers
Jun 24, 2025 · Artificial Intelligence

How JD Retail’s xLLM Architecture Revolutionizes AI Inference for E‑Commerce

At GAITC2025, JD Retail’s AI Infra lead Zhang Ke detailed the challenges of e‑commerce AI inference and introduced the xLLM edge‑cloud unified large‑model architecture, highlighting adaptive scheduling, offline unified scheduling, multi‑layer pipelines, and agent collaboration that boost performance, cut costs, and pave the way for future AI advancements.

AI inferenceModel Optimizatione-commerce
0 likes · 6 min read
How JD Retail’s xLLM Architecture Revolutionizes AI Inference for E‑Commerce
JD Retail Technology
JD Retail Technology
Jun 20, 2025 · Artificial Intelligence

How JD Retail’s xLLM Architecture Revolutionizes AI Inference for E‑Commerce

The article details JD Retail’s collaboration with Tsinghua University to build the xLLM edge‑cloud unified large‑model inference framework, addressing e‑commerce AI challenges such as diverse inputs, task scheduling, model compression, and cost, while outlining future research directions and performance gains.

AI inferenceModel Optimizatione-commerce
0 likes · 7 min read
How JD Retail’s xLLM Architecture Revolutionizes AI Inference for E‑Commerce