JD Cloud Developers
Jun 24, 2025 · Artificial Intelligence
How JD Retail’s xLLM Architecture Revolutionizes AI Inference for E‑Commerce
At GAITC 2025, Zhang Ke, who leads AI Infra at JD Retail, outlined the challenges of AI inference in e‑commerce and introduced xLLM, an edge‑cloud unified large‑model architecture. He highlighted adaptive scheduling, offline unified scheduling, multi‑layer pipelines, and agent collaboration as the techniques that boost performance, cut costs, and prepare the ground for future AI work.
AI inference · Model Optimization · e-commerce
