Artificial Intelligence 19 min read

JD Advertising R&D: AI‑Driven Solutions for Traffic Valuation, Multimodal Understanding, Auction Mechanisms, Generative Recommendation, and Large‑Model Engineering

The JD Advertising R&D team applies cutting‑edge AI techniques—including query intent models, multimodal representation pipelines, reinforcement‑learning‑based auction mechanisms, generative recommendation with quantized product tokens, and large‑model infrastructure—to boost traffic valuation, ad relevance, revenue, and creative generation across the platform.

JD Tech Talk

Sep 23, 2024

JD Advertising R&D: AI‑Driven Solutions for Traffic Valuation, Multimodal Understanding, Auction Mechanisms, Generative Recommendation, and Large‑Model Engineering

JD Advertising R&D focuses on applying cutting‑edge AI algorithms to improve traffic valuation, multimodal content understanding, auction mechanisms, generative recommendation, and large‑model infrastructure.

Traffic value estimation – describes query intent recognition, challenges of multi‑intent queries, long‑tail categories, and a generative‑discriminative model that augments training data for under‑represented categories.

Multimodal representation – outlines a dual‑stream pipeline that extracts text embeddings (BGE‑large‑zh1.5) and image embeddings (ViT‑CLIP‑base), aligns them via CLIP‑based projection, and aggregates them in a recommendation space using a Gate‑GNN.

Auction mechanism (ListVCG) – introduces a reinforcement‑learning‑based sequence auction that approximates the combinatorial 700‑choose‑4 problem, integrates multi‑objective optimization, and uses reward shaping and curriculum RL to improve long‑term revenue.

Multi‑agent RL for bidding – explains joint optimization of bidding and mechanism agents, offline simulation stages, and challenges of modeling platform revenue and advertiser behavior.

Generative recommendation – presents a quantized product representation (RQ‑VAE) and a series of pre‑training, fine‑tuning, and DPO steps that predict the next product a user will browse. Example prompt and code snippets are shown below:

提示词: 请告诉我,商品的四元组表示为{input_turple}的标题是什么？
输入: <a_1><b_2><c_3><d_4>
输出: 华为（HUAWEI）旗舰手机mate60 pro+ 16G+512GB 宣白

Creative generation – describes the RFNet multimodal reliability feedback network for automatic quality assessment of AI‑generated ad images, and its training on a million‑scale labeled dataset (RF1M), published at ECCV 2024.

Large‑model engineering – discusses inference latency constraints (≈100 ms), cost reduction for 1.5 B‑parameter models, hardware‑software co‑design, distributed training, and deployment pipelines to support million‑QPS advertising services.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Advertising AI large models Multimodal reinforcement learning Graph Neural Networks

Written by

JD Tech Talk

Official JD Tech public account delivering best practices and technology innovation.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.