Artificial Intelligence 7 min read

How Kuaishou Boosted Ad Performance with Multimodal LLMs and the COPE Framework

This article reviews Kuaishou’s two‑year exploration of large‑model techniques in advertising, detailing the content‑domain estimation challenges, how multimodal and LLM approaches improve full‑domain behavior utilization and external knowledge integration, and introducing the COPE product‑content representation framework and the LEARN LLM knowledge‑transfer system.

DataFunSummit

Oct 8, 2025

How Kuaishou Boosted Ad Performance with Multimodal LLMs and the COPE Framework

Introduction

This article outlines Kuaishou’s exploratory work over the past two years that applies large‑model technology to advertising scenarios, describing the data and model design motivations behind the efforts.

1. Content‑domain ad estimation challenges and large‑model opportunities

Kuaishou’s platform combines content and e‑commerce, offering various media forms such as images, short videos, live streams, product detail pages, and landing pages. User behavior is scattered across these heterogeneous contexts, making single‑scene data sparse. Existing recommendation systems rely on ID‑centric pipelines (Video ID, Item ID, Live ID, etc.), which hinder cross‑domain data sharing and complicate interest modeling across different content lifecycles.

Advertising data is especially sparse, but integrating full‑domain behavior—both natural‑flow and search interactions—can improve user interest modeling. Additionally, recommendation models often fall into “information cocoons” by repeatedly training on historical user actions; injecting external knowledge via large language models (LLMs) can help break this loop.

2. Full‑domain behavior utilization: the COPE unified product‑content representation framework

To leverage full‑domain behavior, Kuaishou first built a product‑ID system (SPU ID) that aggregates semantically similar items across scenes, providing a more stable identifier than raw Item IDs. However, SPU IDs still lack semantic richness. The COPE framework compresses multimodal product content from live streams, detail pages, short videos, etc., into a robust, compact feature representation. This unified representation enhances item‑side features, expands user behavior sequences, and reduces dependence on scene‑specific interaction data.

3. External knowledge utilization: the LEARN LLM knowledge‑transfer framework

To address the information‑cocoon problem, Kuaishou employs the open‑pretrained capabilities of large language models. By transferring world knowledge and strong reasoning abilities from LLMs, the LEARN framework injects external knowledge into the advertising recommendation pipeline, helping break the feedback loop and improving model generalization.

Both COPE and LEARN have delivered measurable business gains in Kuaishou’s ad system.

Article excerpted from the e‑book “A Plain‑spoken Large‑Model Handbook”.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Advertising recommendation LLM large models Multimodal Kuaishou

Written by

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.