Applying Causal Inference to Debias Recommendation Systems at Kuaishou
This talk explores how causal inference techniques are used to identify and mitigate various biases in Kuaishou's recommendation pipeline, covering background theory, recent research advances, practical implementations for popularity and video completion debiasing, and reflections on challenges and future directions.
The presentation begins with an overview of bias in recommendation systems, explaining how user interaction logs introduce selection and popularity biases that affect model training and user experience.
It then introduces causal inference as a framework to distinguish true user interest from confounding factors, describing three levels of causal reasoning: correlation exploration, intervention effect analysis, and counterfactual reasoning.
Recent research examples are discussed, including the DICE method (causal embedding with interest and conformity components), Huawei's bias‑unbias embedding approach, KDD21's total‑effect vs. direct‑effect decomposition, and SIGIR21's back‑door adjustment for popularity bias.
The speaker details Kuaishou's own applications: (1) popularity debiasing using back‑door operators and a modified conditional probability formulation; (2) causal representation decoupling that learns separate interest and conformity embeddings via multi‑task training; (3) video completion‑rate debiasing that defines length‑aware thresholds and applies IPW weighting.
Experimental results show increased exposure for medium‑tail items, improved completion rates across video lengths, and better separation of embeddings, confirming the effectiveness of the causal methods.
In the concluding section, the speaker reflects on the limited adoption of causal inference in recommendation, the difficulty of defining treatments, the need to balance debiasing strength, and the challenge of addressing filter bubbles, followed by a Q&A addressing embedding validation, potential negative impacts of debiasing, and strategies for long‑tail items.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
DataFunSummit
Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
