Graph-Based I2I Recall for Short Video Recommendation at Kuaishou
This article presents Kuaishou's graph‑based item‑to‑item (I2I) recall pipeline for short‑video recommendation, detailing the business challenges, pipeline architecture, optimization techniques such as similarity‑measure tricks, graph structure learning, edge‑weight learning, and future research directions.
Short‑video recommendation faces challenges such as high noise, diverse targets, and cold‑start problems; Kuaishou's platform must efficiently retrieve relevant items from massive pools while ensuring precision and diversity.
The recall stage is the first step of the recommendation system, requiring fast, accurate, and diverse candidate selection. Traditional one‑hop methods suffer from data sparsity and bias, prompting the use of high‑order relationships captured by graph structures.
Kuaishou implements a graph‑based I2I recall pipeline built on internal platforms (IDP, KML) and open‑source tools (DGL, FAISS). The pipeline processes raw data, generates item pairs, constructs a heterogeneous graph, samples neighbors and negative examples, trains embeddings, and iterates with user feedback.
Optimization techniques include:
Similarity‑measure improvement by weighting the numerator of the Adamic‑Adar score with the smaller of two edge capacities, yielding higher offline hit‑rate.
Graph‑structure learning in collaboration with Tsinghua's Cui‑Feng lab, using confidence networks and set‑level resampling to denoise data, followed by a VAE‑based encoder‑decoder that reconstructs purified user‑item sets, improving robustness to injected noise.
Edge‑weight learning that predicts edge confidence, enabling weighted sampling during embedding training, which accelerates convergence and enhances model robustness.
Online observations show that graph recall slightly reduces precision compared with pure target‑oriented I2I but significantly improves diversity, especially in commercial ad scenarios where it surfaces varied product styles.
Future work aims to model multi‑interest users via embedding distributions, handle constrained recall pools for ads, incorporate contrastive and unsupervised learning for denoising, and improve interpretability and dynamic graph updates.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
DataFunSummit
Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
