Artificial Intelligence 15 min read

Decoding Xiaohongshu’s Recommendation System: How Ordinary Users Gain Visibility

Xiaohongshu’s recommendation system uses large‑scale multimodal embeddings, dual‑tower and graph models, and diversity techniques like DPP and SSD to quickly surface high‑quality user‑generated content, enabling ordinary users to gain visibility while balancing personalization, exploration, and efficient LLM‑augmented pipelines.

Xiaohongshu Tech REDtech

Jan 20, 2024

Decoding Xiaohongshu’s Recommendation System: How Ordinary Users Gain Visibility

The article explores the recommendation system of Xiaohongshu, a popular Chinese lifestyle platform, focusing on why ordinary users can easily become visible and how the platform balances user experience, community interaction, and content diversity.

It describes the platform’s shift from a simple “grass‑planting” tool to a comprehensive “life encyclopedia,” emphasizing a decentralized distribution strategy that prioritizes high‑quality user‑generated content (UGC) and multi‑modal data (text, images, video, live streams).

Key technical components include a large‑scale multimodal pre‑training model (similar to CLIP) that converts notes into vector representations, enabling fine‑grained content clustering and dynamic tag generation. Various backbone architectures (BERT, RoBERTa, ResNet, Swin‑T, ViT) are employed to support downstream tasks.

The cold‑start pipeline consists of four steps: (1) content information extraction using NLP, CV, and multimodal fusion; (2) seed‑user selection via dual‑tower models and graph neural networks, with Bayesian optimization for boost coefficients; (3) look‑alike user expansion based on behavior feedback, improving click‑through rate by ~7%; and (4) continuous model updates with minute‑level iteration for recall, coarse‑ranking, and fine‑ranking stages.

To ensure recommendation diversity, Xiaohongshu adopts exploitation‑exploration strategies, leveraging deterministic point processes (DPP) and multi‑granular sampling (MGS) to disperse similar items and capture long‑tail interests. The Sliding Spectrum Decomposition (SSD) model transforms per‑item scoring into a session‑level optimization, while the CB2CF method replaces traditional collaborative filtering with content‑based similarity for better long‑tail handling.

The article also discusses the integration of large language models (LLMs) into the recommendation pipeline, noting challenges such as latency, resource consumption, and explainability. Xiaohongshu prefers augmenting existing systems with LLMs (e.g., as feature encoders or control modules) rather than replacing the entire stack.

Overall, the piece provides a comprehensive view of how advanced AI techniques—multimodal embeddings, cold‑start pipelines, diversity models, and LLM augmentation—enable Xiaohongshu to deliver personalized, equitable content discovery for ordinary users.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Multimodal AI recommendation system Large Language Models cold-start content diversity Xiaohongshu

Written by

Xiaohongshu Tech REDtech

Official account of the Xiaohongshu tech team, sharing tech innovations and problem insights, advancing together.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.