Artificial Intelligence 8 min read

STAN: A User‑Lifecycle‑Based Multi‑Task Recommendation Model for Shopee

The article introduces STAN, a multi‑task recommendation framework that leverages user lifecycle segmentation to jointly optimize CTR, stay‑time, and CVR, detailing the business context, key challenges, solution architecture, offline and online evaluations, and future research directions.

DataFunSummit
DataFunSummit
DataFunSummit
STAN: A User‑Lifecycle‑Based Multi‑Task Recommendation Model for Shopee

Introduction Since the success of MMoE and PLE models, Shopee proposes a new multi‑task recommendation approach—STAN—that refines user segmentation based on lifecycle stages to improve CTR, CVR, and stay‑time.

Business Background Shopee’s feed consists of a dual‑column layout where users progress through stages: new users with low orders and time, wandering users with increasing dwell time but low conversion, and loyal users with high CVR but shorter sessions. Different user groups exhibit distinct preferences for recommendation tasks.

Key Problems Existing multi‑task methods treat all users uniformly, causing a trade‑off (optimization seesaw). The core issues are: (1) how to identify user state, (2) how to accurately track user state over time, and (3) how to incorporate state information into multi‑task models.

Solution – STAN STAN augments the traditional PLE/MMoE architecture with a left‑hand side that models user information. An attention‑based user feature extractor generates state‑aware embeddings, which are used to adjust the loss. Three sub‑components address the key problems:

Identify user state via attention‑driven user representations derived from interaction features.

Track user state by estimating per‑task scores and applying a user‑adaptive Beta‑distribution resampling to mitigate uncertainty for sparse data.

Combine state with multi‑task learning by adding a state‑aware loss term to the existing multi‑task loss, training jointly.

Offline Evaluation Understanding experiments show STAN’s embeddings separate Wander, Stick, and Loyal users better than PLE. On Shopee’s internal dataset, STAN improves AUC and NDCG@1 across tasks, especially after adding adaptive stages and Beta resampling. Similar gains are observed on a public WeChat video dataset (AUC, NDCG@5).

Online Results Deploying STAN in production (baseline PLE) yields CTR + 3.94 %, stay‑time + 3.05 %, and order + 0.88 % improvements, confirming the model’s effectiveness in real‑world traffic.

Conclusion & Outlook Modeling user lifecycle is crucial for multi‑task recommendation. Future work will integrate finer‑grained lifecycle adjustments at each candidate generation stage and continue to innovate methods that translate into business impact.

ctrdeep learningCVRmulti-task learningRecommendation systemsuser lifecycle
DataFunSummit
Written by

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.