Tag

Gradient Normalization

0 views collected around this technical thread.

DaTaobao Tech
DaTaobao Tech
Mar 29, 2022 · Artificial Intelligence

Dynamic Weight Averaging and Gradient Normalization for Multi‑Task Recommendation Models

To improve multi‑task recommendation in the “每平每屋” system, the team augments an MMoE ranking model with dynamic weight averaging, dynamic task prioritization, and GradNorm gradient normalization, stabilizing loss convergence across CTR, CVR, and fav tasks and delivering 3–4% online metric gains.

A/B testingDynamic Weight AveragingGradient Normalization
0 likes · 10 min read
Dynamic Weight Averaging and Gradient Normalization for Multi‑Task Recommendation Models