Meituan Technology Team
Nov 15, 2018 · Artificial Intelligence
Reinforcement Learning for Meituan's "Guess You Like" Recommendation Ranking
Meituan enhanced its homepage “Guess You Like” recommendation slot by modeling user‑item interactions as a Markov Decision Process and applying an improved DDPG reinforcement‑learning agent that adjusts the ranking trade‑off parameter, uses advantage‑based Q decomposition, shares actor‑critic weights, and runs in a real‑time TensorFlow pipeline, delivering consistent lifts in click‑through, dwell time, and depth.
DDPGMDP ModelingOnline Learning
0 likes · 21 min read
