Why FM and FFM Still Dominate Large‑Scale Sparse CTR Prediction
This article explains the principles of Factorization Machines (FM) and Field‑aware Factorization Machines (FFM), their implementation details, and how Meituan‑Dianping applied FFM in a DSP platform to achieve superior CTR and CVR estimation for sparse, high‑dimensional advertising data.
Background
In computational advertising, click‑through rate (CTR) and conversion rate (CVR) are critical metrics; accurate prediction directly boosts ad revenue. Common approaches include Logistic Regression, GBDT + LR, Factorization Machine (FM), and Field‑aware Factorization Machine (FFM). FM and FFM have repeatedly won major CTR competitions such as Criteo and Avazu, demonstrating strong performance on large, sparse datasets.
FM Principle
FM models pairwise feature interactions by representing each feature with a low‑dimensional latent vector. The interaction score is obtained via the inner product of two vectors, allowing the model to capture hidden relationships even when data is extremely sparse, while keeping computational cost linear in the number of non‑zero features.
FFM Improvements
FFM extends FM by assigning a separate latent vector for each feature‑field pair, where a "field" groups related features (e.g., user, item, context). This field‑aware representation enables more precise modeling of interactions across different feature groups, leading to higher predictive accuracy in CTR/CVR tasks.
Implementation Details
Practical deployment involves several steps:
Data preprocessing and feature hashing to manage billions of sparse features.
Generating field‑aware latent vectors, often using open‑source libraries such as libffm or custom implementations.
Training with stochastic gradient descent (SGD) or adaptive variants, applying L2 regularization and learning‑rate schedules.
Hyper‑parameter tuning (latent dimension, learning rate, number of epochs) based on validation AUC.
The Meituan‑Dianping team customized the pipeline for their Demand‑Side Platform (DSP), integrating feature pipelines, model training, and online inference within their ad‑bidding system.
Application in DSP
FFM was deployed to predict both CTR and CVR for real‑time bidding decisions. The model’s superior ranking ability resulted in higher predicted click probabilities, improved bid pricing, and ultimately increased revenue. Empirical results reported a noticeable lift in AUC and revenue per mille (RPM) compared with baseline LR/GBDT solutions.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
Meituan Technology Team
Over 10,000 engineers powering China’s leading lifestyle services e‑commerce platform. Supporting hundreds of millions of consumers, millions of merchants across 2,000+ industries. This is the public channel for the tech teams behind Meituan, Dianping, Meituan Waimai, Meituan Select, and related services.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
