Phoenix News Feed Recommendation System: Architecture, Modeling, and Feature Engineering

This article presents a comprehensive overview of Phoenix News's AI‑driven feed recommendation system, detailing its business challenges, multi‑stage architecture, deep learning models, feature pipelines, metric trade‑offs, cold‑start solutions, and practical insights for improving user satisfaction and content quality.

DataFunTalk
DataFunTalk
DataFunTalk
Phoenix News Feed Recommendation System: Architecture, Modeling, and Feature Engineering

As a user‑oriented news product, Phoenix News continuously explores applying AI across the entire content production and consumption chain, aiming to maximize human value by guiding users rather than merely catering to them.

The presentation covers business characteristics, overall recommendation architecture, feature engineering, and extended applications, highlighting key challenges such as item scale, user scale, product form, content timeliness, user satisfaction metrics, content quality assessment, cold‑start problems, and the balance between manual rules and machine algorithms.

The system follows a classic recall‑and‑ranking framework: a massive item pool is filtered through recall, coarse ranking, and fine‑grained multi‑objective ranking, finally refined by re‑ranking and manual rules to produce a shortlist for users.

Model evolution moved from traditional LR/GBDT to FM, FFM, and ultimately deep learning models, with embedding layers for users and items, FM for low‑order feature interactions, DNN for high‑order patterns, and attention mechanisms for session modeling. Multi‑objective learning optimizes CTR, duration, likes, comments, and other business metrics, with plans to explore reinforcement learning for hyper‑parameter tuning.

Feature pipelines integrate offline logs, real‑time logs, and online request data, ensuring consistency between training and inference. Features are categorized into user profiles, content profiles, and request context, and are often crossed to capture nuanced signals such as short‑term interests, demographic preferences, and contextual behaviors.

Advanced techniques include using FFM for optimized recall, addressing item cold‑start via vector similarity, and mitigating the “Matthew effect” by improving diversity and surprise. Additional extensions cover title‑clickbait detection, thumbnail aesthetic cropping, and intelligent thumbnail selection using fine‑tuned aesthetic ranking models.

Metric balancing is crucial: exposure, click‑through rate, reading duration, completion ratio, interaction frequency, diversity, and retention are jointly considered. Sample re‑weighting based on bounce rate further refines model training, leading to measurable online improvements.

Q&A sections discuss practical aspects such as real‑time model updates, feature real‑time vs. model real‑time trade‑offs, handling video and image features, and advice for practitioners transitioning into recommendation engineering.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

feature engineeringAIDeep Learningrecommendation systemrankingnews feed
DataFunTalk
Written by

DataFunTalk

Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.