Product Management 9 min read

Why Simple Click Counts Fail: Smarter Scoring Strategies for Content Recommendation

The article recounts a junior engineer's journey improving a news app's recommendation system, moving from naive click counts to recent clicks, CTR, lower confidence bounds, and advanced multi‑armed bandit techniques like UCB and Thompson Sampling to balance relevance and novelty.

21CTO

Jul 1, 2017

Why Simple Click Counts Fail: Smarter Scoring Strategies for Content Recommendation

WA #1: Score = #Clicks

The newcomer initially thought recommending the most‑clicked articles would satisfy users, but this quickly showed that old popular content dominated the feed.

WA #2: Score = #Recent Clicks

Switching to the most‑clicked items from the past month improved freshness, yet the feed still mirrored competitors and lacked surprise.

Interlude 1: Indirect Feedback

Because explicit ratings are rare, the team turned to implicit signals—clicks, watch time, purchases—to infer user preferences, acknowledging the cold‑start problem for new items.

WA #3: Score = CTR

Click‑through rate (CTR = clicks / impressions) was adopted as a quality metric, but clickbait titles and extreme CTR volatility for new content revealed its limitations.

WA #4: Score = LCB(CTR)

Inspired by statistical peers, the lower confidence bound (LCB) of CTR (via Wilson Score) provided a conservative estimate, boosting exposure for reliably high‑CTR items while still allowing some new content.

Interlude 2: Multi‑Armed Bandits

The classic multi‑armed bandit problem—choosing among several “arms” (content pieces) to maximize reward (clicks)—was introduced as a framework for balancing exploration and exploitation.

Suggestion #1: Score = Random + LCB

A small ε‑fraction of users are shown random items to discover potential hits, while the majority see top‑scoring content.

Suggestion #2: Score = UCB(CTR)

Upper confidence bound (UCB) complements LCB by favoring items with high optimistic estimates, quickly surfacing new content but risking over‑exposure if the service fails.

Suggestion #3: Thompson Sampling

Thompson Sampling draws a CTR value from the Beta(click+1, impression‑click+1) posterior for each item, ranking by the sampled value; this Bayesian approach offers a simple yet effective solution to the bandit problem.

Conclusion

By moving from raw click counts to probabilistic scoring methods—LCB, UCB, and Thompson Sampling—the recommendation system achieves a better trade‑off between popular and novel content, reducing reliance on clickbait and improving user experience.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

CTR content recommendation UCB Thompson Sampling LCB multi-armed bandits

Written by

21CTO

21CTO (21CTO.com) offers developers community, training, and services, making it your go‑to learning and service platform.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.