Design and Evolution of a Scalable Recommendation System Architecture (V1.0‑V3.0)

This article describes the progressive redesign of an e‑commerce recommendation platform—from a simple strategy‑factory V1.0 through a vertically split V2.0 to a fully configurable, pipeline‑driven V3.0—highlighting architectural challenges, Redis clustering, dynamic configuration, recall and prediction services, and future directions for fine‑grained, explainable recommendations.

Architecture Digest
Architecture Digest
Architecture Digest
Design and Evolution of a Scalable Recommendation System Architecture (V1.0‑V3.0)

1. Introduction Recommendation has become the core competitive advantage for e‑commerce platforms, appearing on virtually every page (home, detail, cart, checkout, error, etc.). It improves user experience, mitigates long‑tail effects, and drives product value and profitability.

2. Architecture Evolution

V1.0 used a simple strategy‑plus‑factory design to enable rapid business iteration but suffered from poor isolation, resource contention, and scaling limits due to a single JVM serving all upstream services.

V2.0 introduced vertical business splitting and horizontal stage‑based decomposition, isolating applications and storage per business line, reducing fault impact, and improving resource utilization. A pipeline scheduler was added to modularize stages such as recall, filter, coarse‑ranking, merge, fine‑ranking, intervention, and shuffling.

V3.0 adds a configuration service (server & client) that dynamically manages the recommendation pipeline. Handlers (pipeline nodes) are configured with AB‑test and strategy attributes, allowing runtime adjustments without code changes. The system now separates recall and prediction services into independent micro‑services, improving scalability and performance.

3. Configuration Service Design

The configuration server exposes RPC interfaces for heartbeat and configuration queries. It centrally manages all recommendation scenarios, enabling online strategy changes. The client periodically polls the server, synchronizes configurations, and assembles an executable handler chain based on user context (device, location, etc.).

4. Recall Service A unified full‑product recall pool is built in Elasticsearch, replacing scattered Redis storage. Real‑time product updates (price, stock, status) are propagated via MQ to keep the recall index fresh, ensuring all recall paths benefit from the latest data.

5. Prediction Service Model prediction is exposed as a service supporting multiple models and versions. Configuration‑driven sorting strategies can be switched on‑the‑fly, enhancing system performance, scalability, and the ability to deploy more complex models.

6. Outlook Future work focuses on fine‑grained operation, explainable recommendations, and real‑time feature enrichment to achieve personalized, “one‑to‑one” recommendations while maintaining system stability and efficiency.

Architecture diagram V1.0
Architecture diagram V1.0
Architecture diagram V2.0
Architecture diagram V2.0
Configuration service flow
Configuration service flow
Server architecture
Server architecture
Client architecture
Client architecture
AB testing flow
AB testing flow
Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

recommendationAIScalabilityDynamic ConfigurationPipeline
Architecture Digest
Written by

Architecture Digest

Focusing on Java backend development, covering application architecture from top-tier internet companies (high availability, high performance, high stability), big data, machine learning, Java architecture, and other popular fields.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.