Backend Development 15 min read

Design and Implementation of the AB Experiment Platform at Beike Zhaofang

The article details the design principles, layered traffic allocation model, architecture, data processing pipeline, and operational experience of the AB experiment platform used at Beike Zhaofang, highlighting its web, API, and storage layers, gray‑release capabilities, current limitations, and future improvements.

Beike Product & Technology

Jan 10, 2019

Design and Implementation of the AB Experiment Platform at Beike Zhaofang

The document introduces the AB experiment platform (Athena) built for Beike Zhaofang to enable fine‑grained product optimization through controlled A/B tests, ensuring each feature iteration is validated with real user conversion data.

It explains the basic A/B testing concept: two variants are shown to randomly split user traffic, and performance metrics such as PV/UV or conversion rates are compared to decide the winning version.

The platform adopts a layered experiment model inspired by Google’s overlapping experiment framework, distinguishing orthogonal experiments (non‑interfering) from mutually exclusive experiments, and using layers, domains, and buckets to manage traffic isolation and sharing.

Traffic bucketing is described: each experiment’s control and treatment groups form buckets within a layer, and a hash of user ID and layer ID determines the bucket, guaranteeing stable and random assignment.

The architecture consists of four main layers: web (UI for configuring experiments and audiences), API (client and server APIs for experiment decisions, using a uniqId passed via load balancer and stored in Redis), data (real‑time Kafka streams to HBase and offline Hive pipelines feeding ClickHouse for metric aggregation), and storage (MySQL for configs, HBase for real‑time logs, Redis for fast lookups, ClickHouse for analytical queries).

Audience targeting supports dimension‑based AB audiences (device, city, URL) and DMP‑derived user‑profile audiences, with white‑list support for testing specific users.

Gray‑release functionality allows dynamic traffic reallocation based on experiment performance, with a Java SDK that periodically fetches configuration, hashes user IDs to bucket assignments, and supports custom parameters per experiment group.

The platform currently serves over 40 concurrent experiments across multiple departments, providing statistical insights for product and algorithm decisions, though it lacks confidence intervals, full support for mutually exclusive layered experiments, and has high coupling in gray‑release implementation.

Future work aims to address these shortcomings, integrate AB testing more tightly into the development workflow, and gather user feedback for continuous improvement.

Map<String, Traffic> traffics = new HashMap<>(); // key: experiment name (e.g., "exp1")
// value: Traffic object containing group‑to‑bucket mappings
Map<String, Set<Integer>> traffic = new HashMap<>(); // key: group name (e.g., "group1"), value: set of bucket IDs

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

AB testing backend-architecture Experiment Platform traffic allocation

Written by

Beike Product & Technology

As Beike's official product and technology account, we are committed to building a platform for sharing Beike's product and technology insights, targeting internet/O2O developers and product professionals. We share high-quality original articles, tech salon events, and recruitment information weekly. Welcome to follow us.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.