Building and Applying an Experiment Platform: A/B Testing, System Integration, and Analysis
This article shares practical experience and insight on building an experiment platform, covering A/B testing applications, platform selection, broader business use cases, rapid system integration, and robust experiment analysis techniques such as pre‑AA checks, SRM validation, Welch's t-test, and the z-test.
The article introduces the construction and application of an experiment platform, emphasizing A/B testing as a gold‑standard method for hypothesis validation and noting that the number of experiments a company runs online serves as a hidden indicator of its scale and experimentation maturity.
Two main approaches for platform selection are discussed: purchasing third‑party solutions (e.g., Volcano Engine, Tencent, VWO, Google Optimize, Optimizely) for smaller user bases, and building a self‑hosted platform for large‑scale companies (e.g., Didi, Meituan, Alibaba, NetEase, Microsoft, Google) that require customized integration.
Beyond basic A/B tests, the article explores broader business applications, including product, backend, algorithm, and marketing experiments, and presents various experiment types such as flow‑based, joint, and cross‑business experiments, highlighting the importance of traffic, intervention, and analysis.
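A prerequisite for all of these experiment types is a deterministic traffic split: each user must land in the same variant on every request, and buckets must be independent across experiments. The sketch below is a minimal illustration of the common hash-based approach; the function names, the 100-bucket granularity, and the salting of the hash with the experiment ID are illustrative assumptions, not the platform's actual implementation.

```python
import hashlib

def assign_bucket(user_id: str, experiment_id: str, num_buckets: int = 100) -> int:
    """Deterministically map a user to one of `num_buckets` traffic buckets.

    Salting the hash with the experiment ID keeps bucket assignments
    independent across experiments, so concurrent experiments do not
    systematically share the same users in the same variants.
    """
    key = f"{experiment_id}:{user_id}".encode("utf-8")
    digest = hashlib.md5(key).hexdigest()
    return int(digest, 16) % num_buckets

def assign_variant(user_id: str, experiment_id: str, treatment_pct: int = 50) -> str:
    """Split traffic by bucket: buckets below the threshold get treatment."""
    bucket = assign_bucket(user_id, experiment_id)
    return "treatment" if bucket < treatment_pct else "control"
```

Because the assignment depends only on the user ID and experiment ID, both server and client can compute it without any shared state, which is what makes flow execution cheap at request time.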
For faster system integration, the discussion covers experiment configuration distribution, flow execution services, parameter integration (variant‑based, key‑value, and configuration‑based controls), parameter priority, and experiment logging (flow logs for servers and trigger logs for clients), illustrating how logs support rapid analyst insights.
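When key‑value parameters arrive from several layers of configuration, the integration layer has to resolve conflicts by priority. The article does not spell out the exact precedence order, so the sketch below assumes a common convention (variant overrides beat experiment defaults, which beat global config); the function name and layer names are hypothetical.

```python
def resolve_params(global_config: dict,
                   experiment_defaults: dict,
                   variant_params: dict) -> dict:
    """Merge parameter layers from lowest to highest priority.

    Assumed precedence (a common convention, not confirmed by the article):
    global config < experiment defaults < variant-specific overrides.
    Later dict.update calls overwrite earlier keys.
    """
    merged = dict(global_config)
    merged.update(experiment_defaults)
    merged.update(variant_params)
    return merged

# Hypothetical usage: the variant overrides the button color, the
# experiment overrides the timeout, and untouched keys fall through.
params = resolve_params(
    {"btn_color": "blue", "timeout_ms": 300},
    {"timeout_ms": 500},
    {"btn_color": "red"},
)
# params == {"btn_color": "red", "timeout_ms": 500}
```

Resolving parameters into a single flat dictionary before handing them to business code also simplifies logging: the flow log can record exactly which merged values the user was served.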
The final section focuses on rigorous experiment analysis, describing pre‑AA checks, SRM validation, and statistical tests (Welch's t-test for mean‑type metrics, the z-test for conversion rates), along with metric management practices, including key, guard, and OEC metrics, and three stages of metric governance: business‑driven, task‑driven, and event‑driven.
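The checks above can be sketched with standard formulas: SRM is a chi‑square goodness‑of‑fit test on the assignment counts, conversion rates use a pooled two‑proportion z‑test, and mean‑type metrics use Welch's t statistic with Welch–Satterthwaite degrees of freedom. This is a minimal stdlib-only illustration, not the platform's code; for 1 degree of freedom the chi‑square p‑value reduces exactly to `erfc`, while a proper t‑distribution p‑value would need e.g. `scipy.stats` (the normal approximation is close only for large samples).

```python
import math
from statistics import NormalDist

def srm_check(n_control: int, n_treatment: int, expected_ratio: float = 0.5) -> float:
    """Sample Ratio Mismatch: chi-square goodness-of-fit, 1 degree of freedom.

    Returns the p-value; a very small value (e.g. < 0.001) flags a broken
    traffic split, and the experiment's results should not be trusted.
    """
    total = n_control + n_treatment
    exp_c = total * expected_ratio
    exp_t = total * (1 - expected_ratio)
    chi2 = (n_control - exp_c) ** 2 / exp_c + (n_treatment - exp_t) ** 2 / exp_t
    return math.erfc(math.sqrt(chi2 / 2))  # exact chi2 survival function for 1 df

def two_proportion_z(conv_c: int, n_c: int, conv_t: int, n_t: int) -> float:
    """Two-sided z-test p-value for a difference in conversion rates."""
    p_c, p_t = conv_c / n_c, conv_t / n_t
    pooled = (conv_c + conv_t) / (n_c + n_t)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_c + 1 / n_t))
    z = (p_t - p_c) / se
    return 2 * NormalDist().cdf(-abs(z))

def welch_t_stat(m1, s1, n1, m2, s2, n2):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom
    for mean-type metrics with unequal variances."""
    v1, v2 = s1 ** 2 / n1, s2 ** 2 / n2
    t = (m1 - m2) / math.sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, df
```

Running `srm_check` before looking at any metric, as the article's pre‑AA guidance suggests, prevents drawing conclusions from an experiment whose randomization was already broken.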
A Q&A segment addresses recommended literature, ensuring uniform traffic distribution in client‑side experiments, differences between strong and weak traffic perception, appropriate statistical methods for total‑value metrics, handling metric volatility, and strategies for evaluating numerous metrics simultaneously.
DataFunTalk
Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.