Operations 9 min read

Interview with Lu Pengcheng on Mogu Street’s Monitoring System Architecture and Evolution

In this interview, Lu Pengcheng, a platform architect at Mogu Street, discusses the company’s large‑scale e‑commerce architecture, the evolution of its monitoring platform, design choices for high‑availability distributed systems, and future open‑source plans, providing practical insights for engineers and technical managers.

Architecture Digest
Architecture Digest
Architecture Digest
Interview with Lu Pengcheng on Mogu Street’s Monitoring System Architecture and Evolution

At the SDCC 2016 China Software Developers Conference in Beijing, CSDN interviewed Lu Pengcheng, a platform architect at Mogu Street, about his background, the company’s business, and the technical direction of its systems.

Lu introduced himself as a technology enthusiast focusing on monitoring, logging, and storage architectures, with experience across many domains such as distributed systems, smart operations, and both client‑side and server‑side development.

The core web stack of Mogu Street uses mainstream components like LVS, Nginx, and Tomcat for network services, while the main application layer combines PHP, SOA, and Java, supplemented by monitoring, logging, middleware, and DevOps platforms. Open‑source technologies such as HBase, Spark, Storm, Kafka, Redis, and ScyllaDB are heavily employed.

During the company’s transition from a social platform to a large‑scale e‑commerce site, the architecture evolved through several phases: moving from a monolithic PHP codebase to a service‑oriented architecture (SOA) built on the internally developed Tesla framework, shifting monitoring SDKs from PHP to Java, and rebuilding the monitoring system to reduce data overhead and improve performance.

Lu then described the design of Mogu Street’s monitoring platform, highlighting four key principles:

Decentralization – using a gossip‑based SWIM protocol for configuration sync and cluster management.

Role separation – isolating heavy query functions into peripheral systems for better scalability during traffic spikes.

No external dependencies – keeping the core system independent of third‑party services to ensure stability.

Scenario‑based architecture – simplifying configuration by mapping clear usage scenarios for data collection and visualization.

When building a high‑availability monitoring system, Lu emphasized several critical decisions: data type and granularity, sampling and aggregation methods, storage strategies (full vs. rolling, local vs. centralized), and alerting mechanisms (polling vs. real‑time, false‑positive tolerance, intelligent alerts).

The monitoring platform was implemented in C++ to leverage the team’s expertise and to meet the CPU‑intensive requirements of query‑time calculations. Lu argued that embracing C++’s complexity—using modern features like RAII, smart pointers, and lambdas—leads to more stable and efficient code.

Regarding team management, Lu’s monitoring team consists of six engineers who collaborate informally, focusing on mutual discussion, code reviews, and shared passion, which he believes drives productivity more than formal management.

Mogu Street has also contributed open‑source projects such as the IM system TeamTalk and maintains a suite of internally developed technologies (SOA framework, message queues, sharding, monitoring, logging, full‑link tracing, storage). While most are not yet open‑sourced due to business constraints, Lu hopes to release more components in the future.

Finally, Lu announced his upcoming talk at SDCC 2016 Beijing titled “Mogu Street Monitoring System Architecture and Evolution,” inviting attendees to learn more about the platform’s design and roadmap.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Distributed SystemsmonitoringOperationshigh availabilityC++
Architecture Digest
Written by

Architecture Digest

Focusing on Java backend development, covering application architecture from top-tier internet companies (high availability, high performance, high stability), big data, machine learning, Java architecture, and other popular fields.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.