Backend Development 9 min read

Design and Evolution of Meituan's Real-Time Logistics Distributed System

This article details Meituan's instant logistics platform architecture, covering its background, distributed system design, high‑availability deployment, AI‑driven optimization, and future challenges, while sharing practical solutions for scalability, fault tolerance, and operational efficiency in a high‑concurrency environment.

Top Architect
Top Architect
Top Architect
Design and Evolution of Meituan's Real-Time Logistics Distributed System

Background

Meituan's food delivery has grown for five years and its instant logistics for over three years, accumulating experience in building high‑concurrency distributed systems. The main takeaways are low latency tolerance and the need for scalable, fault‑tolerant architecture.

Meituan Instant Logistics Architecture

The platform focuses on three core tasks: providing SLA guarantees such as ETA calculation and pricing, matching riders under cost‑efficiency‑experience trade‑offs, and offering rider‑side decision support (voice interaction, route recommendation, store‑arrival reminders). It leverages Meituan's common components, HLB load balancer, OCTO service registry, Kafka/RabbitMQ messaging, Zebra distributed database, CAT monitoring, Squirrel+Cellar cache, and Crane scheduler.

Distributed System Practices

Key challenges include massive order‑rider scale, traffic spikes during holidays, ultra‑low fault tolerance, and strict real‑time data requirements. Solutions involve converting stateful nodes to stateless, using Databus for DB‑cache consistency, extensive fault‑injection testing, periodic health checks, rapid rollback, and mechanisms such as rate limiting, circuit breaking, and degradation.

High‑Availability Deployment & Disaster Recovery

Describes single‑IDC fault detection with automatic traffic switching, rapid IDC scaling with pre‑synchronised data and ready‑state services, multi‑IDC virtual centres for capacity expansion, and unit‑level isolation to improve scaling and fault isolation.

Intelligent Logistics Core Technologies

Introduces a machine‑learning platform for end‑to‑end model training and deployment, and the JARVIS AIOps platform for alarm de‑duplication, automated incident analysis, and improving system stability.

Future Challenges

Future issues include increasing service complexity that inflates micro‑service size, network amplification effects from minor latency, fast fault localisation in complex topologies, and the operational shift from cluster‑level to unit‑level maintenance.

distributed systemsmicroservicesscalabilityHigh AvailabilityMeituanreal-time logistics
Top Architect
Written by

Top Architect

Top Architect focuses on sharing practical architecture knowledge, covering enterprise, system, website, large‑scale distributed, and high‑availability architectures, plus architecture adjustments using internet technologies. We welcome idea‑driven, sharing‑oriented architects to exchange and learn together.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.