Backend Development 9 min read

Design and Evolution of Meituan's Real-Time Logistics Distributed System

This article details Meituan's instant logistics platform architecture, covering its background, distributed system design, high‑availability deployment, AI‑driven optimization, and future challenges, while sharing practical solutions for scalability, fault tolerance, and operational efficiency in a high‑concurrency environment.

Top Architect

Jul 15, 2022

Design and Evolution of Meituan's Real-Time Logistics Distributed System

Background

Meituan's food delivery has grown for five years and its instant logistics for over three years, accumulating experience in building high‑concurrency distributed systems. The main takeaways are low latency tolerance and the need for scalable, fault‑tolerant architecture.

Meituan Instant Logistics Architecture

The platform focuses on three core tasks: providing SLA guarantees such as ETA calculation and pricing, matching riders under cost‑efficiency‑experience trade‑offs, and offering rider‑side decision support (voice interaction, route recommendation, store‑arrival reminders). It leverages Meituan's common components, HLB load balancer, OCTO service registry, Kafka/RabbitMQ messaging, Zebra distributed database, CAT monitoring, Squirrel+Cellar cache, and Crane scheduler.

Distributed System Practices

Key challenges include massive order‑rider scale, traffic spikes during holidays, ultra‑low fault tolerance, and strict real‑time data requirements. Solutions involve converting stateful nodes to stateless, using Databus for DB‑cache consistency, extensive fault‑injection testing, periodic health checks, rapid rollback, and mechanisms such as rate limiting, circuit breaking, and degradation.

High‑Availability Deployment & Disaster Recovery

Describes single‑IDC fault detection with automatic traffic switching, rapid IDC scaling with pre‑synchronised data and ready‑state services, multi‑IDC virtual centres for capacity expansion, and unit‑level isolation to improve scaling and fault isolation.

Intelligent Logistics Core Technologies

Introduces a machine‑learning platform for end‑to‑end model training and deployment, and the JARVIS AIOps platform for alarm de‑duplication, automated incident analysis, and improving system stability.

Future Challenges

Future issues include increasing service complexity that inflates micro‑service size, network amplification effects from minor latency, fast fault localisation in complex topologies, and the operational shift from cluster‑level to unit‑level maintenance.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

distributed systems Microservices scalability High Availability Meituan real-time logistics

Written by

Top Architect

Top Architect focuses on sharing practical architecture knowledge, covering enterprise, system, website, large‑scale distributed, and high‑availability architectures, plus architecture adjustments using internet technologies. We welcome idea‑driven, sharing‑oriented architects to exchange and learn together.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.