Databases 21 min read

Meituan's Graph Database Selection and Platform Construction

Meituan evaluated open‑source distributed graph databases against strict latency, scale, and import criteria, selected NebulaGraph for its superior multi‑hop query and bulk‑load performance, and built a four‑layer, highly available platform that ingests petabyte‑scale data in real time, supports diverse business use cases, and provides interactive visualization.

Meituan Technology Team

Apr 1, 2021

Meituan's Graph Database Selection and Platform Construction

Meituan faces massive graph data storage and multi‑hop query requirements, with billions of vertices and edges across various business scenarios such as knowledge graph mining, security risk control, link analysis, and organizational management.

The article outlines the selection process for a distributed, open‑source graph database, evaluating candidates against five criteria: open source, distributed architecture, millisecond‑level multi‑hop latency, support for trillions of edges, and bulk import capability. Three categories were identified: (1) single‑node solutions like Neo4j, (2) JanusGraph‑style systems that add a graph layer on existing stores, and (3) native distributed stores such as DGraph and NebulaGraph. Benchmarks on the LDBC‑SNB dataset showed NebulaGraph outperforming competitors in data import, real‑time writes, and multi‑hop queries.

NebulaGraph’s architecture consists of three services—Meta Service, Storage Service, and Query Service—implemented in C++. It adopts a shared‑nothing distributed storage layer, separates compute from storage, and uses Raft for strong consistency. The platform built on NebulaGraph provides four layers: data‑application, data‑storage, data‑production, and supporting services, offering high availability (AP mode), hourly billions‑scale data import via Spark‑generated SST files, real‑time multi‑cluster write synchronization through Kafka, and an interactive graph visualization component.

Four business use cases are presented: an intelligent assistant powered by a food‑service knowledge graph, a medical‑beauty search recall system, a graph‑based recommendation‑reason generation module, and a code‑dependency analysis tool that writes code‑level dependencies into the graph for impact analysis and testing.

In summary, the platform delivers a self‑service, highly available graph database solution that handles petabyte‑scale data with millisecond latency, supports massive data ingestion, ensures eventual consistency across clusters, and offers rich visualization and management features. Future work includes enhancing the core graph engine for stability and exploring graph learning and computation capabilities.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

distributed systems High Availability graph database visualization data ingestion NebulaGraph Real-time Sync

Written by

Meituan Technology Team

Over 10,000 engineers powering China’s leading lifestyle services e‑commerce platform. Supporting hundreds of millions of consumers, millions of merchants across 2,000+ industries. This is the public channel for the tech teams behind Meituan, Dianping, Meituan Waimai, Meituan Select, and related services.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.