High-Performance User Behavior Analysis Solution for Massive Data
The paper describes a high‑performance user‑behavior analysis system that processes hundreds of billions of daily logs for Meituan‑Dianping, using an inverted‑index structure with bitmap UUID sets and timestamp sequences, combined with Spark, Spring and Alluxio optimizations to cut query times from hours to under five seconds.
This paper presents a high-performance user behavior analysis solution designed to handle massive datasets, specifically addressing the challenge of analyzing hundreds of billions of user behavior logs collected daily by Meituan-Dianping. The solution significantly reduces analysis time from hours to seconds, greatly improving analysis experience and work efficiency.
The paper focuses on the ordered funnel analysis scenario, detailing the problem analysis, design approach, and engineering implementation and optimization process. It is based on a presentation from ArchSummit Beijing in December 2017.
The core problem involves multi-dimensional analysis combined with sequence matching for deduplication counting. The solution introduces an inverted index-based data structure that enables efficient dimension filtering and sequence matching. The index consists of three parts: attribute names and values, UUID sets stored in bitmap format for fast filtering, and timestamp sequences for sequence matching.
The algorithm design includes a matching algorithm that handles sequence relationships, time window constraints, and property/dimension requirements. The engineering implementation uses Spring for interface services, Spark for distributed computing, and Alluxio for storage. Key optimizations include local scheduling, memory mapping, and unsafe calls to achieve significant performance improvements.
The solution has been successfully deployed in production, handling hundreds of billions of daily data records, supporting hundreds of millions of active users, and achieving sub-5-second query response times for interactive analysis.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
Meituan Technology Team
Over 10,000 engineers powering China’s leading lifestyle services e‑commerce platform. Supporting hundreds of millions of consumers, millions of merchants across 2,000+ industries. This is the public channel for the tech teams behind Meituan, Dianping, Meituan Waimai, Meituan Select, and related services.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
