
MercsDB: Architecture, Storage, Computation, and Optimization of Tencent's MPP Data Warehouse Engine

The article presents a comprehensive technical overview of MercsDB—formerly HermesDB—including its background, storage and indexing designs, native and Presto computation engines, vectorization optimizations, benchmark results, real‑world applications, and future development plans.


Background – MercsDB (formerly HermesDB) is Tencent's internally developed MPP data‑warehouse engine that has evolved over nine years through four major versions to meet increasingly demanding real‑time analytics workloads, supporting tables with thousands of columns, billions of rows, and high‑frequency updates.

Storage Architecture – The engine offers two storage modes: local disk for low‑latency, small‑scale workloads (with potential data loss on node failure) and remote storage (HDFS, Ceph, Ozone) for high availability and fault tolerance. Storage choices depend on HA, latency, and hot‑data requirements.

Column Types and Indexes – MercsDB implements several column types: retrieval columns (indexed for point look‑ups), sorting columns (grouping similar values for faster scans), compression columns (using BitShuffle+LZ4 for high‑cardinality data), and nested columns (Parquet‑like). Indexes include sparse, skip‑list, inverted, point‑lookup, and LBS indexes, each tailored to specific query patterns.
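To make the sparse-index idea concrete, here is a minimal sketch of how such an index can narrow a point look‑up to a single block of a sorted column. The class and method names are hypothetical illustrations, not MercsDB's actual API:

```java
import java.util.Arrays;

// Sketch of a sparse index over a sorted column: only every blockSize-th
// key is held in memory; a lookup binary-searches the sampled keys, then
// scans at most one block of the underlying column data.
class SparseIndex {
    private final long[] sampledKeys; // first key of each block, sorted
    private final int blockSize;
    private final long[] column;      // full sorted column (on disk in practice)

    SparseIndex(long[] sortedColumn, int blockSize) {
        this.column = sortedColumn;
        this.blockSize = blockSize;
        int blocks = (sortedColumn.length + blockSize - 1) / blockSize;
        sampledKeys = new long[blocks];
        for (int i = 0; i < blocks; i++) sampledKeys[i] = sortedColumn[i * blockSize];
    }

    // Returns the row position of key, or -1 if absent.
    int lookup(long key) {
        int idx = Arrays.binarySearch(sampledKeys, key);
        int block = idx >= 0 ? idx : -idx - 2; // block whose first key <= key
        if (block < 0) return -1;              // key smaller than every sampled key
        int start = block * blockSize;
        int end = Math.min(start + blockSize, column.length);
        for (int i = start; i < end; i++)      // short in-block scan
            if (column[i] == key) return i;
        return -1;
    }
}
```

The trade-off is the usual one: memory footprint shrinks by the block size at the cost of a bounded in-block scan per lookup.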

Computation Architecture – Queries enter via JDBC, HTTP, or gRPC and pass through an SQL routing layer that selects either the native engine or the open‑source Presto engine. The native engine performs filter, aggregation, and limit operations directly on indexed column data, while Presto is integrated via a connector to handle full OLAP workloads.
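The routing decision can be sketched as a capability check: if every operator in the plan is one the native index-driven engine can answer, route natively, otherwise fall back to Presto. The operator and engine names below are illustrative assumptions, not MercsDB's real interfaces:

```java
import java.util.Set;

// Hypothetical sketch of the SQL routing layer's engine choice.
class SqlRouter {
    // Operators the native engine can execute directly on indexed columns.
    static final Set<String> NATIVE_OPS = Set.of("FILTER", "AGG", "LIMIT", "ORDER");

    enum Engine { NATIVE, PRESTO }

    // Route natively only when the whole plan is natively executable;
    // anything else (e.g. JOIN, WINDOW) goes to Presto for full OLAP.
    static Engine route(Set<String> planOperators) {
        return NATIVE_OPS.containsAll(planOperators) ? Engine.NATIVE : Engine.PRESTO;
    }
}
```

For example, a plan containing only `FILTER` and `LIMIT` would stay on the native engine, while one containing `JOIN` would be handed to Presto.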

Operator Push‑Down – MercsDB pushes supported operators (filter, agg, limit, order, etc.) down to the storage layer using a stack‑based query generator, simplifying rule management and reducing traversal passes.
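A stack-based generator of this kind can assemble the pushed-down storage query in a single post-order pass: each plan node pops its children's already-generated fragments and pushes its own. The sketch below is an illustrative reconstruction under that assumption, not MercsDB's actual code:

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.List;

// Illustrative stack-based query generator: plan nodes arrive in
// post-order; each node consumes its children's fragments from the stack
// and pushes its own, so no extra tree-traversal passes are needed.
class PushdownGenerator {
    record Node(String op, String arg) {}

    static String generate(List<Node> postOrder) {
        Deque<String> stack = new ArrayDeque<>();
        for (Node n : postOrder) {
            switch (n.op) {
                case "COL"    -> stack.push(n.arg); // leaf: column reference
                case "FILTER" -> stack.push("filter(" + stack.pop() + " " + n.arg + ")");
                case "AND"    -> { String r = stack.pop(), l = stack.pop();
                                   stack.push("and(" + l + ", " + r + ")"); }
                case "LIMIT"  -> stack.push("limit(" + stack.pop() + ", " + n.arg + ")");
                default       -> throw new IllegalArgumentException("unsupported op: " + n.op);
            }
        }
        return stack.pop(); // the fully assembled storage-layer query
    }
}
```

A plan for `WHERE a > 5 AND b = 'x' LIMIT 10`, fed in post-order, would come out as `limit(and(filter(a > 5), filter(b = 'x')), 10)`. Keeping each operator's emission rule local to one `case` is what simplifies rule management relative to multi-pass rewrites.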

Vectorization and Java Vector API – Performance is further boosted by rewriting critical paths with Java Vector API (available from JDK 16) to leverage SIMD instructions, achieving up to ~16× speedup for batch operations. Consistent vector types and avoidance of boxing, object allocation, and unnecessary function calls are emphasized.
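A minimal example of the Vector API pattern the article describes: one consistent vector species, primitive arrays throughout (no boxing), and no per-element function calls inside the hot loop. This is a generic sketch of the technique, not MercsDB code, and it requires JDK 16+ with the incubator module enabled:

```java
// Compile/run with: --add-modules jdk.incubator.vector
import jdk.incubator.vector.FloatVector;
import jdk.incubator.vector.VectorSpecies;

class VectorAdd {
    // One species for the whole code path keeps lane widths consistent.
    static final VectorSpecies<Float> SPECIES = FloatVector.SPECIES_PREFERRED;

    // SIMD element-wise add of two equal-length float arrays.
    static float[] add(float[] a, float[] b) {
        float[] out = new float[a.length];
        int i = 0;
        int upper = SPECIES.loopBound(a.length); // largest multiple of lane count
        for (; i < upper; i += SPECIES.length()) {
            FloatVector.fromArray(SPECIES, a, i)
                       .add(FloatVector.fromArray(SPECIES, b, i))
                       .intoArray(out, i);
        }
        for (; i < a.length; i++) out[i] = a[i] + b[i]; // scalar tail
        return out;
    }
}
```

Each loop iteration processes `SPECIES.length()` elements in one SIMD instruction, which is where batch speedups of the magnitude the article cites come from.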

Additional Optimizations – Batch‑processing of formerly single‑call functions, sorted index‑to‑string conversion to improve cache locality, and extensive use of vectorized code have yielded noticeable latency reductions.
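The sorted index-to-string conversion can be sketched as follows: rather than decoding dictionary ids in row order (random dictionary access), visit rows in ascending id order so the dictionary is read sequentially, then scatter results back to their original positions. The names are illustrative, and packing id and position into one `long` is an assumed detail chosen so a primitive sort avoids boxing:

```java
import java.util.Arrays;

// Sketch of cache-friendly batched dictionary decoding.
class DictDecoder {
    // ids: per-row dictionary indices (non-negative); dict: id -> string.
    static String[] decodeBatch(int[] ids, String[] dict) {
        // Pack (id, row position) into one long: id in the high 32 bits,
        // position in the low 32 bits, so sorting orders by id first.
        long[] packed = new long[ids.length];
        for (int i = 0; i < ids.length; i++)
            packed[i] = ((long) ids[i] << 32) | i;
        Arrays.sort(packed); // primitive sort: no boxing, no comparator calls

        String[] out = new String[ids.length];
        for (long p : packed)                    // sequential dictionary reads
            out[(int) p] = dict[(int) (p >>> 32)];
        return out;
    }
}
```

One batched call replaces a per-row conversion function, and the sorted access pattern keeps dictionary pages hot in cache, which matches the latency reductions the article attributes to these changes.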

Benchmark Results – Using the Star Schema Benchmark (SSB) on 200 GB and 2 TB datasets, MercsDB shows significant speedups over its pre‑optimization version and outperforms ClickHouse in both single‑node and concurrent query scenarios, thanks to its index‑driven execution model.

Real‑World Applications – Deployed in Tencent's core apps (WeChat, QQ, Video, Games) and specific use cases such as WeChat Pay log search and advertising AB testing, where high‑throughput writes, mixed point‑lookup and full‑scan queries, and low‑latency requirements are critical.

Future Plans – Open‑source the engine, enhance cloud integration, continue vectorization work, improve fault‑tolerance and retry mechanisms, and further refine memory management and availability.

Tags: performance optimization, Big Data, columnar storage, vectorization, Presto, MPP, MercsDB
Written by DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.
