Databases 17 min read

How OceanBase Crushed the TPC‑H Benchmark with HTAP Innovation

OceanBase, the distributed relational database from Ant Group, set a new TPC‑H world record by leveraging its HTAP architecture, blending transaction and analytical processing, and the article details the technical evolution, challenges, and future direction of this hybrid database technology.

Alibaba Cloud Developer
Alibaba Cloud Developer
Alibaba Cloud Developer
How OceanBase Crushed the TPC‑H Benchmark with HTAP Innovation

HTAP: The Database's Original Intent and Future

HTAP (Hybrid Transaction and Analytical Processing) combines online transaction processing (TP) and online analytical processing (AP) in a single system, eliminating costly ETL steps and enabling real‑time analysis, a capability identified by Gartner in 2014 as a key competitive advantage.

OceanBase Embeds HTAP in Its DNA

Developed internally at Alibaba/Ant Group since 2010, OceanBase evolved from a TP‑focused system to a true HTAP engine, achieving a world‑record 15.26 million QphH in the TPC‑H benchmark on May 20 2021, making it the only Chinese‑developed database to top both TP and AP rankings.

The journey involved building a cost‑based optimizer (2014), distributed execution (2016), vectorized execution and resource isolation for TP/AP (2019‑2020), and integrating OceanBase File System (OFS) for virtually unlimited storage.

The Three Mountains Behind the TPC‑H Record

Key challenges include massive capacity and scalability, strong query‑optimization for complex analytical workloads, and effective resource isolation between TP and AP requests. OceanBase addressed these with horizontal scaling across hundreds of nodes, cache‑aware vectorized processing, and virtualized resource groups.

Where is the Boundary of HTAP? Future Direction

OceanBase aims to simplify system complexity for customers while maintaining its core TP strengths, avoiding trade‑offs that favor AP at the expense of transaction performance. The roadmap focuses on enhancing HTAP capabilities, reducing operational overhead, and continuing to push the limits of hybrid processing.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

distributed databaseHTAPOceanBaseTPC-HHybrid Processing
Alibaba Cloud Developer
Written by

Alibaba Cloud Developer

Alibaba's official tech channel, featuring all of its technology innovations.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.