Databases 42 min read

Apache Doris 2.1.0 Release: Major Performance Boosts, New Data Types, Optimizer Enhancements and Operational Features

The Apache Doris 2.1.0 release introduces over 100% query performance improvements on TPC‑DS, up to 230% gains on ARM platforms, new Variant and IP data types, async materialized views, auto‑increment columns, auto‑partitioning, group commit, hardened workload groups, TopSQL monitoring, a built‑in job scheduler, and several behavior changes, all aimed at delivering faster, more flexible and more reliable OLAP processing.

Big Data Technology & Architecture
Big Data Technology & Architecture
Big Data Technology & Architecture
Apache Doris 2.1.0 Release: Major Performance Boosts, New Data Types, Optimizer Enhancements and Operational Features

Dear community members, we are pleased to announce the official release of Apache Doris 2.1.0 on March 8. This version focuses on out‑of‑the‑box performance, new data types, storage enhancements, workload management, and operational tooling.

Performance Highlights

Complex SQL queries achieve >100% performance improvement on TPC‑DS 1TB, surpassing industry leaders.

Data‑lake analysis sees 4‑6× speedup over Trino and Spark, with Arrow Flight‑based high‑throughput data reads improving transfer efficiency by 100×.

ARM architecture receives deep optimizations, delivering >230% performance gains on ClickBench and 93% on TPC‑H.

New Data Types

Variant type enables flexible semi‑structured data storage with columnar efficiency.

Native IPv4/IPv6 types reduce storage by ~60% and provide 20+ IP‑related functions.

Optimizer and Query Engine

Enhanced optimizer infrastructure, new rules (e.g., operator push‑down) and an improved Cascades/DPhyper enumeration framework.

Support for queries without statistics, adaptive runtime filters, and better handling of complex joins and window functions.

Materialized Views

Async multi‑table materialized views now support transparent rewrite, automatic refresh, external‑to‑internal table mapping, and direct query access.

Storage Enhancements

Auto‑increment columns (AUTO_INCREMENT) for OLTP‑style unique IDs.

Auto‑partitioning (AUTO PARTITION BY RANGE) simplifies data ingestion.

INSERT … SELECT performance doubled via MemTable pre‑placement and streaming RPC.

Group Commit (sync_mode / async_mode) aggregates high‑frequency writes, reducing CPU and I/O pressure.

Workload Management

Workload Group now enforces hard CPU limits via CGroup.

TopSQL function active_queries() provides per‑query resource metrics (CPU time, scan rows, memory, shuffle bytes).

Job Scheduler

A built‑in scheduler allows creation of recurring INSERT jobs with second‑level granularity, supporting start/end times, intervals, and execution tracing.

Behavior Changes

Unique‑key tables enable Merge‑On‑Write by default.

Bitmap indexes are deprecated in favor of Inverted indexes.

Segment compaction is enabled by default.

Audit log plugin is now built‑in and toggled via enable_audit_plugin.

For full details, refer to the official Apache Doris documentation links embedded throughout the release notes.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

SQLdatabasematerialized viewApache DorisARM Optimization
Big Data Technology & Architecture
Written by

Big Data Technology & Architecture

Wang Zhiwu, a big data expert, dedicated to sharing big data technology.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.