Apache Doris 2.1.0 Release: Major Performance Boosts, New Data Types, Optimizer Enhancements and Operational Features
The Apache Doris 2.1.0 release introduces over 100% query performance improvements on TPC‑DS, up to 230% gains on ARM platforms, new Variant and IP data types, async materialized views, auto‑increment columns, auto‑partitioning, group commit, hardened workload groups, TopSQL monitoring, a built‑in job scheduler, and several behavior changes, all aimed at delivering faster, more flexible and more reliable OLAP processing.
Dear community members, we are pleased to announce the official release of Apache Doris 2.1.0 on March 8. This version focuses on out‑of‑the‑box performance, new data types, storage enhancements, workload management, and operational tooling.
Performance Highlights
Complex SQL queries achieve >100% performance improvement on TPC‑DS 1TB, surpassing industry leaders.
Data‑lake analysis sees 4‑6× speedup over Trino and Spark, with Arrow Flight‑based high‑throughput data reads improving transfer efficiency by 100×.
ARM architecture receives deep optimizations, delivering >230% performance gains on ClickBench and 93% on TPC‑H.
New Data Types
Variant type enables flexible semi‑structured data storage with columnar efficiency.
Native IPv4/IPv6 types reduce storage by ~60% and provide 20+ IP‑related functions.
Optimizer and Query Engine
Enhanced optimizer infrastructure, new rules (e.g., operator push‑down) and an improved Cascades/DPhyper enumeration framework.
Support for queries without statistics, adaptive runtime filters, and better handling of complex joins and window functions.
Materialized Views
Async multi‑table materialized views now support transparent rewrite, automatic refresh, external‑to‑internal table mapping, and direct query access.
Storage Enhancements
Auto‑increment columns (AUTO_INCREMENT) for OLTP‑style unique IDs.
Auto‑partitioning (AUTO PARTITION BY RANGE) simplifies data ingestion.
INSERT … SELECT performance doubled via MemTable pre‑placement and streaming RPC.
Group Commit (sync_mode / async_mode) aggregates high‑frequency writes, reducing CPU and I/O pressure.
Workload Management
Workload Group now enforces hard CPU limits via CGroup.
TopSQL function active_queries() provides per‑query resource metrics (CPU time, scan rows, memory, shuffle bytes).
Job Scheduler
A built‑in scheduler allows creation of recurring INSERT jobs with second‑level granularity, supporting start/end times, intervals, and execution tracing.
Behavior Changes
Unique‑key tables enable Merge‑On‑Write by default.
Bitmap indexes are deprecated in favor of Inverted indexes.
Segment compaction is enabled by default.
Audit log plugin is now built‑in and toggled via enable_audit_plugin.
For full details, refer to the official Apache Doris documentation links embedded throughout the release notes.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
Big Data Technology & Architecture
Wang Zhiwu, a big data expert, dedicated to sharing big data technology.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
