Databases 24 min read

Apache Doris 2.0.0 Release Highlights and New Features

Apache Doris 2.0.0, released on August 11, 2023, introduces a new Cascades‑based optimizer, inverted index, point‑query acceleration, pipeline execution engine, multi‑tenant resource isolation, cloud‑native compute nodes, and extensive performance gains of up to ten‑fold in benchmark queries and dozens of times in real‑world workloads.

DataFunTalk
DataFunTalk
DataFunTalk
Apache Doris 2.0.0 Release Highlights and New Features

Apache Doris 2.0.0 was officially released on August 11, 2023, with contributions from over 275 developers and more than 4,100 improvements and fixes.

The new version delivers more than a ten‑fold increase in blind‑test query performance on standard benchmarks (SSB‑Flat, TPC‑H) and up to 13‑fold on multi‑table joins, thanks to a brand‑new Cascades‑based optimizer, richer statistics, and adaptive tuning that works out‑of‑the‑box for most SQL workloads.

Key feature highlights include:

Intelligent optimizer (enable_nereids_planner=true) that supports full TPC‑DS coverage and reduces manual tuning.

Inverted index support, boosting keyword, equality, and range queries by 5‑90× in real‑world user‑behavior analysis.

Point‑query concurrency improvements of over 20×, achieving 30,000 QPS per node via row‑column hybrid storage, row‑level cache, and short‑path execution.

Pipeline execution engine (Set enable_pipeline_engine = true) that decouples data flow from thread scheduling, improving CPU utilization and query stability.

Enhanced multi‑tenant and resource isolation with Resource Groups and Workload Groups, offering soft limits, query queuing, and automatic resource scaling.

Cold‑hot data tiering and compute‑node architecture for true cloud‑native separation of storage and compute.

Kubernetes‑friendly deployment with FQDN support, cross‑cluster data replication (CCR), and numerous enterprise‑grade usability improvements.

Performance gains also extend to data ingestion: write throughput increased up to 20×, with specific TPC‑H load tests showing 100‑200% higher throughput for Stream Load and INSERT‑SELECT operations.

Additional enhancements cover schema evolution, partial column updates, complex conditional updates, and automatic schema synchronization from upstream databases.

For users, the release provides ready‑to‑use download links (GitHub and official site), detailed release notes, and documentation for each new capability.

Overall, Apache Doris 2.0.0 positions itself as a high‑performance, cloud‑native, multi‑tenant analytical database suitable for real‑time dashboards, log analytics, lakehouse queries, and high‑concurrency data‑serving scenarios.

Cloud Nativemulti-tenantApache DorisQuery OptimizerDatabase ReleasePerformance Boost
DataFunTalk
Written by

DataFunTalk

Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.