Databases 7 min read

Key New Features of Doris 2.0 and Their Impact on Data Development

The article reviews Doris 2.0's major enhancements—including point‑query concurrency, log‑analysis capabilities, cold‑hot data separation, lakehouse integration, and various performance and usability upgrades—explaining how these changes benefit OLAP workloads and simplify data‑engineering pipelines.

Big Data Technology & Architecture

Aug 14, 2023

Key New Features of Doris 2.0 and Their Impact on Data Development

On the recent Friday, the Doris community announced version 2.0, which brings substantial performance gains and a set of new features that deserve special attention.

The official release notes highlight improvements in several areas: log analysis, data‑lake federation scenarios, data‑update efficiency and write performance, resource elasticity with compute‑storage separation, and other enterprise‑oriented usability features.

Point‑Query Concurrency Support

In data‑development, point (or KV) queries traditionally required external systems such as Apache HBase or Redis to handle high‑concurrency lookups. Doris 2.0 introduces a short‑path for point queries, allowing small‑scale workloads to avoid additional components, thereby reducing stack complexity and data redundancy.

The underlying mechanisms involve cache optimization, row‑store format, short‑path query routing, statement pre‑processing, and a Row Cache; users should understand these principles and best‑practice production settings.

Log‑Analysis Scenarios

Doris 2.0 adds features like inverted indexes and semi‑structured data types, enabling it to serve as a low‑cost, easy‑to‑use alternative to a full ELK stack for log analysis, especially when the system scale does not justify the operational overhead of FileBeat, Logstash, Kafka, and Kibana.

Cold‑Hot Data Separation

The new version supports cold‑hot separation, a common concept in big‑data systems. Doris can automatically move hot data from SSD to HDD using dynamic partition lifecycle management, significantly lowering storage costs for large‑scale workloads.

Lakehouse Integration

Building on earlier support for heterogeneous sources (Hive, ES), Doris 2.0 expands to lake‑table formats such as Hudi, Iceberg, and Paimon, allowing seamless mapping of lake tables into Doris for accelerated federated queries.

Other Enhancements

Additional improvements include multi‑model column updates, high‑frequency write compaction memory optimizations, and other usability upgrades that reduce the need for manual tuning.

Overall, Doris 2.0 strengthens its position in the open‑source OLAP landscape, offering capabilities that rival many commercial cloud offerings and urging data engineers to keep pace with the rapid evolution of the data‑development stack.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

performance Big Data SQL OLAP Lakehouse Doris

Written by

Big Data Technology & Architecture

Wang Zhiwu, a big data expert, dedicated to sharing big data technology.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.