Tag

Doris


DataFunSummit
Mar 2, 2025 · Artificial Intelligence

Lightweight Algorithm Service Architecture Based on Offline Tag Knowledge Base and Real‑time Data Warehouse

This article presents a lightweight algorithm service solution that combines an offline pre‑computed tag knowledge base with a real‑time data warehouse using Flink, Doris, Hive SQL and Python to achieve short development cycles, agile iteration, low cost, and scalable deployment for classification and clustering tasks.

Doris · Flink · algorithm service
0 likes · 16 min read
DataFunSummit
Aug 26, 2024 · Big Data

Building a Doris‑Based Lakehouse Integrated Analytics System at Kuaishou

This article presents Kuaishou's experience of designing and implementing a Doris‑driven lakehouse integrated analytics system, covering the current OLAP landscape, challenges of data duplication and governance, the new architecture with caching and auto‑materialization, implementation details, performance impact, and future work.

Auto Materialization · Big Data · Data Warehouse
0 likes · 24 min read
DataFunSummit
Dec 16, 2023 · Databases

Optimizing Precise Deduplication with Doris Bitmap: Architecture, Performance Enhancements, and Practical Practices

This article presents a comprehensive overview of precise deduplication in Meituan's Doris database, detailing the underlying bitmap data structures, aggregation bottlenecks, and a series of optimizations—including memory management, fast union, orthogonal encoding, and vectorized engine integration—that together achieve significant performance gains in high‑cardinality scenarios.

BitMap · Database · Deduplication
0 likes · 20 min read
DataFunTalk
Dec 15, 2023 · Big Data

Zhihu Bridge Platform: Internal Marketing Architecture, Challenges, and Optimizations

This article presents a comprehensive overview of Zhihu's Bridge Platform internal marketing module, detailing its background, business logic, product components such as CDP, activity and delivery platforms, architectural layers, performance bottlenecks, optimization techniques—including distributed transactions, bitmap indexing, and vectorized query execution—and future directions toward marketing automation and intelligence.

Big Data · CDP · Doris
0 likes · 28 min read
DataFunTalk
Sep 24, 2023 · Databases

Insights into the Design and Challenges of Doris' New Optimizer (Nereids)

The article explains why Doris needed a new optimizer, describes its architecture—including rule‑based and cost‑based stages, early data‑size reduction techniques, dynamic‑programming join‑reorder methods, and practical challenges such as statistics errors and runtime filters—while sharing performance results and a Q&A session.

Cost-Based Optimization · Database Performance · Doris
0 likes · 17 min read
DataFunTalk
Aug 28, 2023 · Big Data

Practical Experience of an E‑commerce Platform’s Offline and Real‑time Data Warehouse

This article shares the practical architecture, technology selection, implementation details, and evolution of an e‑commerce platform’s offline and real‑time data warehouses, covering data modeling, processing pipelines, system components such as Hive, Spark, Flink, ClickHouse, Doris, and Hudi, and the lessons learned from multiple production deployments.

Big Data · ClickHouse · Data Warehouse
0 likes · 18 min read
DataFunTalk
Aug 26, 2023 · Big Data

Ensuring Doris Stability in HuoLala's Big Data Platform: Practices and Lessons

This article presents HuoLala's practical approach to guaranteeing the stability of the Doris OLAP engine within its large‑scale big data platform, covering background, challenges, case studies, capability building, process standards, and future planning.

Automation · Big Data · Capacity Planning
0 likes · 12 min read
ByteDance Data Platform
May 29, 2023 · Databases

Which Open‑Source OLAP Engine Wins the TPC‑DS Benchmark? A Deep Performance Comparison

Using the TPC‑DS benchmark’s 99 queries on a 1 TB dataset, this study evaluates the performance of four open‑source OLAP engines—ClickHouse, Doris, Presto, and ByConity—across basic, join, aggregation, subquery, and window‑function scenarios, revealing ByConity’s superior speed and the limitations of ClickHouse.

ByConity · ClickHouse · Doris
0 likes · 12 min read
DataFunTalk
Dec 19, 2022 · Big Data

Evolution of OLAP: Key Technologies, Engine Comparison, and Future Trends

This article provides a comprehensive overview of OLAP technology evolution, covering its origins, modern requirements for massive and real‑time data, detailed comparisons of major open‑source OLAP engines such as Druid, Elasticsearch, Kylin, Doris/StarRocks, and ClickHouse, core architectural and storage techniques, and emerging trends like federated queries, hybrid storage, and lakehouse integration.

Big Data · ClickHouse · Doris
0 likes · 22 min read
DataFunSummit
Nov 2, 2022 · Big Data

Evolution and Construction of Huolala's Doris‑Based OLAP System

This article details Huolala's journey from a MySQL‑centric analytics pipeline to a multi‑engine OLAP platform built on Doris, covering system architecture, data flow, stage‑wise evolution, engine selection, POC validation, performance tuning, stability measures, and future roadmap for self‑service analytics.

Big Data · Data Engineering · Doris
0 likes · 15 min read
DataFunTalk
Sep 29, 2022 · Databases

Applying Doris OLAP Data Warehouse in NIO Automotive: Architecture, Evaluation, and Practices

This technical presentation details NIO's evolution of OLAP solutions—from Druid and TiDB to Doris—explaining the selection criteria, Doris's advantages as a unified OLAP warehouse, its role in the CDP platform, practical deployment experiences, and lessons learned from real‑world usage.

Big Data · CDP · Data Warehouse
0 likes · 15 min read
DataFunTalk
Sep 22, 2022 · Big Data

Architecture and Practices of Zhihu DMP System Based on Doris

This article presents a comprehensive overview of Zhihu's Data Management Platform (DMP), covering its business background, three core business modes, detailed architecture, offline and real‑time data pipelines, feature storage design, performance optimization techniques, and future iteration directions.

Big Data · DMP · Doris
0 likes · 14 min read
DataFunTalk
Sep 1, 2022 · Big Data

Evolution and Construction of Huolala's OLAP System Based on Doris

This presentation details Huolala's journey from its initial OLAP architecture to a multi‑engine platform, describing background, data‑flow layers, technical research, engine selection (Druid, ClickHouse, Doris), POC validation, performance tuning, stability measures, production rollout, problem analysis, and future roadmap.

Big Data · ClickHouse · Data Warehouse
0 likes · 17 min read
DataFunSummit
Oct 16, 2021 · Databases

Practical Use Cases of Materialized Views and Indexes in Doris

This article shares practical experiences with Doris, covering materialized view concepts, typical use cases, index principles, performance optimizations, and real‑world scenarios such as order analysis, PV/UV aggregation, and detailed queries, while also providing operational tips and Q&A insights.

Big Data · Doris · Index
0 likes · 16 min read
DataFunTalk
Sep 23, 2021 · Databases

Practical Use Cases of Materialized Views and Indexes in Doris

This article shares practical experiences with Doris, covering materialized view concepts, typical use cases, advantages, creation syntax, prefix index principles, performance‑boosting scenarios such as order analysis, PV/UV counting, detail queries, and operational tips for high‑throughput and low‑latency workloads.

Big Data · Doris · Index
0 likes · 18 min read
DataFunTalk
Sep 4, 2021 · Big Data

High‑Availability Practices of ClickHouse in JD.com: Architecture, Deployment, and Operations

The article details JD.com’s large‑scale OLAP strategy using ClickHouse as the primary engine and Doris as a secondary engine, covering application scenarios, component selection criteria, cluster deployment models, high‑availability architecture, fault‑handling procedures, performance tuning, and future cloud‑native plans.

Big Data · ClickHouse · Cluster Deployment
0 likes · 19 min read
DataFunSummit
Aug 15, 2021 · Big Data

Building a General Real-Time Data Warehouse: Methods and Practices at Meituan Waimai

This article introduces a universal method for building a real-time data warehouse at Meituan Waimai, covering streaming technologies, architecture choices such as Lambda and Kappa, component design, feature production, SLA management, and practical OLAP solutions using Flink, Storm, and Doris.

Doris · Flink · Kappa architecture
0 likes · 15 min read
JD Retail Technology
Jun 9, 2021 · Big Data

JD OLAP High‑Availability Practices: ClickHouse and Doris Deployment, Architecture, and Future Plans

This article details JD's OLAP implementation using ClickHouse as the primary engine and Doris as a secondary engine, covering business scenarios, selection criteria, multi‑tenant deployment, high‑availability architecture, encountered challenges, and future roadmap for cloud‑native, scalable analytics.

ClickHouse · Cloud Native · Cluster Management
0 likes · 17 min read
Baidu Geek Talk
May 24, 2021 · Big Data

Real-Time Quantile Computation Using TDigest: Architecture and Solutions

The article presents a real‑time quantile solution using the TDigest data structure, which clusters data into centroids and stores digests in Redis or Doris, pre‑computes quantiles for all dimension combinations, and provides a reusable API that delivers fast, accurate, low‑memory quantile statistics for diverse business scenarios.

Big Data · Doris · Redis
0 likes · 11 min read
DataFunTalk
May 9, 2021 · Big Data

User Segmentation and Growth Practices for Mini‑Programs Based on Doris

This article presents a comprehensive case study of how Baidu’s senior R&D engineer Zhao Yuyang built a Doris‑based user‑segmentation system for mini‑programs, detailing the product’s private‑domain fine‑grained operation capabilities, the four technical challenges, the architecture and solutions—including global dictionaries, bitmap storage, partitioning, tag optimization, dynamic‑static query handling, and rapid user‑package generation—along with future roadmap plans.

Big Data · Data Engineering · Doris
0 likes · 20 min read