Tag

OLAP


Alimama Tech
Apr 10, 2025 · Big Data

Performance Optimization of Apache Paimon in Dolphin OLAP Engine

The article details how Apache Paimon, integrated as an external table format in Alibaba’s Dolphin OLAP engine, achieves millisecond‑level query latency and up to 10k QPS through ORC push‑down, manifest conversion, caching, concurrency, and encoding optimizations, outperforming StarRocks and Hologres.

Java · OLAP · Paimon
0 likes · 17 min read
JD Retail Technology
Apr 8, 2025 · Databases

ClickHouse Architecture and Core Technologies Overview

ClickHouse is an open‑source, massively parallel, column‑oriented OLAP database that integrates its own columnar storage, vectorized batch processing, pre‑sorted data, diverse table engines, extensive data types, sharding with replication, sparse primary‑key and skip indexes, and a multithreaded query engine, delivering high‑throughput real‑time analytics on massive datasets.

Big Data · ClickHouse · OLAP
0 likes · 15 min read
ByteDance Data Platform
Jan 9, 2025 · Databases

Why ByteHouse’s GIS Engine Beats Traditional Spatial Databases in Real‑World Analytics

This article explains how ByteHouse integrates high‑performance GIS capabilities into its OLAP engine, describes its spatial indexing architecture, showcases benchmark results against ClickHouse, StarRocks, PostGIS and DuckDB using the NYC Taxi dataset, and outlines when to choose ByteHouse versus other spatial database solutions.

ByteHouse · Database Performance · GIS
0 likes · 11 min read
Rare Earth Juejin Tech Community
Jan 8, 2025 · Databases

ByteHouse GIS: High‑Performance Geospatial Analytics and Benchmark Comparison with ClickHouse, StarRocks, PostGIS, and DuckDB

The article explains ByteHouse's GIS capabilities, describing its R‑Tree and Google S2 spatial index implementation, OGC‑compatible data types and functions, and presents benchmark results that show ByteHouse outperforming ClickHouse, StarRocks, PostGIS, and DuckDB on key geospatial queries.

ByteHouse · GIS · OLAP
0 likes · 13 min read
JD Tech Talk
Dec 26, 2024 · Databases

Using ClickHouse for Efficient Tag Bitmap Storage and Group Computation in a CDP

This article explains how ClickHouse’s columnar storage, bitmap functions, and distributed architecture can be leveraged to store billions of tag bitmaps, combine them efficiently, and support fast group calculations for customer data platforms, while addressing data‑warehouse integration, storage format, and performance challenges.

Bitmap · CDP · ClickHouse
0 likes · 10 min read
Architecture Digest
Nov 3, 2024 · Backend Development

Using Easy-Query ORM for Strongly Typed OLTP and OLAP Queries in Java

This article introduces Easy-Query, a Java ORM that offers strongly typed OLTP and OLAP query capabilities, demonstrates how to define entity classes with many‑to‑many and one‑to‑one relationships, and provides multiple code examples for complex queries, DTO generation, and automatic inclusion of related data.

Easy-Query · Java · OLAP
0 likes · 11 min read
ByteDance Data Platform
Oct 16, 2024 · Databases

How ByteHouse Boosted Sales Data Platform Queries Up to 16× with ACL and Optimizer

This article examines a fast‑growing company's sales data platform, outlines the data‑access pain points caused by ACL permissions, describes the migration from ClickHouse to ByteHouse, details the optimizer’s rule‑based, cost‑based, and distributed‑plan enhancements, and presents benchmark results showing query speedups of up to sixteen times.

ACL · ByteHouse · OLAP
0 likes · 16 min read
DataFunSummit
Aug 26, 2024 · Big Data

Building a Doris‑Based Lakehouse Integrated Analytics System at Kuaishou

This article presents Kuaishou's experience in designing and implementing a Doris‑driven lakehouse integrated analytics system, covering the current OLAP landscape, the challenges of data duplication and governance, the new architecture with caching and auto‑materialization, implementation details, performance impact, and future work.

Auto Materialization · Big Data · Caching
0 likes · 24 min read
Bilibili Tech
Aug 23, 2024 · Big Data

Accelerating Multi‑Dimensional OLAP Queries in ClickHouse with Grouping Sets, RBM, and Dense Dictionary Encoding

To achieve sub‑second, multi‑dimensional analytics on Bilibili’s hundred‑million‑row datasets, the team built a ClickHouse‑based acceleration layer that combines grouping‑set pre‑aggregation, bitmap (RBM) distinct handling, and a dense dictionary encoding service, dramatically cutting CPU, memory and query latency versus traditional OLAP pipelines.

Big Data · Bitmap · ClickHouse
0 likes · 28 min read
DataFunTalk
Aug 6, 2024 · Fundamentals

Solving Massive Data Retrieval Demands: From Problem Causes to OLAP Multidimensional Reporting Solutions

This article analyzes why data engineers face endless data‑extraction requests, identifies common missteps in data‑construction practices, and proposes a comprehensive solution based on dimensional modeling, OLAP multidimensional reporting, self‑service tools, and knowledge empowerment to dramatically improve efficiency and scalability.

Big Data · Data Engineering · Data Warehouse
0 likes · 12 min read
DataFunTalk
Jul 10, 2024 · Big Data

Apache SeaTunnel: A Next‑Generation Data Integration Platform for ETL/ELT and OLAP

This article introduces Apache SeaTunnel, a modern data integration platform designed for the EtLT era, detailing its architecture, core connector APIs, checkpoint mechanism, model inference, multi‑table synchronization, the high‑performance SeaTunnel Zeta engine, OLAP use cases, community roadmap, and the commercial WhaleTunnel product.

Apache SeaTunnel · Big Data · ELT
0 likes · 22 min read
Beijing SF i-TECH City Technology Team
Jun 18, 2024 · Big Data

Apache Kylin in Logistics: Optimizing OLAP for Big Data Analytics

This article discusses the implementation of Apache Kylin as an OLAP engine for logistics data, focusing on optimizing cube building and query performance to handle large-scale, high-dimensional data analytics.

Apache Kylin · Big Data · Cube Building
0 likes · 15 min read
JD Retail Technology
Jun 5, 2024 · Big Data

Design and Implementation of a Configurable OLAP Framework for Rapid Development

This article presents a configurable OLAP framework that streamlines multi‑dimensional, multi‑metric analysis by separating core and interaction domains, using metadata‑driven modeling, runtime request handling, and Maven‑based integration to accelerate development and improve query performance.

Configuration · Data Warehouse · Java
0 likes · 10 min read
DataFunTalk
Apr 15, 2024 · Databases

ByteHouse Cloud‑Native Data Warehouse Performance Whitepaper: Architecture, Optimizations, and Benchmark Results

The ByteHouse performance whitepaper details the cloud‑native data warehouse’s architecture, rule‑based and cost‑based optimizer enhancements, exchange runtime, runtime filters, parallelism and wide‑table optimizations, and presents benchmark comparisons on TPC‑DS, TPC‑H and SSB datasets demonstrating orders‑of‑magnitude query speed improvements.

ByteHouse · Data Warehouse · OLAP
0 likes · 17 min read
DataFunSummit
Mar 12, 2024 · Big Data

Solving Massive Data Retrieval Demands: From Root Causes to OLAP Multidimensional Reporting Solutions

This article analyzes why data engineers face endless data‑retrieval requests, identifies common missteps in data construction (demand‑driven development, and a lack of modeling and OLAP concepts), and proposes a data warehouse built on dimensional modeling, with OLAP reporting, tooling, and knowledge sharing to break the cycle.

Big Data · Data Engineering · Data Warehouse
0 likes · 13 min read
ByteDance Data Platform
Mar 6, 2024 · Databases

How ByteHouse Boosted Douyin’s Interest Circle Queries by 100×

This article explains how Douyin rebuilt its interest‑circle platform by replacing MySQL with the columnar OLAP engine ByteHouse, achieving roughly a hundred‑fold improvement in query speed, lower hardware costs, and seamless horizontal scalability for massive daily data volumes.

ByteHouse · ColumnarStorage · DataWarehouse
0 likes · 10 min read
DataFunSummit
Feb 29, 2024 · Big Data

Trino at Xiaomi: Architecture, Practices, and Future Plans

This article details Xiaomi’s practical deployment of Trino, covering its architectural role, core and extended capabilities, performance comparisons, integration with Iceberg and Spark, operational enhancements, multi‑cluster and ad‑hoc query scenarios, future cloud‑storage plans, and a Q&A session.

Big Data · Iceberg · OLAP
0 likes · 20 min read
Didi Tech
Feb 27, 2024 · Big Data

Real-time Precise Deduplication Using StarRocks Materialized Views at Didi

Didi uses StarRocks materialized views with a global dictionary and bitmap aggregation to perform real‑time, precise deduplication on high‑cardinality data. Queries are rewritten and views are refreshed automatically, which cuts query latency by ~80%, reduces resource use by ~95%, and raises concurrent QPS by up to 100‑fold. The team plans further automation and bitmap optimizations.

Big Data · Materialized Views · OLAP
0 likes · 19 min read
DataFunTalk
Feb 27, 2024 · Big Data

Best Practices of Cloud‑Native OLAP Architecture and Logistics Warning at Jushuitan

This article presents Jushuitan's cloud‑native OLAP architecture, detailing its evolution, current big‑data stack—including DataWorks, MaxCompute, Flink, Hologres, and Aerospike—along with logistics warning workflows, rule‑matching mechanisms, real‑time processing challenges, and future scalability plans.

Big Data · Data Warehouse · Hologres
0 likes · 20 min read