Tagged articles
219 articles
Page 2 of 3
StarRocks
StarRocks
Mar 19, 2024 · Databases

How StarRocks Powers Data‑Driven Financial Marketing at Ping An Bank

This article explains how Ping An Bank transformed its retail finance model from product‑centric to customer‑centric using a five‑in‑one data‑driven approach, the KYC/KYP/KYATO methodology, and the StarRocks analytics platform to build the Smart Bank 3.0 architecture, CDP, and real‑time metric layers.

Big DataCustomer 360Financial Marketing
0 likes · 14 min read
How StarRocks Powers Data‑Driven Financial Marketing at Ping An Bank
DataFunSummit
DataFunSummit
Mar 14, 2024 · Big Data

Tencent Game Data Analysis: Lakehouse Integration Practice

This article presents Tencent Game's comprehensive lakehouse integration practice, detailing the project background, storage‑compute separation, data layering, unified DDL/DML operations, performance optimizations, and future plans, illustrating how StarRocks, Iceberg, and Spark are combined to achieve scalable, cost‑effective analytics for massive game data.

Compute-Storage SeparationIcebergLakehouse
0 likes · 16 min read
Tencent Game Data Analysis: Lakehouse Integration Practice
StarRocks
StarRocks
Mar 13, 2024 · Databases

Master StarRocks: Simplify Partitioning, Data Import, and Table Optimization

This guide walks you through using StarRocks—covering effortless expression‑based partitioning, streamlined data loading with INSERT FROM FILES and PIPE, powerful in‑flight data transformation using SELECT/JOIN/UNNEST, and flexible table structure tweaks via ALTER TABLE to boost query performance.

PartitioningPipeStarRocks
0 likes · 16 min read
Master StarRocks: Simplify Partitioning, Data Import, and Table Optimization
StarRocks
StarRocks
Feb 29, 2024 · Databases

How a Student Built an ORC Chunk Writer for StarRocks: Insights from Open Source Summer

In this interview, graduate student Sun Yinzhen shares how he selected, designed, and implemented an ORC Chunk Writer for the StarRocks database during the Open Source Summer program, detailing the technical challenges, learning outcomes, and his perspective on open‑source contributions for computer science students.

ORCStarRocksStudent Contribution
0 likes · 12 min read
How a Student Built an ORC Chunk Writer for StarRocks: Insights from Open Source Summer
Didi Tech
Didi Tech
Feb 27, 2024 · Big Data

Real-time Precise Deduplication Using StarRocks Materialized Views at Didi

Didi leverages StarRocks materialized views with a global dictionary and bitmap aggregation to perform real‑time, high‑cardinality precise deduplication, automatically rewriting queries and refreshing views, cutting query latency by ~80%, reducing resource use ~95%, and boosting concurrent QPS up to 100‑fold, while planning further automation and bitmap optimizations.

Big DataMaterialized ViewsOLAP
0 likes · 19 min read
Real-time Precise Deduplication Using StarRocks Materialized Views at Didi
StarRocks
StarRocks
Feb 27, 2024 · Databases

How StarRocks Materialized Views Enable High‑Concurrency Precise Deduplication

StarRocks’ materialized view feature lets Didi replace costly fuzzy deduplication with precise, high‑concurrency deduplication for real‑time dashboards, using global dictionary mapping, layered ODS/DWD/ADS views, synchronous and asynchronous refreshes, and transparent query rewrite to cut query latency by 80% and boost QPS dramatically.

Big DataMaterialized ViewsOLAP
0 likes · 20 min read
How StarRocks Materialized Views Enable High‑Concurrency Precise Deduplication
DataFunSummit
DataFunSummit
Feb 26, 2024 · Big Data

Building a New Lakehouse Analytics Paradigm with StarRocks and Paimon

This article introduces a new lakehouse analytics paradigm by combining StarRocks and Paimon, covering the evolution of data lake technologies, key integration scenarios, core technical mechanisms such as JNI connectors, materialized views, and future roadmap for enhanced lakehouse capabilities.

AnalyticsBig DataData Lake
0 likes · 16 min read
Building a New Lakehouse Analytics Paradigm with StarRocks and Paimon
DataFunSummit
DataFunSummit
Feb 1, 2024 · Databases

StarRocks 3.0 Storage‑Compute Separation Architecture: Design, Implementation, and Evaluation

This article explains the storage‑compute separation architecture introduced in StarRocks 3.0, presents industry case studies, details the design of StarOS and compute nodes, discusses technical challenges and key techniques, and evaluates cost, reliability, elasticity, and performance through benchmarks and user feedback.

Cloud NativePerformance EvaluationStarRocks
0 likes · 11 min read
StarRocks 3.0 Storage‑Compute Separation Architecture: Design, Implementation, and Evaluation
Sohu Tech Products
Sohu Tech Products
Jan 31, 2024 · Industry Insights

How Didi Scaled Real‑Time Dashboards with StarRocks Materialized Views

This article details Didi's evolution from a multi‑engine OLAP stack to a unified StarRocks solution, explains the design of global dictionaries and materialized views for real‑time dashboard acceleration, and shares performance results, challenges, and future optimization directions.

Big DataDidiMaterialized Views
0 likes · 19 min read
How Didi Scaled Real‑Time Dashboards with StarRocks Materialized Views
StarRocks
StarRocks
Jan 30, 2024 · Big Data

How InLong Guarantees Exactly‑Once Real‑Time Writes to StarRocks

This article explains how Apache InLong provides automatic, secure, high‑performance real‑time data transfer to StarRocks, detailing the transactional Stream Load API, the two‑phase commit process, Flink‑based ingestion architecture, exactly‑once guarantees, and performance test results across different parallelism levels.

Big DataExactly-OnceInLong
0 likes · 11 min read
How InLong Guarantees Exactly‑Once Real‑Time Writes to StarRocks
Big Data Technology & Architecture
Big Data Technology & Architecture
Jan 29, 2024 · Databases

Practical Experience of StarRocks Materialized Views at Didi

This article details Didi's evolution of OLAP systems, the adoption of StarRocks for high‑performance MPP analytics, and how materialized views, global dictionary mapping, and transparent acceleration were engineered to boost real‑time dashboard queries while outlining performance gains, challenges, and future optimization plans.

Big DataDidiOLAP
0 likes · 16 min read
Practical Experience of StarRocks Materialized Views at Didi
DataFunTalk
DataFunTalk
Jan 28, 2024 · Databases

Practical Experience of StarRocks Materialized Views at Didi

This article presents Didi's practical experience with StarRocks materialized views, covering the evolution of its OLAP architecture, the challenges of previous engines, the adoption of StarRocks, the design of materialized view acceleration for real‑time dashboards, and future optimization directions.

Big DataData PlatformOLAP
0 likes · 17 min read
Practical Experience of StarRocks Materialized Views at Didi
StarRocks
StarRocks
Jan 10, 2024 · Big Data

How Tencent Built the ABetterChoice SaaS A/B Testing Platform for Global Games

In 2022 Tencent's A/B Test team created the overseas SaaS product ABetterChoice, abstracting internal experiment capabilities, adapting to multi‑cloud compliance, and unifying computation with StarRocks, enabling game titles like Honor of Kings, PUBG Mobile, and Ubisoft to run scalable, compliant A/B experiments worldwide.

A/B testingData LakeExperiment Platform
0 likes · 14 min read
How Tencent Built the ABetterChoice SaaS A/B Testing Platform for Global Games
Weimob Technology Center
Weimob Technology Center
Jan 2, 2024 · Big Data

How to Efficiently Test BI Reports in a Hive‑StarRocks Data Warehouse

This article details practical methods for testing BI reports built on Hive and StarRocks, covering the report creation workflow, testing characteristics, SQL writing techniques, impact analysis, data warehouse simplification, and the application of data quality tools to ensure accurate and efficient reporting.

BI testingData QualityStarRocks
0 likes · 9 min read
How to Efficiently Test BI Reports in a Hive‑StarRocks Data Warehouse
Zhuanzhuan Tech
Zhuanzhuan Tech
Dec 27, 2023 · Big Data

Implementing Self-Service OLAP Analytics with Quick BI and StarRocks: Architecture, Optimizations, and Lessons Learned

This article presents a comprehensive case study of building a self‑service OLAP analytics platform at ZhaiZhai using Quick BI and StarRocks, covering background motivations, technical architecture, implementation details, performance‑optimizing case studies, and the resulting business impact.

OLAPQuick BISelf-Service Analytics
0 likes · 16 min read
Implementing Self-Service OLAP Analytics with Quick BI and StarRocks: Architecture, Optimizations, and Lessons Learned
Tongcheng Travel Technology Center
Tongcheng Travel Technology Center
Dec 27, 2023 · Big Data

Recap of Tongcheng Travel’s 7th Big Data Technology Salon – Talks on StarRocks, Paimon, Iceberg, Data+AI, Vector Retrieval, Real‑Time Computing, and Hotel Ranking

The 7th Tongcheng Travel Big Data Technology Salon in Beijing featured a series of expert talks covering StarRocks architecture evolution, lake‑house solutions with Paimon, Iceberg real‑time upsert, Data+AI for travel recommendation, vector retrieval in AI, JD Logistics real‑time computing governance, and multi‑task hotel ranking modeling, providing deep technical insights and future roadmaps.

AIBig DataLakehouse
0 likes · 10 min read
Recap of Tongcheng Travel’s 7th Big Data Technology Salon – Talks on StarRocks, Paimon, Iceberg, Data+AI, Vector Retrieval, Real‑Time Computing, and Hotel Ranking
StarRocks
StarRocks
Dec 22, 2023 · Databases

What’s New in StarRocks 3.2? Key Features and Usability Enhancements

StarRocks 3.2, released on December 21, 2023, introduces major usability upgrades—including optimized random bucketing, fast schema evolution, PIPE import, HTTP SQL API, runtime profiling, enhanced storage‑compute separation, data lake analysis, and advanced materialized view capabilities—while refining existing features such as indexing, catalog support, and export syntax.

Release NotesStarRocksStorage Compute Separation
0 likes · 15 min read
What’s New in StarRocks 3.2? Key Features and Usability Enhancements
StarRocks
StarRocks
Dec 19, 2023 · Big Data

How WeChat Achieved Sub‑Second Real‑Time Analytics with StarRocks Lakehouse

WeChat transformed its data platform from Hadoop and ClickHouse to a StarRocks‑based lakehouse, tackling massive data volume, ultra‑low latency, and storage fragmentation by deploying lake‑on‑warehouse and warehouse‑lake fusion architectures, real‑time incremental materialized views, and unified SQL access, resulting in dramatic cost cuts and performance gains.

Big DataLakehouseStarRocks
0 likes · 15 min read
How WeChat Achieved Sub‑Second Real‑Time Analytics with StarRocks Lakehouse
StarRocks
StarRocks
Dec 12, 2023 · Databases

How StarRocks Enables Real-Time Updates in Analytical Databases

The article explains why analytical databases struggle with real‑time data changes due to columnar storage, complex indexes and distributed processing, and then details StarRocks' primary‑key model, adaptive update mode, bitmap indexes, row/column partial updates, and practical SQL upsert techniques to achieve low‑latency updates without sacrificing query performance.

Analytical DatabasePartial UpdateReal-Time Update
0 likes · 15 min read
How StarRocks Enables Real-Time Updates in Analytical Databases
DataFunSummit
DataFunSummit
Dec 11, 2023 · Big Data

Design and Implementation of a Big Data Metadata Warehouse at Bilibili

This article presents Bilibili's big‑data metadata warehouse, covering its background, technology selection between data‑lake and data‑warehouse solutions, the architecture built on Prometheus, StarRocks, Flink and Routine Load, performance comparisons, diagnostic system design, and future development plans.

FlinkMetadata WarehouseStarRocks
0 likes · 20 min read
Design and Implementation of a Big Data Metadata Warehouse at Bilibili
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 8, 2023 · Cloud Computing

How Alibaba Cloud EMR Powers Serverless StarRocks for Seamless Lakehouse Analytics

This article summarizes Li Yu's presentation on Alibaba Cloud EMR's deep collaboration with the StarRocks community, detailing major contributions across versions, the serverless StarRocks product’s core capabilities, and future plans to enhance OLAP‑lakehouse integration, performance, and cloud‑native elasticity.

Alibaba CloudEMRLakehouse
0 likes · 7 min read
How Alibaba Cloud EMR Powers Serverless StarRocks for Seamless Lakehouse Analytics
政采云技术
政采云技术
Nov 28, 2023 · Databases

Deploying StarRocks 3 on ARM Architecture Using Docker

This guide explains how to deploy the high‑performance MPP database StarRocks 3 on ARM servers by using Docker images, configuring FE and BE nodes, adjusting Java versions, integrating with Hadoop/Hive, and setting up monitoring and log‑cleanup scripts.

ARMDatabase DeploymentDocker
0 likes · 12 min read
Deploying StarRocks 3 on ARM Architecture Using Docker
StarRocks
StarRocks
Nov 23, 2023 · Databases

How StarRocks Redefines Lakehouse Architecture with Compute‑Storage Separation

StarRocks, an open‑source MPP analytical database, consolidates BI, interactive, and real‑time analytics into a single engine by evolving from version 1.0 to 3.x, introducing compute‑storage separation, unified catalog, generated columns, operator spill, and advanced materialized views, while outlining its cloud‑native lakehouse roadmap.

Compute-Storage SeparationLakehouseMPP database
0 likes · 22 min read
How StarRocks Redefines Lakehouse Architecture with Compute‑Storage Separation
StarRocks
StarRocks
Nov 22, 2023 · Big Data

How StarRocks’ Compute‑Storage Separation Cut Costs 46% and Boosted Performance

This article details a Chinese tech company's migration of its internal big‑data analytics platform to StarRocks’ compute‑storage separation architecture, describing the original multi‑component setup, the pain points encountered, the evaluation methodology, performance and cost benchmarks, operational optimizations, migration steps, and future roadmap.

Big DataCompute-Storage SeparationCost reduction
0 likes · 17 min read
How StarRocks’ Compute‑Storage Separation Cut Costs 46% and Boosted Performance
StarRocks
StarRocks
Nov 3, 2023 · Databases

How StarRocks’ Spill to Disk Boosts Query Stability and Performance

StarRocks introduces a spill-to-disk mechanism that writes intermediate results of heavy operators to disk, freeing memory and enabling stable execution of ETL and ad‑hoc queries, while combined with materialized views it dramatically improves query success rates and delivers up to 4.35× faster performance than Spark.

Big DataDatabase OptimizationMaterialized Views
0 likes · 10 min read
How StarRocks’ Spill to Disk Boosts Query Stability and Performance
StarRocks
StarRocks
Oct 31, 2023 · Databases

How Ctrip Accelerated Report Queries 10× with StarRocks: A Real‑World Lakehouse Migration

Ctrip migrated its Artnova reporting platform from Hive‑based queries to StarRocks, first loading data into OLAP tables and then using StarRocks as a lakehouse with Hive catalog, Data Cache and materialized views, achieving average query latency reductions from 20 seconds to 1.5 seconds, over 7× speed‑up versus Trino and up to 40× acceleration for complex workloads.

Big DataData CacheLakehouse
0 likes · 15 min read
How Ctrip Accelerated Report Queries 10× with StarRocks: A Real‑World Lakehouse Migration
Weimob Technology Center
Weimob Technology Center
Oct 13, 2023 · Big Data

Optimizing StarRocks Tables: Design Tips, Real‑World Cases and Monitoring Strategies

This article explains how to design efficient StarRocks tables with proper field types, partitioning and bucketing, compares update and primary‑key models, presents real‑world cases of memory and tablet issues, provides a complete table‑creation example, and outlines comprehensive monitoring metrics to keep the analytical data warehouse performant and stable.

AnalyticsPartitioningStarRocks
0 likes · 25 min read
Optimizing StarRocks Tables: Design Tips, Real‑World Cases and Monitoring Strategies
Sohu Tech Products
Sohu Tech Products
Oct 11, 2023 · Industry Insights

How StarRocks Materialized Views Power Real‑Time Lakehouse Analytics

The article provides a deep technical overview of StarRocks 3.0’s data‑lake analysis capabilities, its unified Lakehouse architecture, Catalog integration, Trino compatibility, extensive I/O optimizations, materialized view features, resource isolation techniques, real‑world use cases, and future development directions.

AnalyticsData LakeLakehouse
0 likes · 22 min read
How StarRocks Materialized Views Power Real‑Time Lakehouse Analytics
DataFunTalk
DataFunTalk
Sep 16, 2023 · Big Data

StarRocks Data Lake Analysis, Materialized Views, and Lakehouse Architecture

This article explains how StarRocks 3.0 extends real‑time data‑warehouse capabilities to support data‑lake analysis, external catalog integration, Trino compatibility, extensive I/O optimizations, and powerful materialized‑view features that together enable a unified, cloud‑native Lakehouse solution with high performance and flexible resource isolation.

Big DataData LakeLakehouse
0 likes · 20 min read
StarRocks Data Lake Analysis, Materialized Views, and Lakehouse Architecture
DataFunSummit
DataFunSummit
Sep 8, 2023 · Big Data

Tianqiong OLAP Real‑time Lakehouse Fusion Platform Architecture Practice

This article explains why lake‑warehouse fusion is needed, describes the challenges of integrating real‑time data warehouses with data lakes, introduces a new StarRocks‑based architecture that supports real‑time ingestion, cooling, offline loading, and adaptive hot‑cold query rewriting, and outlines future plans and Q&A.

Big DataData IntegrationLakehouse
0 likes · 21 min read
Tianqiong OLAP Real‑time Lakehouse Fusion Platform Architecture Practice
StarRocks
StarRocks
Sep 6, 2023 · Big Data

How Paimon + StarRocks Revolutionize Lakehouse Analytics

This article reviews traditional Lambda and Kappa data‑warehouse architectures, then details four Paimon‑StarRocks lakehouse solutions—including a data‑lake center, accelerated query with materialized views, hot‑cold data separation, and the JNI connector—while also outlining StarRocks’ future roadmap for lakehouse analytics.

Big DataLakehousePaimon
0 likes · 11 min read
How Paimon + StarRocks Revolutionize Lakehouse Analytics
StarRocks
StarRocks
Aug 24, 2023 · Databases

How StarRocks Boosted Query Speed 3‑10× for a Billion‑Scale Reporting Platform

Facing massive daily query loads, Wanwu Newborn’s Watcher reporting platform migrated from MySQL, Greenplum, and Trino to StarRocks, cutting compute nodes by half while achieving 3‑10× faster query performance, higher success rates, and lower cost, as demonstrated by TPC‑DS and real‑world business query benchmarks.

OLAPStarRocksmigration
0 likes · 14 min read
How StarRocks Boosted Query Speed 3‑10× for a Billion‑Scale Reporting Platform
StarRocks
StarRocks
Aug 22, 2023 · Databases

How StarRocks Query Cache Supercharges High‑Concurrency Aggregations

StarRocks introduces a Query Cache that stores intermediate aggregation results in memory, enabling reuse across semantically equivalent, partition‑overlapping, or append‑only queries, which can boost query performance by 3‑17× in high‑concurrency scenarios while reducing CPU and disk load.

MPP databaseStarRocksaggregation
0 likes · 13 min read
How StarRocks Query Cache Supercharges High‑Concurrency Aggregations
StarRocks
StarRocks
Aug 9, 2023 · Databases

StarRocks 3.1 Highlights: Faster Lakehouse Analytics and Advanced Materialized Views

StarRocks 3.1 introduces a cloud‑native, lakehouse‑oriented architecture with enhanced storage‑compute separation, up to 3‑6× faster data‑lake queries than Trino/Presto, expanded Iceberg and Paimon support, richer materialized view capabilities, new random bucketing, expression partitioning, generated columns, and spill‑to‑disk stability, all backed by extensive performance optimizations and open‑source contributions.

Data LakeLakehouseMaterialized Views
0 likes · 17 min read
StarRocks 3.1 Highlights: Faster Lakehouse Analytics and Advanced Materialized Views
StarRocks
StarRocks
Jul 27, 2023 · Cloud Native

Deploy StarRocks Quickly with Docker and Kubernetes: Step‑by‑Step Guide

This guide explains how to set up a StarRocks cluster using Docker for rapid testing and the StarRocks Kubernetes Operator for production‑grade deployments, covering architecture basics, required tools, command‑line steps, YAML configuration, and connection methods for both internal and external access.

DockerKubernetesStarRocks
0 likes · 11 min read
Deploy StarRocks Quickly with Docker and Kubernetes: Step‑by‑Step Guide
StarRocks
StarRocks
Jun 29, 2023 · Big Data

How StarRocks Boosted Mango TV’s Data Platform Performance by Over 10×

Mango TV replaced its fragmented EMR‑Hive‑Kudu‑Presto stack with a unified StarRocks lakehouse, simplifying architecture, cutting operational costs, and achieving more than a ten‑fold increase in query speed while supporting real‑time analytics, materialized views, bitmap indexing, and store‑compute separation.

Big DataBitmap IndexMaterialized Views
0 likes · 14 min read
How StarRocks Boosted Mango TV’s Data Platform Performance by Over 10×
Ctrip Technology
Ctrip Technology
Jun 15, 2023 · Databases

Rebuilding Ctrip Train Ticket Metrics Platform with StarRocks: Architecture, Data Synchronization, and Performance Gains

The article details how Ctrip's train ticket business revamped its multi‑engine OLAP metrics platform by consolidating to the StarRocks MPP database, describing the new architecture, query workflow, data synchronization strategies, practical lessons, and the resulting dramatic improvement in query latency and reliability.

ETLMetrics PlatformStarRocks
0 likes · 15 min read
Rebuilding Ctrip Train Ticket Metrics Platform with StarRocks: Architecture, Data Synchronization, and Performance Gains
StarRocks
StarRocks
Jun 2, 2023 · Databases

How Tongcheng Travel Scaled Real‑Time Analytics with StarRocks

Tongcheng Travel migrated its multi‑stage OLAP platform from Druid/Kylin and ClickHouse‑Greenplum to a unified StarRocks solution, dramatically improving real‑time query latency, offline report performance, and CDP data processing while reducing operational complexity and enabling cloud‑native deployment.

OLAPStarRocks
0 likes · 14 min read
How Tongcheng Travel Scaled Real‑Time Analytics with StarRocks
DataFunSummit
DataFunSummit
May 30, 2023 · Big Data

DataFunCon Conference – OLAP, StarRocks, ClickHouse, and ByteHouse Technical Sessions

The DataFunCon conference showcases leading experts from Ctrip, Didi, Bilibili, and ByteDance presenting next‑generation OLAP technologies such as StarRocks, ClickHouse, and ByteHouse, covering architecture, materialized views, ELT practices, and performance optimization to guide practitioners in big‑data platform selection and implementation.

ByteHouseOLAPStarRocks
0 likes · 7 min read
DataFunCon Conference – OLAP, StarRocks, ClickHouse, and ByteHouse Technical Sessions
StarRocks
StarRocks
May 26, 2023 · Big Data

How SeaTunnel’s StarRocks Connector Enables High‑Performance Data Sync

This article explains SeaTunnel’s architecture and its StarRocks connector, detailing source and sink features such as field projection, predicate push‑down, parallel reading, state recovery, data type mapping, Stream Load writes, CDC support, configuration examples, and future roadmap for exactly‑once semantics.

Big DataConnectorData Integration
0 likes · 16 min read
How SeaTunnel’s StarRocks Connector Enables High‑Performance Data Sync
Tongcheng Travel Technology Center
Tongcheng Travel Technology Center
May 17, 2023 · Databases

StarRocks Production Practice at Tongcheng Travel: Architecture, Use Cases, and Technical Evaluation

This article details Tongcheng Travel’s production deployment of the StarRocks OLAP database, covering background, business scenarios, technical evaluation against ClickHouse and Greenplum, implementation with Flink SQL, real‑time analytics, offline reporting, CDP use cases, performance optimizations, and future cloud‑native plans.

Big DataFlinkOLAP
0 likes · 12 min read
StarRocks Production Practice at Tongcheng Travel: Architecture, Use Cases, and Technical Evaluation
StarRocks
StarRocks
May 16, 2023 · Databases

How StarRocks’ Compute‑Storage Separation Cuts Costs and Boosts Query Efficiency

This article explains how StarRocks’ new compute‑storage separation architecture reduces storage expenses and improves analytical performance by leveraging hot‑cold data segregation, elastic scaling, caching strategies, multi‑version storage, and optimized compaction, illustrated with real‑world log and e‑commerce workload examples.

Compute-Storage SeparationCost reductionMPP database
0 likes · 18 min read
How StarRocks’ Compute‑Storage Separation Cuts Costs and Boosts Query Efficiency
StarRocks
StarRocks
Apr 23, 2023 · Databases

Why Query Performance Optimization Matters and How to Master It

This guide explains the importance of query performance optimization for database products and engineers, outlines latency and throughput goals, shows how to locate bottlenecks with observability tools and Linux profilers, and provides practical high‑level and low‑level optimization techniques along with testing best practices.

BenchmarkingCPU profilingStarRocks
0 likes · 16 min read
Why Query Performance Optimization Matters and How to Master It
DataFunTalk
DataFunTalk
Apr 13, 2023 · Big Data

Four Paradigms of StarRocks Lakehouse Integration and an Overview of StarRocks 3.0

This article explains why lake‑warehouse integration is needed, outlines its challenges, describes StarRocks' four integration paradigms—including query acceleration, layered modeling, real‑time warehouse‑lake fusion, and the cloud‑native 3.0 solution—and previews the upcoming StarRocks 3.0 release.

Big DataCloud NativeData Lake
0 likes · 18 min read
Four Paradigms of StarRocks Lakehouse Integration and an Overview of StarRocks 3.0
dbaplus Community
dbaplus Community
Apr 11, 2023 · Big Data

How Autohome Built a Flink‑StarRocks Real‑Time Ad Data Warehouse

This article details Autohome's transition from an hourly offline ad data warehouse to a Flink‑StarRocks real‑time architecture, covering background, engine and storage selection, multi‑layer design, implementation steps, encountered issues, monitoring strategies, and future roadmap to achieve second‑level data freshness and high accuracy.

AdvertisingFlinkReal-time Streaming
0 likes · 12 min read
How Autohome Built a Flink‑StarRocks Real‑Time Ad Data Warehouse
StarRocks
StarRocks
Apr 7, 2023 · Databases

StarRocks 3.0 Highlights: Storage‑Compute Separation, New RBAC, and Lakehouse Features

StarRocks 3.0 introduces a storage‑compute separation architecture, a full‑featured RBAC permission framework, enhanced materialized views, Trino‑compatible query dialect, richer Primary‑Key update/delete syntax, automatic partition creation, and numerous performance optimizations, marking a major step from OLAP to lakehouse analytics.

LakehouseRBACStarRocks
0 likes · 10 min read
StarRocks 3.0 Highlights: Storage‑Compute Separation, New RBAC, and Lakehouse Features
StarRocks
StarRocks
Feb 21, 2023 · Databases

How Yidian Tianxia Built a Unified Real‑Time & Offline Data Warehouse with StarRocks

Yidian Tianxia tackled massive daily data volumes and complex analytics by defining a five‑layer data‑warehouse standard, comparing ClickHouse and StarRocks performance, and implementing a unified real‑time/offline architecture with StarRocks, DataPlus, and EasyJob, achieving multi‑fold query speedups and lower operational costs.

Data GovernancePerformance TestingReal-time analytics
0 likes · 14 min read
How Yidian Tianxia Built a Unified Real‑Time & Offline Data Warehouse with StarRocks
StarRocks
StarRocks
Feb 10, 2023 · Databases

How StarRocks Achieves Fine-Grained Resource Isolation for Multi‑Tenant Workloads

StarRocks introduces user‑space scheduling with resource groups and classifiers to provide hard memory isolation, soft CPU/IO isolation, short‑query groups, concurrency limits, and large‑query circuit breaking, balancing isolation and utilization while supporting multi‑tenant workloads and future serverless scenarios.

CPU schedulingStarRocksmemory hard limit
0 likes · 17 min read
How StarRocks Achieves Fine-Grained Resource Isolation for Multi‑Tenant Workloads
JD Tech
JD Tech
Jan 13, 2023 · Big Data

UData: Solving the Last Mile of Data Usage – Architecture, Query Engine Design, and Federated Query Enhancements

This article introduces the UData platform, explains its data‑integration architecture, details the StarRocks‑based query engine workflow from SQL parsing to distributed execution, and describes recent optimizations such as computation push‑down, support for JSF/HTTP/ClickHouse external tables, and a proxy‑based federated query framework.

Big DataData IntegrationQuery Engine
0 likes · 20 min read
UData: Solving the Last Mile of Data Usage – Architecture, Query Engine Design, and Federated Query Enhancements
Ctrip Technology
Ctrip Technology
Jan 12, 2023 · Big Data

Real-Time Data Warehouse Architecture and Practice at Ctrip Hotel

The article explains why enterprises need real-time data warehouses, compares Lambda and Kappa architectures, describes Ctrip Hotel's Lambda‑plus‑OLAP variant built with Flink and StarRocks, and details practical solutions for ordering, wide‑table generation, and data validation that enable billion‑row, low‑latency analytics.

CtripFlinkLambda architecture
0 likes · 10 min read
Real-Time Data Warehouse Architecture and Practice at Ctrip Hotel
DataFunSummit
DataFunSummit
Dec 10, 2022 · Databases

StarRocks in the Modern Data Stack: Architecture Evolution, Typical Applications, and Performance Insights

This article presents a comprehensive overview of StarRocks within the modern data stack, covering the evolution of MPP architectures, typical industry use cases, core features, performance benchmark comparisons, real‑time data‑warehouse construction methods, CDP and lakehouse analytics, as well as short‑term roadmap plans and a brief Q&A.

CDPMPPStarRocks
0 likes · 11 min read
StarRocks in the Modern Data Stack: Architecture Evolution, Typical Applications, and Performance Insights
StarRocks
StarRocks
Dec 1, 2022 · Big Data

How Alibaba Cloud EMR StarRocks Supercharges Data Lake Analytics with Advanced Optimizations

This article explains how Alibaba Cloud EMR StarRocks extends data lake analytics to support Hive, Iceberg, and Hudi, detailing its architecture, Iceberg integration, performance gains over Trino, IO merging, lazy materialization, intelligent caching, and elastic compute capabilities for faster, unified, and cost‑effective queries.

Data LakeEMRElastic Compute
0 likes · 16 min read
How Alibaba Cloud EMR StarRocks Supercharges Data Lake Analytics with Advanced Optimizations
DataFunTalk
DataFunTalk
Nov 21, 2022 · Big Data

Building a Unified Data Analytics Platform at TCL Using StarRocks

The article describes how TCL leveraged StarRocks to create a unified data analytics platform, detailing the company’s background, OLAP evolution, typical StarRocks use cases such as real‑time dashboards, HR analytics, and email alerts, and outlines future plans for further integration and performance improvements.

Data PlatformOLAPStarRocks
0 likes · 10 min read
Building a Unified Data Analytics Platform at TCL Using StarRocks
360 Smart Cloud
360 Smart Cloud
Nov 17, 2022 · Databases

Exploring StarRocks Applications, Performance Tests, and Cloud‑Native Integration at 360

This article reviews the practical applications and experimental explorations of StarRocks at 360, describing the cloud‑native lake‑warehouse product Yunzhou, its three‑tier architecture, performance comparisons with Trino using TPCH 100 GB, challenges of Kubernetes integration, and future directions for storage‑compute separation.

Big DataCloud NativeKubernetes
0 likes · 7 min read
Exploring StarRocks Applications, Performance Tests, and Cloud‑Native Integration at 360
DataFunTalk
DataFunTalk
Nov 17, 2022 · Big Data

Building a Unified High‑Speed Analytics Platform with StarRocks at Cross‑Express

Cross‑Express consolidated multiple big‑data engines into a unified, high‑performance analytics platform using StarRocks, achieving millisecond‑level query latency, real‑time data warehousing, significant cost savings, and improved multi‑scenario business applications; the initiative also simplified BI development, reduced hardware requirements, and set a roadmap for future engine enhancements.

OLAPStarRocksreal-time data warehouse
0 likes · 10 min read
Building a Unified High‑Speed Analytics Platform with StarRocks at Cross‑Express
StarRocks
StarRocks
Nov 15, 2022 · Databases

How StarRocks 2.5 Improves Materialized Views for Real‑Time and Offline Queries

This article analyzes the requirements, design choices, and implementation details of materialized views in StarRocks, covering demand analysis, synchronous and asynchronous refresh solutions, partition binding, task scheduling, partition‑refresh maintenance, Insert‑Overwrite mechanics, view invalidation handling, and the upcoming features planned for version 2.5.

Async RefreshInsert OverwriteQuery Rewrite
0 likes · 14 min read
How StarRocks 2.5 Improves Materialized Views for Real‑Time and Offline Queries
StarRocks
StarRocks
Nov 8, 2022 · Databases

How StarRocks’ Real‑Time Storage Engine Evolves to Meet Modern Analytics Demands

This article outlines the evolution of StarRocks’ storage engine—from its real‑time update capabilities and primary‑key model challenges to recent optimizations like persistent indexes, partial column updates, conditional updates, high‑frequency import improvements, DML support, and future plans for separating primary and sort keys, introducing row‑store, and enhancing materialized view support.

DMLReal-time analyticsRow Store
0 likes · 18 min read
How StarRocks’ Real‑Time Storage Engine Evolves to Meet Modern Analytics Demands
政采云技术
政采云技术
Nov 8, 2022 · Big Data

User Path Analysis in the Hunyi System: Design, Computation Logic, and StarRocks Implementation

This article explains user path analysis as a method to visualize and optimize user flow, describes its productization in the Hunyi analytics platform, details the underlying computation logic, presents a complex StarRocks SQL solution, discusses performance challenges, and suggests future improvements and recruitment opportunities.

Big DataStarRocksperformance optimization
0 likes · 21 min read
User Path Analysis in the Hunyi System: Design, Computation Logic, and StarRocks Implementation
StarRocks
StarRocks
Nov 4, 2022 · Big Data

Building a High‑Performance, Cost‑Effective Cloud Lakehouse with StarRocks and EMR

This article explains how to design and implement a cloud‑native Lakehouse using StarRocks and Tencent Cloud EMR, covering core technical requirements, a five‑layer architecture, data ingestion with Iceberg/Hudi, performance tricks like Z‑order clustering, cost‑control through elastic scaling, and the key product features of EMR StarRocks.

Big DataEMRHudi
0 likes · 24 min read
Building a High‑Performance, Cost‑Effective Cloud Lakehouse with StarRocks and EMR
DataFunTalk
DataFunTalk
Nov 3, 2022 · Big Data

How Meituan Food Service SaaS Built a Data Middle Platform on StarRocks

This article describes how Meituan Food Service SaaS built a high‑quality, large‑scale data middle platform using StarRocks, covering business overview, technical selection, multi‑layer architecture, virtual views, intelligent tiered querying, multi‑active hot standby, and the performance gains achieved.

Data PlatformMeituanStarRocks
0 likes · 17 min read
How Meituan Food Service SaaS Built a Data Middle Platform on StarRocks
StarRocks
StarRocks
Nov 2, 2022 · Databases

Mastering Join Optimization in StarRocks: Techniques, Algorithms, and Distributed Planning

This article provides a comprehensive, step‑by‑step guide to StarRocks join optimization, covering join types, logical rewrite rules, predicate push‑down, join reorder algorithms, cost modeling, distributed join strategies, and runtime filters, while offering practical tips for achieving high‑performance query execution.

Cost ModelDistributed SQLJOIN optimization
0 likes · 26 min read
Mastering Join Optimization in StarRocks: Techniques, Algorithms, and Distributed Planning
StarRocks
StarRocks
Oct 20, 2022 · Databases

Inside StarRocks Pipeline Engine: How BE Splits and Schedules Queries

This article explains the core concepts, architecture, and source‑code details of StarRocks’ Pipeline execution framework, covering BE initialization, query lifecycle management, operator splitting, PipelineBuilder processing, and the scheduling logic of PipelineDriver, with concrete code examples and diagrams to illustrate each step.

Database EnginePipelineQuery Execution
0 likes · 21 min read
Inside StarRocks Pipeline Engine: How BE Splits and Schedules Queries
DataFunTalk
DataFunTalk
Oct 17, 2022 · Big Data

How Data Empowers the Fast‑Moving Consumer Goods Industry: Baicaowei’s End‑to‑End Data Platform Evolution

This article details Baicaowei’s journey from a Hadoop‑based data platform to a modern StarRocks‑driven architecture, illustrating how digitalization, evolving business needs, and streamlined data pipelines empower the fast‑moving consumer goods sector through efficient data collection, modeling, and analytics.

Big DataData ArchitectureDigital Transformation
0 likes · 10 min read
How Data Empowers the Fast‑Moving Consumer Goods Industry: Baicaowei’s End‑to‑End Data Platform Evolution
StarRocks
StarRocks
Oct 13, 2022 · Databases

Inside StarRocks: How the Pipeline Execution Engine Boosts Query Performance

This article explains the core concepts, architecture, and code logic of StarRocks' Pipeline execution framework, covering ExecPlan, PlanFragment, Fragment Instance, ExecNode, SourceOperator, SinkOperator, PipelineDriver scheduling, asynchronous handling of blocking operations, and the roles of FE and BE in MPP scheduling.

Execution EngineMPPPipeline
0 likes · 13 min read
Inside StarRocks: How the Pipeline Execution Engine Boosts Query Performance
StarRocks
StarRocks
Oct 11, 2022 · Databases

Why StarRocks Outperforms Traditional OLAP: Architecture, Storage Model, and Real‑World Use Cases

This article explains the advantages of StarRocks as a next‑generation MPP database, detailing its simplified architecture, vectorized engine, storage layout, partitioning and bucketing strategies, and showcases two production case studies with performance comparisons, configuration tips, and future roadmap considerations.

Flink IntegrationMPP databaseStarRocks
0 likes · 17 min read
Why StarRocks Outperforms Traditional OLAP: Architecture, Storage Model, and Real‑World Use Cases
DataFunSummit
DataFunSummit
Sep 24, 2022 · Big Data

Evolution of 37 Mobile Games' Multi-Dimensional Analysis Platform: From MySQL to StarRocks

The article details how 37 Mobile Games built and continuously evolved a multi-dimensional analytics platform—covering business background, data challenges, the migration from MySQL through Druid, Impala, ClickHouse to StarRocks, self‑service data tools, monitoring, and future roadmap—highlighting technical decisions and lessons learned.

ImpalaOLAPStarRocks
0 likes · 20 min read
Evolution of 37 Mobile Games' Multi-Dimensional Analysis Platform: From MySQL to StarRocks
DeWu Technology
DeWu Technology
Sep 14, 2022 · Databases

Introduction to StarRocks: Architecture, Storage, Use Cases, and Troubleshooting

StarRocks is a high‑performance MPP database whose simplified FE/BE architecture, fully vectorized engine, and CBO optimizer enable fast multi‑table joins, while its partition‑bucket‑tablet storage model supports real‑time metric services and dashboard migrations, accompanied by practical troubleshooting guidance and upcoming enhancements.

MPP databaseReal-time analyticsStarRocks
0 likes · 15 min read
Introduction to StarRocks: Architecture, Storage, Use Cases, and Troubleshooting
DataFunSummit
DataFunSummit
Sep 2, 2022 · Big Data

ZhongAn Insurance Data Platform: Digital Transformation, 4633 Framework, and Real‑time Data Warehouse with StarRocks

This article details ZhongAn Insurance's digital transformation through its 4633 data‑centric framework, the architecture of its JiZhi data platform, the challenges of its original ClickHouse‑based real‑time warehouse, and how migrating to StarRocks improved performance, scalability, and operational efficiency across advertising and insurance use cases.

Big DataData PlatformDigital Transformation
0 likes · 13 min read
ZhongAn Insurance Data Platform: Digital Transformation, 4633 Framework, and Real‑time Data Warehouse with StarRocks
StarRocks
StarRocks
Aug 17, 2022 · Databases

Why Vectorization Supercharges Database Performance: Deep Dive into StarRocks

This article explains how CPU‑centric vectorization, especially SIMD, reduces instruction count and CPI, addresses the four major CPU bottlenecks, and how StarRocks systematically applies automatic and manual SIMD techniques, verification methods, and a suite of engineering optimizations to achieve multi‑fold query speedups.

CPU optimizationSIMDStarRocks
0 likes · 16 min read
Why Vectorization Supercharges Database Performance: Deep Dive into StarRocks
StarRocks
StarRocks
Aug 6, 2022 · Databases

How StarRocks Accelerates Low‑Cardinality String Queries with Global Dictionary Optimization

This article explains how StarRocks uses a global dictionary to transform low‑cardinality string columns into integer codes, dramatically improving query performance across scan, filter, aggregation, join, shuffle, and sort phases, and details the construction, maintenance, and practical impact of this optimization.

StarRocksglobal dictionarylow cardinality
0 likes · 17 min read
How StarRocks Accelerates Low‑Cardinality String Queries with Global Dictionary Optimization
DataFunTalk
DataFunTalk
Jul 30, 2022 · Databases

StarRocks-Based Unified Data Service and Analytics Platform at JD Logistics

JD Logistics leverages StarRocks to create the Udata unified query engine, addressing data silos, low performance, and high maintenance costs by integrating data services and analytics, enabling low‑code data service generation, high‑speed federated queries, real‑time updates, and future data‑lake and resource isolation capabilities.

Data IntegrationReal-time analyticsStarRocks
0 likes · 14 min read
StarRocks-Based Unified Data Service and Analytics Platform at JD Logistics
DataFunTalk
DataFunTalk
Jul 24, 2022 · Big Data

Real-time Data Warehouse Empowering Fine-grained Intelligent Operations in Finance – A Practical Case Study

This talk by Zhongan Insurance’s Data Senior Director Shi Xingtian outlines the company’s digital transformation, detailing the 4633 framework, the real-time data warehouse architecture, the migration from ClickHouse to StarRocks, and how these technologies support fine‑grained, intelligent financial operations and advertising analytics.

Big DataStarRocksZhongan Insurance
0 likes · 14 min read
Real-time Data Warehouse Empowering Fine-grained Intelligent Operations in Finance – A Practical Case Study
StarRocks
StarRocks
Jul 22, 2022 · Big Data

How 37 Mobile Games Boosted Analytics with StarRocks: A Real‑World Performance Case Study

37 Mobile Games, a leading mobile game publisher, migrated its user‑profile analytics from a Hadoop‑Hudi‑Kafka‑Hive‑Flink stack to StarRocks, achieving sub‑second query latency on billion‑row tables, simplifying operations, reducing storage costs, and enabling real‑time data sync, as detailed in this technical case study.

Big DataOLAPStarRocks
0 likes · 12 min read
How 37 Mobile Games Boosted Analytics with StarRocks: A Real‑World Performance Case Study
HomeTech
HomeTech
Jul 20, 2022 · Big Data

Design and Implementation of a Real-Time Advertising Data Warehouse Using Flink and StarRocks

This article presents a comprehensive case study of building a real‑time advertising data warehouse at Auto Home, detailing the evaluation of streaming engines and storage solutions, the layered architecture design, implementation steps with Flink and StarRocks, monitoring practices, encountered issues, and future roadmap, demonstrating how second‑level data freshness and high accuracy were achieved.

FlinkStarRocksStreaming
0 likes · 10 min read
Design and Implementation of a Real-Time Advertising Data Warehouse Using Flink and StarRocks
StarRocks
StarRocks
Jul 18, 2022 · Big Data

How Songguo Mobility Built a Real‑Time OLAP Platform with StarRocks: From 1.0 to 3.0

Songguo Mobility’s data‑center team migrated from a fragmented Impala‑Kudu‑ClickHouse stack to a unified StarRocks‑based real‑time OLAP architecture, iterating through three versions to solve scalability, latency, and maintenance challenges while supporting minute‑level dashboards for orders and vehicle analytics.

FlinkKafkaReal-time OLAP
0 likes · 19 min read
How Songguo Mobility Built a Real‑Time OLAP Platform with StarRocks: From 1.0 to 3.0
DataFunTalk
DataFunTalk
Jun 29, 2022 · Big Data

Migrating a Game Data Platform to StarRocks: Architecture, Performance Gains, and Operational Benefits

This article describes how the gaming company Boke City rebuilt its comprehensive data service platform by replacing a CDH‑based Impala solution with StarRocks, detailing the architectural changes, performance benchmark results, and the resulting improvements in query speed, real‑time data updates, and operational simplicity.

Big DataData PlatformGame Analytics
0 likes · 14 min read
Migrating a Game Data Platform to StarRocks: Architecture, Performance Gains, and Operational Benefits
StarRocks
StarRocks
Jun 2, 2022 · Big Data

Simplify Real‑Time Data Warehousing with Flink CDC and StarRocks

This article explores how combining Flink CDC with StarRocks can streamline real‑time data pipelines, reduce component complexity, support both full and incremental synchronization, and enable efficient OLAP queries and updates for fast, scalable analytics across diverse business scenarios.

Flink CDCOLAPReal-time analytics
0 likes · 18 min read
Simplify Real‑Time Data Warehousing with Flink CDC and StarRocks
StarRocks
StarRocks
May 12, 2022 · Databases

How StarRocks’ Primary Key Model Delivers 3‑5× Faster Real‑Time Queries

This article explains the design and implementation of StarRocks 2.x Primary Key tables, covering real‑time update mechanisms, write and commit workflows, in‑memory primary indexing, compaction, read‑path optimizations, performance benchmarks, and upcoming features such as partial and conditional updates.

OLAPStarRockscompaction
0 likes · 19 min read
How StarRocks’ Primary Key Model Delivers 3‑5× Faster Real‑Time Queries
StarRocks
StarRocks
May 7, 2022 · Databases

How 360 Built a Lightning‑Fast Unified Analytics Platform with StarRocks

Facing massive data storage and query challenges, 360 upgraded its analytics architecture by adopting StarRocks, achieving multi‑dimensional, high‑concurrency analysis, simplified data pipelines, and significant performance and cost improvements across its radar and user‑portrait platforms.

AnalyticsBig DataOLAP
0 likes · 10 min read
How 360 Built a Lightning‑Fast Unified Analytics Platform with StarRocks
StarRocks
StarRocks
Apr 24, 2022 · Databases

How StarRocks Transforms a SQL Query into Distributed Execution: A Deep Dive

This article explains how StarRocks converts a SQL statement into an optimal distributed physical execution plan, schedules the plan across compute nodes, and runs it using MPP, pipeline parallelism, and vectorized execution to achieve near‑linear performance scaling.

CBO optimizerMPPSQL query processing
0 likes · 15 min read
How StarRocks Transforms a SQL Query into Distributed Execution: A Deep Dive
StarRocks
StarRocks
Apr 13, 2022 · Big Data

How StarRocks Achieves Lightning‑Fast Data Lake Analytics

This article explains StarRocks' streamlined architecture, cost‑based optimizer, massively parallel processing and vectorized engine, and how they enable high‑performance queries over data stored in Hive, Iceberg, Hudi and other lake formats, backed by benchmark results and future roadmap details.

Big DataCBOData Lake
0 likes · 19 min read
How StarRocks Achieves Lightning‑Fast Data Lake Analytics
DataFunTalk
DataFunTalk
Apr 13, 2022 · Databases

Adopting StarRocks for Real‑Time Analytics in ZhongAn’s JiZhi Platform: A Performance Comparison with ClickHouse

This article describes how ZhongAn Insurance’s JiZhi data‑analysis platform migrated from ClickHouse to the MPP OLAP engine StarRocks, detailing the business requirements, architectural challenges, benchmark results across single‑table and multi‑table queries, and the resulting improvements in latency, concurrency, and operational simplicity for real‑time analytics.

Big DataOLAPPerformance Testing
0 likes · 14 min read
Adopting StarRocks for Real‑Time Analytics in ZhongAn’s JiZhi Platform: A Performance Comparison with ClickHouse