Tagged articles

StarRocks

228 articles · Page 1 of 3

Jun 25, 2026 · Databases

StarRocks 4.1 Enables Faster Iceberg Queries While Preserving Data Freshness

StarRocks 4.1 introduces an incremental materialized view for Apache Iceberg that ties refresh cost to data changes instead of table size, dramatically cutting refresh time, maintaining low latency, and keeping query results fresh even as tables scale to terabytes or petabytes, with a fallback to partition refresh when needed.

Apache Iceberg · Data Freshness · Incremental Materialized View

0 likes · 8 min read

StarRocks

Jun 17, 2026 · Databases

How StarRocks 4.1 Simplifies Operations and Boosts Production Performance

StarRocks 4.1 introduces automatic multi‑tenant data management, large‑capacity tablets, second‑level schema evolution, enhanced cache observability, and deeper Iceberg support, addressing static data distribution, data skew, high repair costs and expertise requirements while delivering up to 1.86× higher throughput and dramatically lower latency in production workloads.

Cache Observability · Data Distribution · Fast Schema Evolution

0 likes · 13 min read

StarRocks

Jun 12, 2026 · Big Data

Building a Millisecond-Responsive Real-Time Data Engine with StarRocks, Fluss, and Paimon

This article presents a lake‑stream integrated solution that combines Apache Fluss, Apache Paimon, and StarRocks to achieve second‑level data freshness, tenfold storage cost reduction, and a single‑query access pattern for both real‑time and historical data, detailing its architecture, advantages, query modes, and future roadmap.

Fluss · Lakehouse · Paimon

0 likes · 13 min read

StarRocks

Jun 4, 2026 · Databases

How StarRocks and Iceberg Enable Federated Queries: A Practical Walkthrough

This article details Fresha's real‑world integration of StarRocks with Apache Iceberg, covering metadata planning, distributed execution, adaptive metadata retrieval, hot‑cold data layering, missing statistics handling, catalog configuration, and performance optimizations that together demonstrate how federated queries can be efficiently executed over data‑lake tables.

Apache Iceberg · Data Lake · Federated Query

0 likes · 14 min read

StarRocks

May 28, 2026 · Industry Insights

How Fresha Built a Modern Real‑Time Analytics Stack with AutoMQ and StarRocks

Fresha replaced its Postgres‑Snowflake‑MSK pipeline with an AutoMQ‑based Diskless Kafka message layer and StarRocks for real‑time analytics, cutting storage costs 17‑20×, dropping query latency from seconds to sub‑second, and migrating ~1,000 topics in a week with zero downtime.

AutoMQ · Cloud Migration · Kafka

0 likes · 24 min read

Alibaba Cloud Big Data AI Platform

May 23, 2026 · Cloud Computing

Best Practice: Using EMR Serverless StarRocks AI Function for Financial Text Classification

This article demonstrates how to leverage StarRocks AI Function on EMR Serverless to perform sentiment analysis, intelligent classification, information extraction, and PII redaction on financial text entirely within SQL, eliminating data export, reducing latency, and ensuring compliance while providing concrete code examples, performance benchmarks, and best‑practice recommendations.

AI Function · EMR Serverless · Financial NLP

0 likes · 25 min read

StarRocks

May 21, 2026 · Databases

Say Goodbye to Repeated Pitfalls with Our Open‑Source AI Skill for Database Troubleshooting

The article introduces starrocks‑debug‑skills, an open‑source, three‑layer knowledge base (Skills, Cases, Tools) that captures real‑world StarRocks troubleshooting experience, shows how AI assistants can use it to diagnose issues such as import timeouts, version errors, and compaction slowdowns, and explains how to contribute new cases.

AI · Database Troubleshooting · Open-source

0 likes · 13 min read

StarRocks

May 20, 2026 · Big Data

How StarRocks, Paimon, and Fluss Enable Multimodal Fusion Search in a Lakehouse

The Streaming Lakehouse Meetup (May 27) explores breaking data silos by unifying structured tables, images, video, audio, and high‑dimensional vectors through StarRocks‑Paimon‑Fluss integration, covering multimodal fusion retrieval, vector search internals, native reader/writer performance gains, and real‑world ANN indexing practices.

Fluss · Lakehouse · Multimodal

0 likes · 5 min read

StarRocks

May 8, 2026 · Big Data

Scaling Real‑Time Analytics at KaptureCX: Best Practices with RisingWave and StarRocks

KaptureCX migrated its core analytics from ClickHouse to StarRocks, introduced RisingWave and Kafka for CDC, and achieved millisecond‑level query latency, a reporting cycle cut from weeks to one day, and a solid data foundation for AI‑driven services.

CDC · Kafka · MVP

0 likes · 11 min read

StarRocks

Apr 16, 2026 · Databases

Why Traditional Databases Stall AI Agents—and How StarRocks Overcomes the Bottleneck

Traditional databases were built for low‑frequency, human‑driven queries, but AI agents generate dozens of concurrent, sub‑second queries that expose architectural limits, and StarRocks addresses these challenges with self‑healing optimization, real‑time data pipelines, extreme concurrency handling, and seamless lakehouse access.

Database Concurrency · Lakehouse · Query Optimization

0 likes · 13 min read

StarRocks

Apr 1, 2026 · Databases

Inside StarRocks I/O: How Tablets, Fragments, Pipelines, and Morsels Power Parallel Scanning

This article explains the core I/O components of StarRocks—Tablet, Fragment, Pipeline, Morsel, ScanOperator, ChunkSource, ScanTask, and ChunkBuffer—showing how they work together to achieve high‑performance, low‑latency query execution in a compute‑storage separated architecture.

I/O Architecture · MPP database · Morsel

0 likes · 21 min read

vivo Internet Technology

Mar 25, 2026 · Industry Insights

How Vivo Scaled Marketing Automation with Presto, Bitmap, and StarRocks

This case study details how Vivo’s marketing automation platform evolved its data‑driven architecture—from a Presto‑based wide‑table design, through a Bitmap optimization, to a StarRocks migration—addressing performance bottlenecks, reducing resource costs, and enhancing data security.

Big Data · Data Architecture · OLAP

0 likes · 11 min read

StarRocks

Mar 11, 2026 · Databases

How StarRocks Supercharges Real‑Time Ad Funnel Monitoring and Creative Optimization

This article dissects the full advertising funnel, explains why CTR and eCPM are critical, and demonstrates how StarRocks combined with Flink can deliver minute‑level real‑time monitoring, material selection, anomaly alerts, A/B testing, and a successful migration from Druid for massive ad‑tech workloads.

Advertising · Materialized Views · SQL

0 likes · 20 min read

Big Data Technology & Architecture

Mar 6, 2026 · Big Data

What’s New in Big Data Frameworks? ClickHouse, Fluss, Delta Lake, StarRocks & More (Mar 2026)

This roundup compiles the latest releases across major data platforms—including ClickHouse, Apache Fluss, Delta Lake, StarRocks, Apache Pulsar and DolphinScheduler—highlighting version numbers, key feature additions, security fixes, and emerging trends shaping the big‑data ecosystem.

Apache Fluss · Big Data · ClickHouse

0 likes · 19 min read

StarRocks

Mar 5, 2026 · Big Data

How Fanatics Scaled to PB‑Level Data with StarRocks & Apache Iceberg Lakehouse

Fanatics unified its fragmented data stack by building a StarRocks‑powered Lakehouse on Apache Iceberg, replacing Redshift, Snowflake, Athena, and Druid, which cut costs by up to 95%, delivered sub‑second dashboard queries on petabyte‑scale data, and enabled real‑time and historical analytics on a single platform.

Apache Iceberg · Data Architecture · Fanatics

0 likes · 10 min read

StarRocks

Feb 11, 2026 · Big Data

How StarRocks and Apache Paimon Build a True Lakehouse Native Engine

This article details the deep integration of StarRocks with Apache Paimon, describing the unified architecture, version evolution, performance enhancements, time‑travel queries, native readers/writers, distributed planning, and future roadmap for achieving lakehouse‑native analytics at scale.

Apache Paimon · Data Lake · Lakehouse

0 likes · 10 min read

Alibaba Cloud Big Data AI Platform

Feb 4, 2026 · Big Data

How Paimon + StarRocks Power Real‑Time OLAP for Double‑11 Mega‑Sales

During Double‑11 mega‑sales, Taobao Group faced exploding OLAP query traffic, costly data sync pipelines, and slow near‑real‑time analytics, so they unified real‑time and batch data in Paimon, leveraged StarRocks for high‑performance lake queries, tuned cluster settings, and saved nearly ten‑million yuan annually while cutting refresh latency by 80%.

Big Data · Data Lake · OLAP

0 likes · 22 min read

Alibaba Cloud Big Data AI Platform

Feb 2, 2026 · Big Data

How We Built a Scalable Lakehouse Architecture with StarRocks, Paimon, and Flink

This article details the evolution of a data warehouse at RenliJia from a MaxCompute‑centric setup to a modern lakehouse using StarRocks, Paimon, Flink, and Fluss, describing design goals, technical evaluations, implementation steps for offline, OLAP, and real‑time workloads, and the challenges and future plans that emerged.

Big Data · Data Warehouse · Flink

0 likes · 25 min read

StarRocks

Jan 22, 2026 · Big Data

How Paimon + StarRocks Accelerates Double‑11 OLAP Queries by 80% Refresh Speed

This article explains how Taotian Group unified real‑time and offline data using Paimon as lake storage and StarRocks for high‑performance OLAP, eliminating costly sync pipelines, cutting refresh time by about 80%, saving nearly ten million yuan annually, and detailing the architecture, cluster safeguards, configuration tweaks, monitoring, and future roadmap for large‑scale promotional events.

Big Data · Data Architecture · OLAP

0 likes · 24 min read

Alibaba Cloud Big Data AI Platform

Jan 8, 2026 · Big Data

How Gaode Maps Built a Real‑Time Lakehouse for Billion‑Scale Trajectory Data

This article details Gaode Maps' end‑to‑end lakehouse solution for massive, high‑frequency trajectory data, covering the challenges of real‑time visibility, query performance, and storage cost, and explaining how a hot‑warm‑cold tiering architecture built on Apache Flink, Paimon, StarRocks, Redis and Lindorm delivers millisecond‑level queries while cutting storage expenses.

Apache Flink · Apache Paimon · Data Tiering

0 likes · 19 min read

Big Data Tech Team

Jan 8, 2026 · Industry Insights

Why Paimon + StarRocks Is the New Real‑Time Lakehouse Choice for Big Tech

A veteran data‑warehouse expert explains how the Paimon‑StarRocks stack solves the write‑read split, cuts storage costs, and delivers real‑time analytics, compares it with Hudi, Iceberg, ClickHouse, and Trino, and shows why leading Chinese tech firms are adopting this lakehouse architecture.

Data Warehouse · Lakehouse · Paimon

0 likes · 9 min read

StarRocks

Jan 7, 2026 · Big Data

How Gaode Maps Built a Real‑Time Lakehouse for Billion‑Scale Trajectory Data

This article details Gaode Maps' end‑to‑end lakehouse solution for handling high‑frequency, high‑volume trajectory data, covering the challenges of real‑time visibility, multi‑scenario queries, storage cost, and data silos, and describing the layered storage architecture, performance validation, and future expansion plans.

Apache Flink · Data Tiering · Lakehouse

0 likes · 21 min read

Alibaba Cloud Big Data AI Platform

Dec 30, 2025 · Big Data

How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine

StarRocks and Apache Paimon have been progressively integrated across multiple releases, enabling a unified lakehouse architecture that supports multi-source federated analysis, time-travel queries, native readers/writers, distributed planning, and advanced profiling, while delivering performance gains that bring Paimon query speed on par with native StarRocks tables.

Apache Paimon · Data Integration · Lakehouse

0 likes · 9 min read

StarRocks

Dec 25, 2025 · Big Data

How dbt, DataOps, and StarRocks Combine to Accelerate Real‑Time Data Modeling

This article explains how dbt drives automated data modeling and governance, how DataOps practices bring agility and control to data projects, and how StarRocks’ lakehouse architecture enables real‑time and batch analytics, illustrated with concrete workflows, version‑control conventions, and enterprise case studies.

Data Governance · DataOps · ELT

0 likes · 14 min read

StarRocks

Dec 18, 2025 · Databases

How Fresha Scaled Real‑Time Analytics with StarRocks: A Deep Dive into Their Hybrid Architecture

Facing Postgres overload and costly Snowflake queries, Fresha rebuilt its analytics platform by introducing StarRocks as a unified SQL entry point, combining federated lakehouse queries with high‑performance internal tables, which reduced homepage query latency to around 200 ms and achieved minute‑level data freshness across real‑time, historical, and search workloads.

Compute-Storage Separation · Hybrid Architecture · Lakehouse

0 likes · 20 min read

StarRocks

Dec 11, 2025 · Databases

How StarRocks Redesigns Bulk Import to Cut Small Files and Boost Throughput

This article explains how StarRocks mitigates the hidden risks of massive one‑time data imports in a storage‑compute separated architecture by redesigning the write path to spill to local disk, merge centrally, and write to object storage, resulting in fewer small files, higher write throughput, and more stable query performance.

Bulk Import · Compaction · Data Engineering

0 likes · 12 min read

Ctrip Technology

Nov 27, 2025 · Big Data

How Ctrip Cut Query Latency by 85% with StarRocks’ Compute‑Storage Separation

Ctrip migrated its massive User Behavior Tracking system from ClickHouse to a compute‑storage separated StarRocks cluster on Kubernetes, achieving millisecond‑level query latency, halving storage usage, reducing node count, and sustaining millions‑of‑rows‑per‑second write throughput while simplifying scaling and operations.

Big Data · ClickHouse · Compute-Storage Separation

0 likes · 15 min read

StarRocks

Nov 18, 2025 · Databases

StarRocks Beats ClickHouse, Snowflake, and Databricks in Coffee‑Shop Benchmark – Up to 10× Faster and Cheaper

A reproducible evaluation of StarRocks using the open‑source Coffee‑shop Benchmark shows that across 500 M, 1 B and 5 B row scales, StarRocks completes 17 complex join and aggregation queries 2–10× faster and with significantly lower cost than ClickHouse, Snowflake and Databricks, demonstrating superior performance and cost efficiency for analytical workloads.

Coffee-shop Benchmark · Database Performance · StarRocks

0 likes · 11 min read

Alibaba Cloud Big Data AI Platform

Nov 15, 2025 · Big Data

From a Decade-Long Big Data Journey to a Cloud‑Native Lakehouse

This article chronicles a ten‑year evolution of a self‑built big data platform—detailing early Hadoop clusters, successive migrations to Spark, Hive, Hudi, and StarRocks, the operational challenges encountered, and the comprehensive shift to Alibaba Cloud EMR Serverless that delivered significant cost, performance, and stability gains while outlining future intelligent‑ecosystem plans.

Big Data · Data Lake · EMR Serverless

0 likes · 17 min read

StarRocks

Nov 5, 2025 · Databases

How FlatJSON Transforms JSON Queries in StarRocks 4.0 for Near‑Columnar Performance

StarRocks 4.0 introduces FlatJSON, a columnar storage and execution engine that converts high‑frequency JSON fields into native columns, dramatically reducing I/O and CPU costs and enabling JSON queries to run with performance close to that of traditional columnar data.

Columnar Storage · Database Performance · FlatJSON

0 likes · 19 min read

StarRocks

Oct 28, 2025 · Databases

How Cisco Migrated from Pinot to StarRocks and Boosted Query Performance by Up to 70%

This article details Cisco Webex's migration from a complex Pinot‑Trino OLAP stack to StarRocks, covering the challenges of the legacy system, the step‑by‑step migration process—including storage, compute, and SQL dialect transformation—and the resulting performance gains, cost reductions, and operational improvements.

Big Data · OLAP · Pinot

0 likes · 23 min read

StarRocks

Oct 21, 2025 · Databases

How StarRocks 3.5 Enables Fast Cluster Snapshots and Disaster Recovery in Kubernetes

StarRocks 3.5 introduces a cluster‑level snapshot mechanism that automates backup to object storage, supports minute‑level recovery, and integrates with Kubernetes via Helm charts to streamline disaster‑recovery workflows for high‑availability workloads.

Disaster Recovery · Kubernetes · S3

0 likes · 17 min read

Alibaba Cloud Big Data AI Platform

Oct 18, 2025 · Big Data

Alibaba Cloud EMR’s AI Evolution: Accelerating Big Data Performance

Since its 2016 launch, Alibaba Cloud EMR has transformed from a basic open‑source Hadoop service into a high‑performance, AI‑enabled big‑data platform, delivering optimized I/O, vectorized processing, and integrated AI functions such as natural‑language SQL, StarRocks and Spark enhancements, while supporting diverse industry workloads.

Cloud Computing · EMR · Spark

0 likes · 9 min read

StarRocks

Oct 14, 2025 · Big Data

How Ctrip Scaled UBT Analytics by Migrating from ClickHouse to StarRocks

Ctrip's User Behavior Tracking (UBT) system, handling 30 TB of daily data, moved from ClickHouse to StarRocks' compute‑storage separated architecture, cutting average query latency from 1.4 seconds to 203 ms, halving storage, reducing nodes from 50 to 40, and boosting write throughput to 3 million rows per second.

Big Data · ClickHouse · Data Migration

0 likes · 15 min read

StarRocks

Sep 23, 2025 · Databases

How Zepto Scaled Real‑Time Brand Analytics with StarRocks: From Postgres MVP to Sub‑Second Queries

Zepto transformed its brand‑analytics platform from a Postgres MVP into a production‑grade, sub‑second real‑time analytics solution by adopting StarRocks, redesigning its data pipeline with Databricks, Kafka, and Flink, and choosing a storage‑compute architecture that supports massive joins and rapid insights.

Databricks · Flink · Kafka

0 likes · 14 min read

Alibaba Cloud Big Data AI Platform

Sep 15, 2025 · Big Data

How a FinTech Firm Boosted Real‑Time Decision Making with StarRocks Data Warehouse

This case study details how Shuhe Technology, a leading fintech company, overcame data redundancy, low resource utilization, and slow reporting by adopting Alibaba Cloud EMR Serverless StarRocks for a unified, real‑time data warehouse, achieving standardized data pipelines, cost savings, and minute‑level decision latency.

Big Data · FinTech · Real-Time Data Warehouse

0 likes · 8 min read

iQIYI Technical Product Team

Sep 11, 2025 · Databases

How StarRocks Unified Data Warehouse Simplified Our Multi-Source Advertising Platform

This article explains how the Tianji advertising platform consolidated heterogeneous MySQL, ClickHouse, and TiDB data sources into a single StarRocks data warehouse, addressing data silos, real‑time performance, and query complexity while improving accuracy, latency, and development efficiency.

Data Warehouse · OLAP · StarRocks

0 likes · 15 min read

StarRocks

Sep 9, 2025 · Big Data

From Hadoop to StarRocks: Revamping a Government Procurement Data Platform

Facing massive data volumes, complex component dependencies, high TCO, and real‑time processing limits, the Zhengcaiyun (政采云) platform replaced its Hadoop stack with StarRocks’ minimalist, decoupled architecture, achieving lower costs, elastic scaling, faster queries, easier operations, and robust fault tolerance across diverse government procurement workloads.

Cloud Native · Data Warehouse · Hadoop migration

0 likes · 16 min read

Alibaba Cloud Big Data AI Platform

Sep 5, 2025 · Big Data

How StarRocks + Paimon Powered Real‑Time Analytics for Alibaba’s Taobao Flash Sale

Facing minute‑level decision demands and billions of marketing events during Taobao's Flash Sale, the Ele.me data team built a real‑time lakehouse with StarRocks and Paimon, leveraging asynchronous materialized views, RoaringBitmap de‑duplication, and resource isolation to achieve sub‑second query latency, lower storage costs, and stable performance under high concurrency.

Lakehouse · Materialized Views · Paimon

0 likes · 25 min read

StarRocks

Sep 2, 2025 · Big Data

How StarRocks + Paimon Powered Real‑Time Analytics for Alibaba’s Flash Sale

Faced with billions of marketing events and minute‑level decision requirements during Taobao's flash‑sale campaign, the e‑commerce data team built a real‑time lakehouse using StarRocks and Paimon, leveraged asynchronous materialized views and RoaringBitmap deduplication, and achieved sub‑second query latency, massive cost savings, and stable high‑concurrency performance.

Big Data · Lakehouse · Materialized Views

0 likes · 26 min read

Big Data Technology Tribe

Aug 22, 2025 · Backend Development

How StarRocks Keeps Metadata Consistent Across FE Nodes

This article explains the roles of StarRocks FE and BE nodes, details the metadata stored in FE, describes the leader‑follower‑observer architecture, and shows how BDB JE replication, journal logs, and checkpoint mechanisms ensure metadata synchronization and durability even after node failures.

BDB JE · Metadata · StarRocks

0 likes · 17 min read

Alibaba Cloud Big Data AI Platform

Aug 21, 2025 · Big Data

How Hypergryph Built a High‑Performance Real‑Time Analytics Platform with StarRocks

This case study details how Hypergryph leveraged Alibaba Cloud EMR Serverless StarRocks, Flink, and Kafka to replace a ClickHouse data warehouse with a high‑performance, elastic, and easy‑to‑operate real‑time analytics platform that dramatically improved query speed, stability, operational efficiency, and cost for their gaming business.

Cloud Computing · Flink · Kafka

0 likes · 8 min read

StarRocks

Aug 19, 2025 · Big Data

How Joydata Scaled to 150 Billion Daily Events with StarRocks: A Data Architecture Journey

Facing daily data growth from millions to 150 billion records, Joydata‑U transformed its analytics platform through three architectural stages—Hadoop, Hadoop + Trino, and finally StarRocks—introducing resource isolation, Flat JSON acceleration, and Bitmap indexing to cut query latency by up to seven times and achieve sub‑2‑minute data freshness across BI, ad‑tech, game analytics, and CRM workloads.

Bitmap Index · Data Architecture · Flat JSON

0 likes · 12 min read

Big Data Technology Tribe

Aug 15, 2025 · Backend Development

How StarRocks TabletChecker Guarantees Tablet Health and Scheduling

The article explains the purpose, configuration, and core implementation of StarRocks' TabletChecker component, detailing how it periodically scans OlapTable tablets, evaluates their health through multiple checks, and hands unhealthy tablets to the TabletScheduler for repair.

Java · StarRocks · TabletChecker

0 likes · 16 min read

StarRocks

Aug 6, 2025 · Databases

How Qunar Migrated to StarRocks: Architecture, Performance Gains & Best Practices

This article details Qunar's transition to StarRocks as a unified OLAP engine, covering the business background, engine evaluation, architecture redesign, observability, high‑availability strategies, query‑performance optimizations, real‑world application cases, community contributions, and future plans.

Data Platform · High Availability · OLAP

0 likes · 21 min read

StarRocks

Jul 23, 2025 · Big Data

How StarRocks Powers Intelligent BI with AI‑Native Lakehouse Architecture

This article explores the evolution of business intelligence toward intelligent BI, detailing traditional BI limitations, agile BI improvements, and how StarRocks' MPP lakehouse engine combined with large language models enables natural‑language analytics, real‑time performance, AI‑driven insights, and scalable enterprise deployments.

AI integration · Intelligent BI · Lakehouse

0 likes · 19 min read

Qunar Tech Salon

Jul 22, 2025 · Databases

Qunar’s Data Platform Upgrade with StarRocks: Architecture, Performance, Roadmap

This article details how Qunar’s data platform consolidated multiple analytics engines into a unified StarRocks‑based OLAP solution, covering business background, engine selection, architecture redesign, performance tuning, operational practices, and future plans for scalability and reliability.

Data Platform · Kubernetes · OLAP

0 likes · 19 min read

StarRocks

Jul 16, 2025 · Cloud Native

Build a Decoupled Storage‑Compute Data Platform with StarRocks and MinIO

This step‑by‑step tutorial shows how to deploy StarRocks and MinIO in a decoupled storage‑compute architecture using Docker Compose and Kubernetes, configure local caching, create storage volumes, load public datasets, and run SQL queries to explore the combined data.

Data Lakehouse · Decoupled Storage · Docker Compose

0 likes · 14 min read

StarRocks

Jul 9, 2025 · Big Data

How Shopee Built a Near‑Real‑Time Data Warehouse with Paimon and StarRocks

Shopee combined the Paimon data lake with StarRocks and Flink to create a quasi‑real‑time warehouse, enabling fast task diagnostics and a high‑performance financial reconciliation system while dramatically reducing storage costs and latency through innovative ODS, snapshot, and branch table techniques.

Flink · Paimon · Real-Time Data Warehouse

0 likes · 13 min read

StarRocks

Jul 1, 2025 · Big Data

How StarRocks Boosted Suixingfu’s Real‑Time Data Platform: 3× Faster Queries & 10× Faster Analytics

Suixingfu rebuilt its payment data pipeline by replacing a fragmented Lambda stack with a unified Porter CDC + StarRocks + Elasticsearch architecture, achieving three‑fold query speed, ten‑fold analytics efficiency, 20% storage reduction, and sub‑second data‑capture latency across high‑concurrency, ad‑hoc, and batch workloads.

CDC · Data Warehouse · Flink

0 likes · 14 min read

StarRocks

Jun 26, 2025 · Databases

What’s New in StarRocks 3.5? Snapshot Backup, Bulk Load, Partition & Transaction Enhancements

StarRocks 3.5 introduces a cluster‑level Snapshot backup for fast recovery, a bulk‑load optimization that reduces small files and compaction cost, smarter partition management with time‑based merging and TTL, multi‑statement transactions with full ACID guarantees, low‑cardinality dictionary support for lake tables, and several security and performance upgrades.

ACID Transactions · Data Lake · Low Cardinality Dictionary

0 likes · 17 min read

DataFunSummit

Jun 19, 2025 · Big Data

How Shopee Leverages Paimon for Real‑Time Data Warehousing and Task Diagnosis

This article details Shopee's Data Infra team's use of the Paimon data lake to build near‑real‑time warehouses, accelerate ODS layers, implement a task‑diagnosis system, and create a reconciliation platform, while sharing future plans and a Q&A session.

Data Lake · Flink · Paimon

0 likes · 12 min read

StarRocks

Jun 17, 2025 · Databases

How to Ace the StarRocks SRCA Certification: Key Topics and Study Strategies

This guide outlines the StarRocks SRCA certification exam, highlights essential study resources, breaks down the core topics such as architecture, data import/export, SQL optimization and performance tuning, and offers practical tips, mock‑exam details, and personal experience to help candidates succeed.

Database Certification · Performance Tuning · SRCA

0 likes · 9 min read

Big Data Technology & Architecture

Jun 10, 2025 · Big Data

Transforming Real‑Time Analytics: Incremental Computing with Lakehouse Architecture

This article examines how Xiaohongshu replaced its costly Lambda architecture with a real‑time lakehouse built on Iceberg, Paimon, Spark, and StarRocks, achieving minute‑level latency, higher data quality, lower resource consumption, and dramatically faster query performance.

Big Data Architecture · Iceberg · Lakehouse

0 likes · 7 min read

Alibaba Cloud Big Data AI Platform

May 21, 2025 · Big Data

How Alibaba’s A+ Traffic Analysis Achieved Sub‑Second Log Queries with StarRocks & Paimon

This article details how Alibaba's A+ traffic analysis platform tackled trillion‑row log ingestion and high‑concurrency queries by redesigning storage with Paimon, leveraging Flink for real‑time ingestion, and using StarRocks for fast lake analytics, ultimately reducing query latency from minutes to seconds.

Flink · Log Analytics · Paimon

0 likes · 15 min read

StarRocks

May 13, 2025 · Artificial Intelligence

How StarRocks MCP Server Enables LLMs to Query Databases Without Custom Plugins

StarRocks MCP Server provides a universal adapter that lets large language models like Claude, OpenAI, and Gemini execute SQL queries directly against StarRocks, simplifying data Q&A, intelligent analysis, and automated reporting by eliminating the need for bespoke plugins or complex prompt engineering.

AI agents · LLM · MCP

0 likes · 14 min read

StarRocks

May 8, 2025 · Backend Development

How Grab Supercharged Spark Observability 10× with StarRocks – Inside the Iris Architecture

Grab replaced its fragmented Grafana‑Superset stack with a StarRocks‑backed Iris platform, achieving over ten‑fold query speedups, 40% lower resource usage, and a unified real‑time and historical data store for Spark observability across its Southeast Asian super‑app ecosystem.

Data Platform · Kafka · Materialized Views

0 likes · 16 min read

Alibaba Cloud Big Data AI Platform

Apr 27, 2025 · Big Data

Scaling Property Services: StarRocks‑Powered Storage‑Compute Separation for 8000+ Communities

Facing a flood of data from over 8,000 communities, the Bifeng service team migrated from a monolithic storage‑compute architecture to a StarRocks‑based storage‑compute separation solution, achieving lower costs, higher resource utilization, faster queries, and improved SLA across their property management platform.

Big Data · Data Warehouse · Infrastructure Migration

0 likes · 11 min read

StarRocks

Apr 24, 2025 · Databases

Inside StarRocks Optimizer: Architecture, Multi‑Stage Optimization, and Advanced Features

This article provides a comprehensive technical overview of StarRocks' query optimizer, covering its evolution, core architecture, multi‑stage optimization pipeline, key optimizations such as multi‑join colocate, low‑cardinality global dictionary, MV union rewrite, and advanced mechanisms like cost‑estimation fixes, query feedback, adaptive execution, runtime filters, join‑reorder strategies, and SQL plan management.

Adaptive Execution · Cost-Based Optimization · Materialized Views

0 likes · 26 min read

StarRocks

Apr 22, 2025 · Operations

How to Build an Effective Monitoring and Alerting System for StarRocks Clusters

This guide explains how to design a comprehensive monitoring and alerting framework for StarRocks, covering resource usage, service availability, and business continuity with practical PromQL queries and troubleshooting steps.

Alerting · Performance · StarRocks

0 likes · 42 min read

Big Data Technology & Architecture

Apr 22, 2025 · Artificial Intelligence

Introduction to Retrieval‑Augmented Generation (RAG) and Vector Indexing with StarRocks and DeepSeek

This article explains the fundamentals of Retrieval‑Augmented Generation, demonstrates how to create and query vector indexes using StarRocks, shows how DeepSeek provides embeddings and answer generation, and walks through a complete end‑to‑end RAG pipeline with code examples and a web UI.

AI · DeepSeek · Embedding

0 likes · 20 min read

StarRocks

Mar 27, 2025 · Databases

How JD Logistics Boosted Query Speed and Cut Costs with StarRocks Storage‑Compute Separation

JD Logistics transformed its one‑stop self‑service analytics platform, UData, by migrating from an integrated storage‑compute architecture to a storage‑compute separated design powered by StarRocks, achieving sub‑10‑second P95/P99 query latency, reducing storage costs by 90%, and cutting compute expenses around 30% while supporting massive data volumes.

Data Platform · Kubernetes · Performance Optimization

0 likes · 20 min read

vivo Internet Technology

Mar 26, 2025 · Big Data

Reading Encrypted ORC Files in StarRocks: Architecture and Implementation Details

The article details how StarRocks extends the Apache ORC C++ library to decrypt column‑level encrypted ORC files, describing the file hierarchy, AES‑128‑CTR key handling, the query‑time master‑key retrieval, a decorator‑based decryption/decompression pipeline, and the block‑skip‑read mechanism that enables efficient predicate push‑down.

Big Data · Database · Encryption

0 likes · 19 min read

Alibaba Cloud Big Data AI Platform

Mar 20, 2025 · Big Data

How to Read and Write StarRocks Data with EMR Serverless Spark

This step‑by‑step guide explains how to use EMR Serverless Spark together with the StarRocks Spark Connector to create a workspace, upload the connector JAR, configure network connections, create databases and tables in StarRocks, and perform read/write operations via SQL sessions, Notebook sessions, or batch Spark jobs, complete with code examples and UI screenshots.

Big Data · Data Integration · EMR Serverless

0 likes · 14 min read

StarRocks

Mar 4, 2025 · Databases

How NAVER Boosted Query Performance and Scalability by Migrating from ClickHouse to StarRocks

NAVER migrated its massive analytics platform from ClickHouse to StarRocks, achieving dramatic improvements in multi‑table JOIN performance, real‑time aggregation speed, and horizontal scalability while simplifying data integration across heterogeneous sources on a Kubernetes‑based architecture.

ClickHouse · Kubernetes · Materialized Views

0 likes · 13 min read

StarRocks

Feb 27, 2025 · Big Data

How iQIYI Boosted Ad Query Performance 400% with StarRocks – A Deep Dive into OLAP Evolution

This article details iQIYI's transition from Impala+Kudu and ClickHouse to StarRocks, describing the OLAP architecture, performance gains of up to 400% in advertising workloads, the technical challenges of data consistency, lake‑warehouse fusion, operational scaling, and the step‑by‑step migration process using a dual‑run platform.

ClickHouse · Flink · OLAP

0 likes · 15 min read

Xiaohongshu Tech REDtech

Feb 20, 2025 · Big Data

How Xiaohongshu Accelerated Data Warehouse Queries with Logical Datasets & Materialized Views

Xiaohongshu tackled low reuse of APP tables, limited scalability of single-table BI datasets, and poor dashboard query performance by introducing logical datasets and materialized views, which enable query pruning, reduce data redundancy, and accelerate BI queries, achieving up to 80% latency reduction and higher hit rates.

BI · Big Data · Data Warehouse

0 likes · 25 min read

StarRocks

Feb 20, 2025 · Big Data

How RedBI Boosted Query Speed 3× with StarRocks & Iceberg Lakehouse

The article details how Xiaohongshu's RedBI self‑service analytics platform transformed its architecture by integrating StarRocks and Iceberg, replacing ClickHouse‑based storage with Parquet, introducing DataCache, Z‑Order sorting and intelligent key selection, achieving a three‑fold P90 query speed improvement, sub‑10‑second latency, and halving storage consumption.

DataCache · Iceberg · Lakehouse

0 likes · 19 min read

StarRocks

Feb 11, 2025 · Databases

How StarRocks Supercharges Vector Search: 7× Faster Queries and 1/3 Cost

This article explains the principles and practical implementation of vector retrieval in StarRocks, covering approximate nearest‑neighbor algorithms, index design, query planning, performance optimizations, real‑world case studies, and future challenges, showing how query latency dropped from 15 seconds to 2 seconds while cutting costs to a third.

ANN · HNSW · IVFPQ

0 likes · 25 min read

StarRocks

Jan 14, 2025 · Databases

How 58.com Achieved 20× Faster Real‑Time Queries by Migrating to StarRocks

58.com integrated the StarRocks analytical engine into its data‑exploration platform in place of Spark/Hive to overcome minute‑level latency. After a year of migration it achieved a query speedup of more than 20× and a success rate above 98%, resolved numerous Spark‑StarRocks compatibility issues, and moved the service to the cloud.

Big Data · SQL acceleration · Spark compatibility

0 likes · 17 min read

StarRocks

Jan 2, 2025 · Big Data

StarRocks Compute‑Storage Separation Cuts Costs 40% and Boosts Efficiency 20% at DMALL

DMALL upgraded its big‑data platform by adopting StarRocks 3.x with compute‑storage separation, lakehouse external tables, and Kubernetes deployment, achieving 20% higher compute utilization, 40% lower storage cost, faster cluster provisioning, and notable improvements in development and operations efficiency.

Big Data · Compute-Storage Separation · Kubernetes

0 likes · 25 min read

Alibaba Cloud Developer

Dec 25, 2024 · Big Data

Build a Low‑Cost, High‑Performance Game Player Profiling Platform with Alibaba Cloud EMR StarRocks

This tutorial walks you through using Alibaba Cloud EMR Serverless StarRocks and Apache Paimon to create a cost‑effective, high‑performance game player profiling and behavior analysis platform, covering data import, materialized view creation, DWD/ADS layer construction, and lakehouse integration.

Alibaba Cloud · Data Lake · Game Analytics

0 likes · 12 min read

StarRocks

Dec 25, 2024 · Databases

Cutting Costs 40% and Halving Query Latency: Our ClickHouse‑to‑StarRocks Migration

Facing high costs and scaling limits with ClickHouse, we migrated a 4000‑core, 500TB OLAP workload to StarRocks, achieving 40% cost reduction, 50% storage savings, and up to 30× query speedups through storage‑compute separation, materialized‑view rewrites, and extensive performance tuning.

ClickHouse · Materialized Views · OLAP

0 likes · 18 min read

58 Tech

Dec 19, 2024 · Big Data

Architecture Evolution and Implementation of the Intelligent Acceleration Engine in the 58 Big Data Platform

The article details the background, architectural analysis, multi‑tenant redesign, engine selection enhancements, compatibility adaptations, stability fixes, containerized deployment, performance optimizations, and measurable business outcomes of the Intelligent Acceleration Engine upgrade using Apache Kyuubi and StarRocks within the 58 big data platform.

Apache Kyuubi · Big Data · Data Architecture

0 likes · 12 min read

58 Tech

Dec 18, 2024 · Big Data

Architecture Evolution and Capability Building of the Smart Acceleration Engine in the 58 Big Data Platform

The article details the background, architectural challenges, and comprehensive redesign of the Smart Acceleration Engine—including multi‑tenant support, cross‑datacenter scheduling, enriched engine selection, parsing and forwarding enhancements, compatibility adaptations, stability fixes, containerized deployment, and performance gains—demonstrating significant operational improvements and future directions for the platform.

Apache Kyuubi · Big Data · Multi‑tenant

0 likes · 14 min read

StarRocks

Dec 2, 2024 · Big Data

How Paimon Revamps Lakehouse Management and Supercharges Queries with StarRocks

This article details Tongcheng Travel's migration from Hive/Kudu/Hudi to Paimon for lakehouse integration, highlighting a 30% resource reduction, three‑fold write speed gains, significant query acceleration via StarRocks, the end‑to‑end architecture across ODS‑DWD‑DWS‑ADS layers, and future roadmap plans.

Big Data · Flink · Lakehouse

0 likes · 18 min read

Sohu Tech Products

Nov 27, 2024 · Databases

Understanding StarRocks Materialized View Refreshes and New Optimization Parameters

This article walks through StarRocks' materialized view refresh mechanisms, explains the various refresh triggers, details the refresh workflow, and introduces new parameters that allow selective refreshing of only changed data to avoid costly full‑partition refreshes.

Optimization · Partition · SQL

0 likes · 8 min read

Open Source Tech Hub

Nov 16, 2024 · Databases

Build Real‑Time Analytics with StarRocks: Quickstart Tutorial and Sample Queries

This guide introduces StarRocks, a high‑performance MPP database, explains its architecture and typical use cases, walks through a Docker‑based quickstart, shows how to create databases and tables, load NYC crash and weather datasets via Stream Load, and demonstrates analytical SQL queries that reveal traffic‑accident patterns under different weather conditions.

Data Warehouse · Docker · MPP database

0 likes · 18 min read

Big Data Technology & Architecture

Nov 1, 2024 · Big Data

Real‑Time Lakehouse Architecture at Ximalaya Live: Leveraging Flink, Paimon, and StarRocks

This article details Ximalaya Live's transition from an offline‑centric data warehouse to a real‑time lakehouse using Flink, Paimon, and StarRocks, covering business background, architectural challenges, technology evaluation, implementation steps, encountered issues, performance gains, and future expansion plans.

Flink · Lakehouse · Paimon

0 likes · 12 min read

Shopee Tech Team

Oct 25, 2024 · Big Data

StarRocks at Shopee: Practical Use Cases and Performance Analysis

Shopee’s deployment of StarRocks across DataService, DataGo, and DataStudio demonstrates that its vectorized engine, cost‑based optimizer, and materialized‑view caching can query Hive, Iceberg, Delta Lake and Hudi up to 20,000× faster than Presto, cutting CPU usage and delivering consistently lower latency for complex analytics.

Data Lake · Hive · MPP

0 likes · 11 min read

StarRocks

Oct 16, 2024 · Big Data

How to Build a High‑Performance Lakehouse with StarRocks and Apache Hive

This guide walks through the core concepts of Apache Hive, its architecture and key features, then shows how to integrate Hive with StarRocks via the Hive Catalog, construct ODS/DWD/DWS/ADS tables, enable DataCache, use materialized views, and handle automatic partition detection for fast lakehouse analytics.

Apache Hive · Big Data · DataCache

0 likes · 17 min read

Alibaba Cloud Big Data AI Platform

Sep 13, 2024 · Big Data

How Qimao Scales 20PB Data with StarRocks, Flink, and Real‑Time Analytics

Qimao, a Shanghai‑based cultural entertainment internet firm, details its 20 PB big‑data architecture built on StarRocks, Flink, Hive, and Redis, covering data ingestion, real‑time processing, audience selection, metric anomaly drill‑down, 730‑day aggregation, and future plans for metric acceleration and full‑link data governance.

Big Data · Data Governance · Data Warehouse

0 likes · 13 min read

StarRocks

Sep 5, 2024 · Big Data

Accelerate Lakehouse Queries: A Hands‑On Guide to StarRocks + Apache Iceberg

This tutorial walks you through the fundamentals of Apache Iceberg, its architecture and key features, explains why it’s advantageous for lakehouse workloads, and provides a step‑by‑step Docker‑Compose setup to integrate Iceberg with StarRocks for fast, ACID‑compliant analytics on real‑world taxi data.

Apache Iceberg · Data Engineering · Docker

0 likes · 15 min read

StarRocks

Aug 30, 2024 · Databases

How Cloud‑Native Persistent Index Boosts StarRocks Performance 10× in Elastic Scheduling

StarRocks 3.3.1 introduces a cloud‑native persistent index that moves index files to object storage, eliminates local‑disk constraints, and supports elastic scaling, delivering up to ten‑fold latency improvement over local‑disk indexes in elastic scheduling while matching performance in batch and real‑time imports.

Cloud Native · Database · Performance

0 likes · 11 min read

StarRocks

Aug 14, 2024 · Big Data

Mastering StarRocks & Apache Paimon: A Fast‑Track Lakehouse Guide

This guide provides a comprehensive overview of Apache Paimon’s architecture, key features, and advantages, explains how to integrate it with StarRocks for real‑time lakehouse analytics, and walks through a complete quick‑start setup including component installation, Flink and Kafka deployment, data ingestion, table creation, and query execution with time‑travel support.

Apache Paimon · Data Engineering · Flink

0 likes · 18 min read

StarRocks

Aug 9, 2024 · Big Data

How Pinterest Cut Query Latency by 50% with StarRocks Migration

Pinterest migrated its Partner Insights analytics from Druid to StarRocks, achieving a 50% reduction in p90 latency, a six‑fold cost‑performance improvement, and simplified data ingestion, illustrating the benefits of a modern MPP database for real‑time ad analytics.

Analytics · MPP · Pinterest

0 likes · 6 min read

Wukong Talks Architecture

Aug 6, 2024 · Databases

Migrating Tencent Music's Data Infrastructure from ClickHouse and Druid to StarRocks: Strategy, Implementation, and Best Practices

This article details how Tencent Music’s data‑infrastructure team migrated thousands of ClickHouse and Druid nodes to a StarRocks compute‑storage‑separated lakehouse, achieving 40‑50% cost reduction while maintaining query performance, and shares the technical challenges, solutions, and best‑practice recommendations gathered during the process.

ClickHouse · Data Migration · Druid

0 likes · 19 min read

StarRocks

Aug 1, 2024 · Big Data

How Kingsoft Office Boosted Query Speed 2.3× with StarRocks 3.0

Kingsoft Office migrated its reporting platform from a multi‑engine stack to StarRocks 3.0, achieving a 48.84% performance gain, halving query latency, reducing operational costs, and improving resource utilization while supporting storage‑compute separation and seamless Trino SQL compatibility.

Big Data · StarRocks · Storage-Compute Separation

0 likes · 14 min read

Wukong Talks Architecture

Jul 23, 2024 · Databases

An Overview of StarRocks: Architecture, Features, and Performance Benchmarks

StarRocks, an open‑source, high‑performance MPP analytical database under the Linux Foundation, offers vectorized engines, CBO optimizer, materialized views, and storage‑compute separation, integrates with BI tools and data lakes, and demonstrates superior query speed in benchmark tests against ClickHouse, Druid, and Trino.

Data Lakehouse · MPP · Performance Benchmark

0 likes · 10 min read

Alibaba Cloud Big Data AI Platform

Jul 22, 2024 · Databases

Why StarRocks Is Redefining Fast Unified OLAP Analytics

StarRocks combines vectorized execution, a new cost‑based optimizer, materialized views, a real‑time storage engine, pipeline execution, and distributed joins to deliver a unified, high‑performance OLAP solution that supports both traditional and lakehouse analytics while reducing operational complexity.

CBO · Database · Lakehouse

0 likes · 14 min read

StarRocks

Jul 17, 2024 · Databases

Unlock 30% Faster Queries: StarRocks on AWS Graviton3 Performance Deep Dive

This article examines how StarRocks, a next‑generation MPP database, leverages AWS Graviton3 instances to achieve over 30% query speed improvement and 15% cost reduction compared with x86 C6i instances, detailing benchmark methodology, hardware specs, SIMD optimizations, and real‑world OLAP results.

AWS Graviton3 · MPP database · Performance Benchmark

0 likes · 11 min read

Sohu Tech Products

Jul 10, 2024 · Industry Insights

How StarRocks and Apache Paimon Transform Data Lake Analytics and Migration

This article provides a practical deep‑dive into StarRocks and Apache Paimon, covering data‑lake fundamentals, the technical advantages of both platforms, performance gains over traditional engines, step‑by‑step migration strategies, deployment options on Alibaba Cloud EMR, and future roadmap plans.

Apache Paimon · Data Lake · Query Optimization

0 likes · 15 min read

DataFunTalk

Jul 6, 2024 · Big Data

StarRocks and Paimon Data Lake Capabilities, Migration Solutions, and Future Roadmap

This article presents a practical overview of StarRocks and Apache Paimon data‑lake capabilities, explains their performance advantages, details migration strategies from Trino/Presto and other engines, describes cluster‑to‑cluster migration, and outlines future roadmap for integration and optimization.

Big Data · Cloud Computing · Data Lake

0 likes · 13 min read

DeWu Technology

Jul 5, 2024 · Databases

StarRocks 2.5.13 Cross-Cluster Upgrade and Data Migration Practices

The article outlines a cross‑cluster upgrade to StarRocks 2.5.13, evaluating resource and stability costs, and presents two migration schemes—using external tables and a Flink connector—along with planning, parallel execution, validation steps, and results showing successful migration of over 10 TB at 2 Gb/s across ten nodes, while noting future automation and CDC enhancements.

Data Migration · External Table · Flink

0 likes · 15 min read

StarRocks

Jul 2, 2024 · Big Data

What’s New in StarRocks 3.3? Deep Dive into Lakehouse‑Optimized Performance and Features

StarRocks 3.3 introduces a comprehensive set of enhancements—including maturity levels, ARM‑optimized performance, advanced caching, materialized‑view rewrites, storage optimizations, and expanded lakehouse ecosystem support—that together boost stability, query speed, and usability for large‑scale analytics workloads.

Big Data · Cache Optimization · Lakehouse

0 likes · 15 min read

DataFunSummit

Jul 1, 2024 · Big Data

Optimizing JD Retail Data Architecture: From Lambda to Real‑time Unified Processing with Flink, Hudi, and StarRocks

This article details JD Retail's transition from a complex Lambda architecture to a unified real‑time data pipeline using Flink, Hudi, and StarRocks, addressing data completeness versus latency, reducing maintenance costs, improving storage efficiency, and delivering faster, more consistent analytics for business users.

Data Warehouse · Flink · Hudi

0 likes · 13 min read

StarRocks

Jun 18, 2024 · Databases

How StarRocks Compaction Boosts Query Performance: Mechanics, Tuning, and Best Practices

This article explains StarRocks' compaction process that merges multiple data versions into larger files to reduce I/O, details the scheduler and executor roles, shows how to monitor and control compaction via SQL commands, and provides tuning parameters and best‑practice recommendations for optimal performance.

Compaction · Data Management · Performance Tuning

0 likes · 21 min read

StarRocks

Jun 6, 2024 · Big Data

Why StarRocks Beats Trino: A Deep Technical Comparison

This article provides a detailed technical comparison between StarRocks and Trino, covering their shared MPP architecture, cost‑based optimizer, pipeline execution, ANSI SQL support, differences in vectorized execution, materialized view capabilities, caching systems, data source connectors, benchmark results, high‑availability designs, join algorithms, and real‑world user case studies.

Big Data · Cache · MPP

0 likes · 20 min read

Alibaba Cloud Big Data AI Platform

Jun 6, 2024 · Databases

How StarRocks Redefines Lakehouse Architecture with Ultra-Fast Unified Analytics

StarRocks combines extreme query speed and a unified architecture to deliver a lakehouse solution that separates storage and compute, supports multi‑warehouse resource isolation, offers Trino compatibility, materialized‑view acceleration, and cost‑effective scaling, making it suitable for real‑time analytics, data‑lake queries, and traditional OLAP workloads.

Big Data · Lakehouse · StarRocks

0 likes · 23 min read

StarRocks

May 22, 2024 · Big Data

Unlocking Data Lake Power: Iceberg Architecture & StarRocks Acceleration

Apache Iceberg offers a modern, ACID‑compliant table format for data lakes with features like hidden partitions and schema evolution, while StarRocks provides high‑performance query acceleration, metadata caching, and distributed planning to address Iceberg’s latency challenges, enabling seamless lake‑warehouse integration and real‑time analytics.

Apache Iceberg · Data Lake · Metadata Caching

0 likes · 19 min read
