Big Data 10 min read

Building a Unified High‑Speed Analytics Platform with StarRocks at Cross‑Express

Cross‑Express consolidated multiple big‑data engines into a unified, high‑performance analytics platform using StarRocks, achieving millisecond‑level query latency, real‑time data warehousing, significant cost savings, and improved multi‑scenario business applications; the initiative also simplified BI development, reduced hardware requirements, and set a roadmap for future engine enhancements.

DataFunTalk
DataFunTalk
DataFunTalk
Building a Unified High‑Speed Analytics Platform with StarRocks at Cross‑Express

Introduction: Cross‑Express, a large modern logistics company, faces massive data volume and real‑time requirements, with over 5 million daily tickets and more than 10 million API calls, demanding sub‑second response times.

Platform evolution: Initially used MySQL, then added Presto, ES, Impala+Kudu, TiDB, and ClickHouse, leading to multiple engines and complex BI development.

Engine selection: In 2021 the team benchmarked several engines on identical hardware and data, finding StarRocks superior in query performance and real‑time updates.

Engine convergence: The team retained ES for detailed queries and StarRocks as the primary engine, citing AP query performance, MySQL compatibility, primary‑key updates, federated external tables, low maintenance, and integrated data loading tools.

Benefits: Query latency dropped to milliseconds, daily API calls exceed 6 million, BI development time reduced, ETL eliminated, overall performance up 20%, and hardware usage cut from 15 nodes to 4.

Real‑time data warehouse: Replaced a 2‑hour batch pipeline with a Flink‑StarRocks pipeline, achieving sub‑5‑second end‑to‑end latency and 300 % faster queries compared with Presto; the primary‑key model improved performance another 200 %.

Multi‑scenario applications: Core business cost‑optimization uses StarRocks to compute optimal routes, reducing per‑order calculation from 90 seconds to 4 seconds and increasing throughput by 400 %.

Summary and outlook: StarRocks enabled convergence of five‑six legacy engines, delivering real‑time analytics, cost reduction, and plans to explore resource isolation, near‑real‑time solutions, and version unification.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

StarRocksOLAPreal-time data warehouse
DataFunTalk
Written by

DataFunTalk

Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.