Big Data Technology & Architecture
Big Data Technology & Architecture
Oct 23, 2020 · Big Data

Overview of Real-Time Big Data Processing: Spark Structured Streaming, CarbonData, Flink, and Cloud Stream

This article provides a comprehensive overview of modern real‑time big‑data solutions, detailing Spark Structured Streaming capabilities, CarbonData’s storage architecture, Meituan’s Flink deployments, and Huawei Cloud Stream’s unified streaming service, highlighting their features, challenges, and future directions.

CarbonDataFlinkReal-time analytics
0 likes · 17 min read
Overview of Real-Time Big Data Processing: Spark Structured Streaming, CarbonData, Flink, and Cloud Stream
Hulu Beijing
Hulu Beijing
Dec 20, 2016 · Big Data

How Hulu Supercharges OLAP Queries with CarbonData: Real‑World Optimizations

This article describes Hulu’s real‑world OLAP query optimization, covering the fundamentals of OLAP, comparisons of row‑ and column‑based storage formats, detailed indexing mechanisms of Parquet, ORC and CarbonData, and the specific schema, shuffle, block size, speculation and GC tuning techniques that enabled CarbonData to dramatically accelerate wide‑table queries on SparkSQL.

Big DataCarbonDataColumnar Storage
0 likes · 17 min read
How Hulu Supercharges OLAP Queries with CarbonData: Real‑World Optimizations
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Jul 14, 2016 · Big Data

What Makes Huawei’s CarbonData a Game-Changer for Big Data Analytics?

Huawei’s CarbonData, now an Apache incubator project, is a lightweight, low‑latency columnar storage format that separates storage and compute, offering multi‑dimensional analytics, high compression, and seamless integration with Spark and Hadoop, while addressing the limitations of traditional NoSQL, search engines, and SQL‑on‑Hadoop solutions.

Apache IncubatorCarbonDataOLAP
0 likes · 14 min read
What Makes Huawei’s CarbonData a Game-Changer for Big Data Analytics?