Tag

SQL on Hadoop


DataFunTalk
Mar 10, 2021 · Big Data

Hive MetaStore Challenges and Optimizations at Kuaishou

At Kuaishou, the Hive MetaStore service, which stores Hive's metadata, ran into scalability and performance problems caused by massive numbers of dynamic partitions and high query volume. The team responded with a series of architectural optimizations, including read-write separation, API enhancements, traffic control, and federation, to improve stability and efficiency.

Hive · Kuaishou · MetaStore
15 min read
Art of Distributed System Architecture Design
Jul 12, 2015 · Big Data

Architectural Overview and Optimization Techniques for SQL‑on‑Hadoop Systems

This article analyzes SQL-on-Hadoop architectures, comparing engines built on a runtime framework, such as Hive, with MPP-style engines such as Impala. It covers core components, compilation pipelines, optimizer strategies, CPU and I/O performance techniques, columnar storage formats, and resource management in modern big-data query platforms.

Query Engine · SQL on Hadoop · big data
22 min read