Tagged articles
5 articles
Page 1 of 1
DataFunTalk
DataFunTalk
Mar 10, 2021 · Big Data

Hive MetaStore Challenges and Optimizations at Kuaishou

At Kuaishou, the Hive MetaStore service, which stores metadata for Hive, faced scalability and performance challenges due to massive dynamic partitions and high query volume, leading to a series of architectural optimizations—including read‑write separation, API enhancements, traffic control, and federation—to improve stability and efficiency.

Big DataHiveKuaishou
0 likes · 15 min read
Hive MetaStore Challenges and Optimizations at Kuaishou
Big Data Technology & Architecture
Big Data Technology & Architecture
Jun 7, 2020 · Big Data

A Unified View of SQL‑on‑Hadoop Systems: Architecture, Execution Plans, Optimizations, and Storage Formats

The article provides a comprehensive overview of SQL‑on‑Hadoop query engines such as Hive, Impala, Presto and Spark SQL, comparing their runtime frameworks, core components, compilation steps, optimizer strategies, CPU/IO efficiency techniques, storage formats like ORC and Parquet, and resource management in a unified perspective.

Big DataQuery EngineSQL on Hadoop
0 likes · 24 min read
A Unified View of SQL‑on‑Hadoop Systems: Architecture, Execution Plans, Optimizations, and Storage Formats
dbaplus Community
dbaplus Community
Jul 10, 2019 · Big Data

How Kuaishou Scales SQL on Hadoop: Architecture, Optimizations, and Lessons Learned

This article explains the SQL‑on‑Hadoop ecosystem—including Hive, Spark, SparkSQL, Presto and other solutions—then details Kuaishou's large‑scale platform architecture, performance bottlenecks, routing logic, high‑availability mechanisms, and a series of concrete optimizations that improve query speed, resource utilization, and operational stability.

HiveSQL on HadoopSpark
0 likes · 19 min read
How Kuaishou Scales SQL on Hadoop: Architecture, Optimizations, and Lessons Learned
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Nov 1, 2016 · Big Data

Will SQL on Hadoop Replace Hybrid Architectures? Key Big Data Trends Unveiled

The article analyzes four major big‑data evolution trends—SQL on Hadoop overtaking hybrid architectures, SSDs becoming cache in Hadoop clusters, the rise of real‑time analytics, and the convergence of cloud computing with big data—while presenting supporting data, predictions, and architectural diagrams.

Big DataReal-time analyticsSQL on Hadoop
0 likes · 15 min read
Will SQL on Hadoop Replace Hybrid Architectures? Key Big Data Trends Unveiled

Architectural Overview and Optimization Techniques for SQL‑on‑Hadoop Systems

This article provides a comprehensive analysis of SQL‑on‑Hadoop architectures, comparing runtime‑framework‑based engines like Hive with MPP‑style engines such as Impala, detailing core components, compilation pipelines, optimizer strategies, CPU/IO performance tricks, columnar storage formats, and resource management in modern big‑data query platforms.

Columnar StorageQuery EngineSQL on Hadoop
0 likes · 22 min read
Architectural Overview and Optimization Techniques for SQL‑on‑Hadoop Systems