Big Data Technology & Architecture
Apr 8, 2020 · Big Data
Spark Job Execution Principles and Parameter Tuning for Hive on Spark
This article explains how Spark jobs run on YARN, describes how stages, shuffles, and task parallelism affect execution, and gives detailed recommendations for tuning Spark executor, memory, core, and parallelism settings to improve Hive‑on‑Spark performance on the TPCx‑BB benchmark with large datasets.
Big Data · Hive · Parameter Tuning
