Big Data Technology & Architecture
Jul 28, 2022 · Big Data
Spark SQL UNION Causing driver.maxResultSize Error and Its Resolution
When executing a Spark SQL query with dozens of UNION subqueries that each contain JOIN operations on Spark 3.1.2, the job fails because the total serialized result size of the tasks exceeds the driver’s maxResultSize limit, and the issue can be resolved by reducing the initial partition number used by Adaptive Query Execution.
DriverMaxResultSizePerformanceTuningSQL
0 likes · 10 min read
