dbaplus Community
Jun 1, 2021 · Big Data
How Didi Boosted SQL Performance by 40%: Migrating 10k Hive Jobs to Spark
Didi migrated over 10,000 Hive SQL tasks to Spark SQL, achieving 85% Spark task share, cutting execution time by 40%, and reducing CPU and memory usage by 21% and 49% respectively, through a systematic migration process that addressed syntax, UDF, performance, and functional differences between the two engines.
Big DataHivePerformance Optimization
0 likes · 20 min read
