Big Data Technology Architecture
Aug 8, 2020 · Big Data
Performance Comparison of SparkR with Vectorized Execution Using Apache Arrow
This article compares SparkR's performance with that of the native Spark APIs, shows the slowdown caused by serializing data between the JVM and R, and demonstrates how enabling Apache Arrow's vectorized execution in Spark 3.0 can speed up SparkR operations by as much as tens of times.
Apache Arrow · SparkR · Vectorized Execution