Cloud Native 3 min read

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native platforms, analyzes how cloud storage cost models affect performance optimization, and presents a case study of Uber's Presto deployment that reveals fragmented access patterns and new I/O cost considerations.

DataFunSummit
DataFunSummit
DataFunSummit
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This article explores the widespread industry trend of migrating data‑intensive analytics applications from on‑premises to cloud‑native environments, highlighting that the unique cost model of cloud storage demands a more detailed understanding of performance optimization.

Through empirical observation of Uber's production Presto workload, the study reveals that traditional I/O optimizations often ignore the financial cost of storage API calls, which can lead to unexpectedly high expenses in cloud settings.

The research shows that over 50% of data accesses are smaller than 10 KB and more than 90% are under 1 MB, indicating a highly fragmented access pattern that has different implications for cloud platforms compared to traditional data systems.

Based on this case study, the paper provides logical I/O optimization strategies tailored for cloud environments, aiming to help readers design efficient I/O solutions that significantly improve cost‑performance ratios for data‑intensive applications.

Readers will gain a new perspective on system design in the cloud computing domain and practical guidance for addressing the rapid growth of data‑intensive workloads.

Case Studycloud nativeBig Dataprestoio-optimizationcost model
DataFunSummit
Written by

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.