Tagged articles
31 articles
Page 1 of 1
DataFunSummit
DataFunSummit
Aug 11, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native platforms, revealing how cloud storage cost models affect performance optimization, and presents case‑study findings from Uber’s Presto production environment that highlight fragmented I/O patterns and the financial impact of storage API calls.

Cost ModelI/O optimizationPresto
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto
DataFunTalk
DataFunTalk
Aug 11, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native environments, revealing how cloud storage cost models affect I/O optimization, and presents Uber Presto case‑study findings that highlight fragmented access patterns and financial implications of storage API calls.

Cost ModelI/O optimizationPresto
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto
DataFunSummit
DataFunSummit
Aug 10, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper investigates the migration of data‑intensive analytics to cloud‑native environments, using Uber’s Presto workload to expose how cloud storage cost models and fragmented I/O patterns affect performance, and proposes optimized I/O strategies to improve cost‑effectiveness and system design.

Cloud NativeCost ModelI/O optimization
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto
DataFunTalk
DataFunTalk
Aug 10, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native environments, analyzes the unique cost model of cloud storage, and presents case‑study findings from Uber’s Presto production system that reveal fragmented I/O patterns and propose optimization strategies to improve cost‑performance in the cloud.

Cost ModelI/O optimization
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto
DataFunTalk
DataFunTalk
Aug 2, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber’s Presto Deployment

This whitepaper examines the trend of moving data‑intensive analytics workloads to cloud‑native platforms, revealing how cloud storage cost models affect I/O optimization, and using Uber’s Presto production data to show that traditional I/O strategies overlook costly storage API calls, leading to high expenses.

Case StudyCost ModelI/O optimization
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber’s Presto Deployment
DataFunSummit
DataFunSummit
Jul 27, 2024 · Cloud Native

Migrating Data‑Intensive Analytics to Cloud‑Native Environments: Cost‑Aware I/O Optimization Insights from Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native platforms, revealing how cloud storage’s unique cost model demands finer‑grained I/O optimization, illustrated through an empirical case study of Uber’s Presto production environment and its fragmented access patterns.

Case StudyCost ModelData Analytics
0 likes · 3 min read
Migrating Data‑Intensive Analytics to Cloud‑Native Environments: Cost‑Aware I/O Optimization Insights from Uber Presto
DataFunSummit
DataFunSummit
Jul 26, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: A Case Study of Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native platforms, analyzing how cloud storage cost models affect performance optimization, and presents a case study of Uber’s Presto production environment that reveals fragmented I/O patterns and the financial impact of storage API calls.

Cost ModelI/O optimizationPresto
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: A Case Study of Uber Presto
DataFunTalk
DataFunTalk
Jul 26, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native environments, analyzes how cloud storage cost models affect performance optimization, and presents Uber Presto case‑study findings that reveal fragmented access patterns and hidden financial costs of traditional I/O strategies.

Case StudyCost ModelI/O optimization
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto
DataFunTalk
DataFunTalk
Jul 21, 2024 · Cloud Native

Optimizing I/O for Data-Intensive Applications in Cloud-Native Environments: Insights from Uber Presto

This whitepaper examines the industry shift of moving data‑intensive analytics to cloud‑native platforms, revealing how cloud storage’s unique cost model demands nuanced I/O optimization, and presents Uber Presto case‑study findings that highlight fragmented access patterns and cost‑effective design strategies for high‑performance cloud workloads.

Cost ModelI/O optimizationcloud-native
0 likes · 3 min read
Optimizing I/O for Data-Intensive Applications in Cloud-Native Environments: Insights from Uber Presto
DataFunSummit
DataFunSummit
Jun 22, 2024 · Cloud Native

Optimizing I/O for Data-Intensive Analytics in Cloud-Native Environments: Insights from Uber Presto

This whitepaper examines the industry trend of migrating data‑intensive analytics workloads to cloud‑native environments, revealing how cloud storage’s unique cost model demands finer‑grained performance optimization, and presents Uber Presto case‑study findings that expose fragmented I/O patterns and associated financial impacts.

Cloud NativeCost ModelData Analytics
0 likes · 3 min read
Optimizing I/O for Data-Intensive Analytics in Cloud-Native Environments: Insights from Uber Presto
DataFunTalk
DataFunTalk
Jun 9, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native platforms, analyzes how cloud storage cost models affect performance optimization, and presents Uber Presto case‑study findings that reveal fragmented access patterns and new I/O strategies to improve cost‑effectiveness.

Case StudyCost ModelI/O optimization
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto
DataFunSummit
DataFunSummit
Jun 8, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper examines the industry shift of moving data‑intensive analytics to cloud‑native platforms, analyzes how cloud storage cost models affect performance optimization, and presents Uber Presto case‑study findings that reveal fragmented access patterns and the financial impact of traditional I/O strategies in the cloud.

Cloud NativeCost ModelData Analytics
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto
DataFunSummit
DataFunSummit
Jun 6, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native platforms, analyzes the unique cost model of cloud storage, and presents case‑study findings from Uber's Presto production environment to guide efficient I/O design and cost‑effective performance optimization.

Cost ModelI/O optimizationPresto
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto
DataFunSummit
DataFunSummit
Jun 5, 2024 · Cloud Native

Migrating Data‑Intensive Analytics to Cloud‑Native Environments: Cost‑Aware I/O Optimization Insights from Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native platforms, revealing how cloud storage cost models affect performance optimization and presenting case‑study‑based I/O strategies derived from Uber's Presto production environment.

Case StudyCost ModelI/O optimization
0 likes · 3 min read
Migrating Data‑Intensive Analytics to Cloud‑Native Environments: Cost‑Aware I/O Optimization Insights from Uber Presto
JD Retail Technology
JD Retail Technology
May 8, 2024 · Databases

Understanding MySQL Cost Model for Index Optimization and Conflict Resolution

This article explains MySQL's cost‑based optimizer, demonstrates how to calculate query costs for full‑table, covering, ref and range scans using actual source‑code constants, and applies the model to resolve index‑conflict cases in a store‑goods table, offering practical optimization guidelines and future tool ideas.

Cost ModelDatabase PerformanceIndex Optimization
0 likes · 22 min read
Understanding MySQL Cost Model for Index Optimization and Conflict Resolution
Meituan Technology Team
Meituan Technology Team
Apr 6, 2023 · Databases

AI-Driven Index Recommendation for Slow Queries at Meituan

This article details a joint research effort between Meituan and East China Normal University that combines cost‑based methods with AI‑driven, data‑centric models to automatically generate and evaluate missing indexes for billions of daily slow queries, improving recommendation accuracy and query performance.

AICost ModelIndex Recommendation
0 likes · 16 min read
AI-Driven Index Recommendation for Slow Queries at Meituan
Baidu Geek Talk
Baidu Geek Talk
Mar 21, 2023 · Artificial Intelligence

Infrastructure Challenges and Solutions for Large‑Scale AI Model Training

The article explains how the massive compute and storage demands of today’s large language models create a “compute wall” and “storage wall,” and describes Baidu Intelligent Cloud’s four‑layer full‑stack infrastructure—combining advanced parallelism techniques, optimized GPU networking, static‑graph compilation, and cost‑model‑driven placement—to train trillion‑parameter models efficiently.

AI InfrastructureCost ModelDistributed Training
0 likes · 27 min read
Infrastructure Challenges and Solutions for Large‑Scale AI Model Training
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Feb 23, 2023 · Artificial Intelligence

How Baidu’s Cloud Infrastructure Tackles the Challenges of Training Massive AI Models

This article explains how Baidu's intelligent cloud overcomes the compute and storage walls of large‑scale model training by combining hardware design, network topology, and software optimizations such as pipeline, tensor, and expert parallelism, cost‑model‑driven placement, and future‑proof AI infrastructure evolution.

AI InfrastructureBaidu CloudCost Model
0 likes · 28 min read
How Baidu’s Cloud Infrastructure Tackles the Challenges of Training Massive AI Models
Aikesheng Open Source Community
Aikesheng Open Source Community
Nov 30, 2022 · Databases

Understanding Index Condition Pushdown (ICP) in MySQL and Its Impact on Query Cost

Index Condition Pushdown (ICP) in MySQL pushes filter conditions to the storage engine to reduce row lookups, but this article demonstrates through experiments that while ICP lowers runtime by decreasing back‑table accesses, the optimizer’s cost model often ignores its benefits, leading to suboptimal plan choices.

Cost Modeldatabaseindex condition pushdown
0 likes · 10 min read
Understanding Index Condition Pushdown (ICP) in MySQL and Its Impact on Query Cost
StarRocks
StarRocks
Nov 2, 2022 · Databases

Mastering Join Optimization in StarRocks: Techniques, Algorithms, and Distributed Planning

This article provides a comprehensive, step‑by‑step guide to StarRocks join optimization, covering join types, logical rewrite rules, predicate push‑down, join reorder algorithms, cost modeling, distributed join strategies, and runtime filters, while offering practical tips for achieving high‑performance query execution.

Cost ModelDistributed SQLJOIN optimization
0 likes · 26 min read
Mastering Join Optimization in StarRocks: Techniques, Algorithms, and Distributed Planning
dbaplus Community
dbaplus Community
Oct 31, 2022 · Databases

Why MySQL Picks the Wrong Index for ORDER BY LIMIT Queries and How to Fix It

This article investigates a recurring MySQL CPU‑100% alarm caused by a slow SELECT with ORDER BY id ASC LIMIT, explains why the optimizer mistakenly chooses the primary‑key index over a suitable composite index, and presents six practical experiments—including force‑index, ORDER BY tweaks, and LIMIT adjustments—that reliably restore optimal index usage.

Cost ModelIndex OptimizationQuery Optimizer
0 likes · 14 min read
Why MySQL Picks the Wrong Index for ORDER BY LIMIT Queries and How to Fix It
Meituan Technology Team
Meituan Technology Team
Apr 21, 2022 · Databases

Meituan's Cost-Based Optimizer for Slow Query Index Recommendation

The article explains how Meituan uses MySQL's cost‑based optimizer to analyze slow queries, generate virtual index candidates, evaluate their costs with detailed statistics, and deploy a recommendation system that validates, tracks, and governs index suggestions to reduce CPU/IO waste and prevent database failures.

Cost ModelDatabase OptimizationFakeindex
0 likes · 22 min read
Meituan's Cost-Based Optimizer for Slow Query Index Recommendation
Tencent Qidian Tech Team
Tencent Qidian Tech Team
Nov 24, 2021 · Databases

Why MySQL Picks index_author_id Over index_title: Execution & Cost Insights

This article explains MySQL's architecture, the server and storage‑engine layers, query execution phases, status states, query cache behavior, the optimizer's parsing and planning steps, EXPLAIN output fields, the cost model, and why the optimizer selects index_author_id instead of index_title for a given query.

Cost ModelIndex SelectionStorage Engine
0 likes · 14 min read
Why MySQL Picks index_author_id Over index_title: Execution & Cost Insights
Tencent Database Technology
Tencent Database Technology
May 31, 2021 · Databases

TXSQL Query Optimizer Framework: Transformation, Join Reorder, and Cost Model

This article introduces the TXSQL query optimizer built on MySQL 8.0.22, detailing its cascades‑style framework, transformation rewrite rules such as outer‑join elimination and subquery flattening, join‑order heuristics, cost‑model configuration, and execution strategies, providing a comprehensive overview of its design and enhancements.

Cost ModelJoin ReorderQuery Optimizer
0 likes · 25 min read
TXSQL Query Optimizer Framework: Transformation, Join Reorder, and Cost Model
AntTech
AntTech
May 29, 2019 · Databases

OceanBase Query Optimizer: Challenges, Techniques, and Engineering Practices

This article examines the core challenges of query optimization in relational databases—accurate statistics, massive plan spaces, and efficient plan management—and explains how OceanBase addresses them through logical/physical row concepts, real‑time statistics, distributed two‑stage planning, adaptive caching, and plan evolution mechanisms.

Cost ModelLSM-TreeOceanBase
0 likes · 15 min read
OceanBase Query Optimizer: Challenges, Techniques, and Engineering Practices
dbaplus Community
dbaplus Community
Nov 12, 2018 · Databases

Unlocking MySQL 8.0 Optimizer: Cost Model Configuration and Histogram Usage

This article explains how MySQL 8.0 improves the optimizer by introducing configurable cost model constants and a histogram feature, showing how to query and update system tables, use ANALYZE TABLE to create and manage histograms, and explore the underlying code structures and future tuning possibilities.

Cost ModelHistogramdatabase
0 likes · 19 min read
Unlocking MySQL 8.0 Optimizer: Cost Model Configuration and Histogram Usage
dbaplus Community
dbaplus Community
Nov 18, 2015 · Databases

Demystifying DB2 Optimizer: How Cost Models Shape Query Performance

This article explains the inner workings of the DB2 optimizer, its four-step processing flow, cost‑based decision making, and detailed examples comparing full‑table and index scans, followed by practical tuning tips and a Q&A session for real‑world query optimization.

Cost ModelDB2Database Performance
0 likes · 19 min read
Demystifying DB2 Optimizer: How Cost Models Shape Query Performance