Tagged articles
407 articles
Page 2 of 5
Sohu Tech Products
Sohu Tech Products
Jul 10, 2024 · Industry Insights

How StarRocks and Apache Paimon Transform Data Lake Analytics and Migration

This article provides a practical deep‑dive into StarRocks and Apache Paimon, covering data‑lake fundamentals, the technical advantages of both platforms, performance gains over traditional engines, step‑by‑step migration strategies, deployment options on Alibaba Cloud EMR, and future roadmap plans.

Apache PaimonData LakeReal-time analytics
0 likes · 15 min read
How StarRocks and Apache Paimon Transform Data Lake Analytics and Migration
Java Tech Enthusiast
Java Tech Enthusiast
Jul 7, 2024 · Databases

MySQL Query Optimization and Performance Tuning Guide

This guide presents a thorough MySQL performance checklist, covering efficient SQL writing, data‑type choices, selective column retrieval, smart joins, batch inserts, and query‑cache settings; it details key InnoDB and server variable tuning, proper index creation and usage, common pitfalls, and how to analyze and improve queries with EXPLAIN.

databasemysqlquery optimization
0 likes · 17 min read
MySQL Query Optimization and Performance Tuning Guide
Tencent Cloud Developer
Tencent Cloud Developer
Jul 5, 2024 · Databases

Database Systems Reading List: Fundamentals, Design, Engines, and Optimizations

This curated bibliography compiles seminal database‑system papers—spanning foundational relational theory, system design for RDBMS and NoSQL, SQL engine optimization, storage structures, transaction and scheduling mechanisms, plus workload, networking, quality, and tuning research—each entry listing title, authors, year and a PDF link for scholars and practitioners.

Storage Enginedatabase systemsquery optimization
0 likes · 25 min read
Database Systems Reading List: Fundamentals, Design, Engines, and Optimizations
Python Programming Learning Circle
Python Programming Learning Circle
Jun 11, 2024 · Databases

SQL Best Practices: Avoid Common Mistakes and Write Efficient Queries

This article presents a reverse‑analysis of common SQL pitfalls—such as ambiguous column names, selecting all columns, unnecessary joins, missing WHERE clauses, lack of ORDER BY, unclear aliases, absent error handling, redundant subqueries, and missing comments—and provides clear "Don’t" and "Do" examples with code to help you write clean, optimized, and reliable SQL statements.

Error Handlingbest practicesquery optimization
0 likes · 7 min read
SQL Best Practices: Avoid Common Mistakes and Write Efficient Queries
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Jun 3, 2024 · Databases

Master OpenGemini: From Schema Design to Performance Tuning Best Practices

This article summarizes a live session where Shawn explains how understanding business scenarios drives effective OpenGemini database design, and provides comprehensive best‑practice guidance on library and table design, data ingestion, query optimization, and performance tuning for time‑series workloads.

Time Series Databasedata ingestionopenGemini
0 likes · 7 min read
Master OpenGemini: From Schema Design to Performance Tuning Best Practices
Top Architect
Top Architect
May 24, 2024 · Databases

How to Perform Fuzzy Search on Encrypted Data

This article examines various techniques for enabling fuzzy queries on encrypted fields, comparing naive memory‑based methods, conventional token‑based approaches that leverage database indexes, and advanced cryptographic schemes, and recommends practical solutions for real‑world applications.

database encryptionencrypted datafuzzy-search
0 likes · 12 min read
How to Perform Fuzzy Search on Encrypted Data
Lobster Programming
Lobster Programming
Apr 28, 2024 · Databases

How to Solve MySQL Deep Pagination: Causes and 6 Effective Optimization Strategies

Deep pagination in MySQL becomes a performance bottleneck when tables reach millions of rows, causing excessive row scans and costly index lookups; this article explains its root causes and presents six practical optimization techniques—including keyset pagination, delayed joins, covering indexes, data partitioning, and external search engines—to dramatically improve query speed.

Keyset Paginationcovering indexdeep pagination
0 likes · 7 min read
How to Solve MySQL Deep Pagination: Causes and 6 Effective Optimization Strategies
StarRocks
StarRocks
Apr 18, 2024 · Databases

Master StarRocks Deployment and Optimization: From Capacity Planning to Query Tuning

This comprehensive guide walks you through StarRocks production deployment, covering capacity planning, hardware specs, environment setup, table modeling, partitioning, bucketing, index selection, data import best practices, query optimization, monitoring, and resource‑group configuration, all backed by concrete SQL examples and configuration commands.

DeploymentModelingStarRocks
0 likes · 20 min read
Master StarRocks Deployment and Optimization: From Capacity Planning to Query Tuning
ITPUB
ITPUB
Apr 11, 2024 · Big Data

Query 100K Items from 10M+ Records: CK, ES Scroll, HBase, RediSearch

When faced with a business requirement to filter up to 100 000 records from a pool of tens of millions and then sort and de‑duplicate them, this article explores four technical solutions—multithreaded ClickHouse pagination, Elasticsearch scroll‑scan, a combined Elasticsearch‑HBase approach, and RediSearch with RedisJSON—detailing their design, implementation, performance testing, and trade‑offs.

Big DataClickHouseElasticsearch
0 likes · 12 min read
Query 100K Items from 10M+ Records: CK, ES Scroll, HBase, RediSearch
dbaplus Community
dbaplus Community
Apr 10, 2024 · Databases

How MySQL Chooses the Cheapest Index for COUNT(*) and When It Gets It Wrong

This article examines whether SELECT COUNT(*) causes full‑table scans, explains MySQL’s optimizer cost‑based index selection (including IO and CPU costs), demonstrates with a 100k‑row table how auxiliary indexes are chosen, and shows cases where the optimizer’s estimates mislead performance.

Index CostOptimizer_trace_count
0 likes · 9 min read
How MySQL Chooses the Cheapest Index for COUNT(*) and When It Gets It Wrong
Baidu Geek Talk
Baidu Geek Talk
Apr 10, 2024 · Big Data

TDA: A One‑Stop Self‑Service BI Platform – Architecture, Challenges, and Solutions

The article presents Turing Data Analysis (TDA), a self‑service BI platform that replaces fragile traditional pipelines with a unified DWD‑based data model, drag‑and‑drop analytics, multi‑engine query optimization and caching, delivering sub‑10‑second queries on billions of rows, fine‑grained permissions, and rapid dashboard creation, while reporting significant usage growth and outlining AI‑driven future enhancements.

BIBig DataData Platform
0 likes · 15 min read
TDA: A One‑Stop Self‑Service BI Platform – Architecture, Challenges, and Solutions
DataFunTalk
DataFunTalk
Mar 16, 2024 · Big Data

Performance Optimization Practices for KwaiBI Big Data Analysis Platform

This article introduces KwaiBI, the internal data analysis product of Kuaishou, outlines its five major functional areas, details the performance challenges of large‑scale analytics, and presents a comprehensive set of optimization techniques—including cache warming, query rewriting, materialized acceleration, and the Bleem lake‑house engine—along with future directions and a brief Q&A.

Big DataData AnalyticsKwaiBI
0 likes · 15 min read
Performance Optimization Practices for KwaiBI Big Data Analysis Platform
Java Tech Enthusiast
Java Tech Enthusiast
Mar 10, 2024 · Databases

Why IN and NOT IN Should Be Avoided in SQL Queries

Because IN and NOT IN often bypass indexes on large tables, can yield incorrect results when column names differ or NULL values are present, and may cause severe performance penalties, they should be replaced with EXISTS/NOT EXISTS subqueries or appropriate JOINs, reserving IN only for small literal lists.

IN clauseNOT INSQL
0 likes · 4 min read
Why IN and NOT IN Should Be Avoided in SQL Queries
MaGe Linux Operations
MaGe Linux Operations
Feb 12, 2024 · Databases

Why Hash Join Beats Nested Loop Join and When It Fails

This article explains why hash joins usually outperform nested‑loop joins, how to force hash joins in SQL, the data‑type restrictions that prevent hash joins, and practical tips for optimizing join performance in TD and Oracle compatibility modes.

Data TypesHash JoinNested Loop
0 likes · 6 min read
Why Hash Join Beats Nested Loop Join and When It Fails
Sohu Tech Products
Sohu Tech Products
Jan 24, 2024 · Databases

Optimizing Database Expression Evaluation with JIT Technology Using Gandiva

The article explains how database expression evaluation—especially in WHERE and SELECT clauses—can be dramatically accelerated by replacing interpreted AST traversal with Just‑In‑Time compilation using Apache Gandiva, which leverages LLVM to generate SIMD‑optimized machine code for Arrow columnar data, and discusses extensions such as timestamp, array, higher‑order functions, and UDF support.

Apache ArrowApache GandivaDatabase Optimization
0 likes · 17 min read
Optimizing Database Expression Evaluation with JIT Technology Using Gandiva
ITPUB
ITPUB
Jan 8, 2024 · Databases

How I Cut an 8‑Second SQL Query to 0.7 Seconds: Step‑by‑Step Optimization

This article walks through a real‑world case of a slow MySQL query that took 8 seconds, explains how the original use of EXISTS and NOT EXISTS caused performance bottlenecks, and details two successive rewrites—first using JOIN + GROUP BY, then replacing NOT EXISTS with NOT IN—bringing execution time down to 0.7 seconds while highlighting key indexing and testing lessons.

SQLdatabaseindexes
0 likes · 7 min read
How I Cut an 8‑Second SQL Query to 0.7 Seconds: Step‑by‑Step Optimization
DeWu Technology
DeWu Technology
Dec 18, 2023 · Databases

Elasticsearch Best Practices: Query, Index, and Performance Optimizations

The guide outlines production‑ready Elasticsearch best practices, covering query tuning such as using shard request cache, filter context, size‑0 aggregations and composite aggregations; write strategies like auto‑generated IDs, bulk API sizing and refresh handling; optimal shard counts, explicit mappings with disabled unnecessary features, and general advice to use explicit index names and stored scripts.

Elasticsearchcachingindexing
0 likes · 22 min read
Elasticsearch Best Practices: Query, Index, and Performance Optimizations
Architecture Digest
Architecture Digest
Dec 6, 2023 · Databases

MySQL Query Optimization and Index Tuning: From Subqueries to Joins

This article analyzes a slow MySQL query that scans millions of rows to find students scoring 100 in Chinese, then demonstrates how adding appropriate single‑column and composite indexes, rewriting the query as a join, and using covering indexes can reduce execution time from hours to milliseconds.

SQLmysqlperformance tuning
0 likes · 10 min read
MySQL Query Optimization and Index Tuning: From Subqueries to Joins
JD Cloud Developers
JD Cloud Developers
Nov 14, 2023 · Databases

How MySQL Executes a Simple SQL Query: Step‑by‑Step Deep Dive

This article walks through the complete lifecycle of a MySQL query—from the client request, through connection handling, parsing, optimization, execution, and storage engine processing—illustrating each component with diagrams, code snippets, and a detailed analysis of execution order for both simple and join queries.

Database InternalsSQL Executionmysql
0 likes · 12 min read
How MySQL Executes a Simple SQL Query: Step‑by‑Step Deep Dive
ITPUB
ITPUB
Nov 1, 2023 · Databases

Doris 2.0.2 vs 1.2.3: Real‑World Query Performance Comparison

After upgrading a Doris cluster from version 1.2.3 to 2.0.2, the author runs a series of SQL benchmarks—including PK lookups, top‑client queries, distinct counts on low‑ and high‑cardinality columns, minute‑level session analysis, and full‑table deduplication—to measure execution times, revealing mixed performance gains and regressions across the seven test scenarios.

Database UpgradeSQLdoris
0 likes · 9 min read
Doris 2.0.2 vs 1.2.3: Real‑World Query Performance Comparison
Java Architect Essentials
Java Architect Essentials
Oct 17, 2023 · Databases

Understanding LEFT JOIN ON vs WHERE Conditions in SQL

This article explains how LEFT JOIN works in SQL, clarifying the difference between ON‑clause conditions that affect the join process and WHERE‑clause filters that operate on the resulting temporary table, using multiple example queries and visual illustrations.

LEFT JOINON clauseSQL
0 likes · 5 min read
Understanding LEFT JOIN ON vs WHERE Conditions in SQL
Su San Talks Tech
Su San Talks Tech
Oct 17, 2023 · Databases

Why MySQL LIMIT + ORDER BY Returns Unexpected Rows – Priority Queue Threshold

This article investigates the nondeterministic behavior of MySQL when combining ORDER BY with LIMIT, reproduces the official example, identifies a critical row count threshold where the optimizer switches to a priority‑queue (heap) sort, and explains the underlying source‑code logic and its impact on query results.

Optimizer_traceOrder Byheap sort
0 likes · 19 min read
Why MySQL LIMIT + ORDER BY Returns Unexpected Rows – Priority Queue Threshold
ITPUB
ITPUB
Sep 28, 2023 · Databases

Mastering MySQL Index Merge: How Three Algorithms Boost Query Performance

Learn how MySQL's index merge optimization works, explore its three algorithms—intersect, union, and sort_union—understand when each applies, see practical setup and sample queries, and discover configuration flags and best‑practice tips to improve query efficiency.

databaseindex mergemysql
0 likes · 9 min read
Mastering MySQL Index Merge: How Three Algorithms Boost Query Performance
DataFunSummit
DataFunSummit
Sep 25, 2023 · Big Data

Trino in Bilibili Lakehouse: Compute Engine, Stability, and Containerization Practices

This article presents Bilibili's practical implementation of Trino within a lakehouse architecture, focusing on the compute engine placement, stability enhancements, and containerized deployment, while detailing indexing strategies, pre‑computation techniques, Iceberg metadata optimizations, and performance gains for large‑scale analytical queries.

IcebergLakehousePrecomputation
0 likes · 14 min read
Trino in Bilibili Lakehouse: Compute Engine, Stability, and Containerization Practices
JD Retail Technology
JD Retail Technology
Sep 13, 2023 · Databases

Optimizing Pagination Queries for Billion‑Row MySQL Tables

The article analyzes the performance problems of deep pagination on massive MySQL tables storing billions of fan records and presents three progressive solutions—simple LIMIT, tag‑record (maxId) pagination, and range‑limited pagination with async and offline minId caching—along with general indexing best‑practices for high‑throughput queries.

Large Dataindexingmysql
0 likes · 10 min read
Optimizing Pagination Queries for Billion‑Row MySQL Tables
Liangxu Linux
Liangxu Linux
Sep 10, 2023 · Databases

Master MySQL Query Optimization: Architecture, Caching, and Index Strategies

This article explains MySQL's logical architecture, query execution flow, caching mechanisms, and detailed index design techniques, providing practical tips and examples so developers can understand the underlying principles and apply effective performance optimizations in real‑world applications.

Database PerformanceSQLindexes
0 likes · 38 min read
Master MySQL Query Optimization: Architecture, Caching, and Index Strategies
Ximalaya Technology Team
Ximalaya Technology Team
Sep 6, 2023 · Backend Development

Design Analysis of Lucene and In-Memory Inverted Index Service for Advertising Retrieval

The team analyzed Lucene’s disk‑based inverted index and built a custom in‑memory inverted‑index service for Himalaya’s ad engine, encoding terms as 64‑bit keys, supporting real‑time updates and BooleanQuery‑style and custom expression filtering, which cut query latency from ~50 ms to under 5 ms and enabled massive scaling.

Data StructuresJavaMemory Search
0 likes · 27 min read
Design Analysis of Lucene and In-Memory Inverted Index Service for Advertising Retrieval
MaGe Linux Operations
MaGe Linux Operations
Aug 30, 2023 · Databases

Understanding SQL Execution Order: From FROM to LIMIT Explained

This article walks through the exact sequence a database follows when executing a SQL query—starting with FROM and JOIN, then WHERE, GROUP BY, HAVING, SELECT, DISTINCT, ORDER BY, and finally LIMIT—highlighting key differences from the textual order of the statement.

Execution OrderSQLSQL tutorial
0 likes · 5 min read
Understanding SQL Execution Order: From FROM to LIMIT Explained
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Aug 16, 2023 · Databases

Common Cases When MySQL Indexes Are Ignored

This article explains how to set up a MySQL test table and indexes, then enumerates nine typical scenarios—such as leading‑wildcard LIKE, arithmetic or functions on indexed columns, type mismatches, composite‑index misuse, character‑set differences, and OR conditions with non‑indexed fields—that cause queries to bypass indexes and perform full table scans.

SQLdatabaseindex
0 likes · 8 min read
Common Cases When MySQL Indexes Are Ignored
Java Architect Essentials
Java Architect Essentials
Aug 13, 2023 · Information Security

How to Perform Fuzzy Searches on Encrypted Data Without Breaking Security

This article examines three categories of approaches—naïve, conventional, and advanced—for enabling fuzzy queries on encrypted fields, comparing their implementation steps, performance trade‑offs, storage costs, and security implications, and provides practical examples such as in‑memory decryption, tag mapping, database functions, tokenization, and algorithm‑level designs.

Database Securityencrypted dataencryption
0 likes · 11 min read
How to Perform Fuzzy Searches on Encrypted Data Without Breaking Security
dbaplus Community
dbaplus Community
Aug 13, 2023 · Databases

Why Does This PostgreSQL Self‑Join Behave Unexpectedly? InitPlan & One‑Time Filter

The article examines a puzzling PostgreSQL self‑join query that yields surprising true/false results, walks through step‑by‑step rewrites, analyzes the resulting execution plans—including the appearance of InitPlan and One‑Time Filter nodes—and explains how correlated versus uncorrelated subqueries affect query evaluation.

InitPlanPostgreSQLSQL
0 likes · 11 min read
Why Does This PostgreSQL Self‑Join Behave Unexpectedly? InitPlan & One‑Time Filter
Top Architect
Top Architect
Aug 8, 2023 · Databases

MySQL Pagination Query Optimization Techniques and Performance Testing

The article examines various MySQL pagination strategies—including simple LIMIT clauses, sub‑query approaches, ID‑range filtering, and temporary‑table methods—provides performance measurements on a table with millions of rows, and offers practical recommendations for reducing query latency in large datasets.

databasemysqlpagination
0 likes · 9 min read
MySQL Pagination Query Optimization Techniques and Performance Testing
DataFunTalk
DataFunTalk
Jul 30, 2023 · Databases

MatrixOne Architecture and OLAP Engine Design Overview

This article presents an in‑depth overview of MatrixOne, an open‑source hyper‑converged cloud‑native database, detailing its three‑tier architecture of compute, data and file services, and explains the design and implementation of its OLAP engine, including parser, planner, optimizer, and push‑based execution pipeline.

Database ArchitectureExecution EngineMatrixOne
0 likes · 18 min read
MatrixOne Architecture and OLAP Engine Design Overview
21CTO
21CTO
Jul 17, 2023 · Big Data

How WeChat Cut Query Latency from Seconds to 100 ms with Druid Optimizations

This case study explains how the WeChat multi‑dimensional monitoring platform identified performance bottlenecks in its Druid‑based data layer, analyzed user query patterns, and applied sub‑query splitting, Redis caching, and segment size reductions to achieve over 85% cache‑hit rates and bring average query latency down to around 100 ms.

Big DataDruidcaching
0 likes · 13 min read
How WeChat Cut Query Latency from Seconds to 100 ms with Druid Optimizations
Su San Talks Tech
Su San Talks Tech
Jul 8, 2023 · Databases

Master MySQL Index Optimization: 20 Proven Rules to Boost Query Performance

This comprehensive guide explains 20 practical MySQL index optimization rules—including when NOT to use indexes, how to choose the right index type, and techniques such as prefix indexes, covering indexes, and query rewriting—to dramatically improve SQL query efficiency and reduce database load.

Database TuningIndex OptimizationSQL Performance
0 likes · 19 min read
Master MySQL Index Optimization: 20 Proven Rules to Boost Query Performance
Laravel Tech Community
Laravel Tech Community
Jul 5, 2023 · Databases

Performance Testing and Optimization of Large‑Scale MySQL Queries

This article demonstrates how to generate, insert, and query millions of rows in a MySQL 5.7 table, measures pagination performance under varying offsets and result sizes, and presents several optimization techniques—including sub‑queries, indexed look‑ups, and column selection—to dramatically reduce query latency.

Large DataPerformance Testingdatabase
0 likes · 10 min read
Performance Testing and Optimization of Large‑Scale MySQL Queries
Big Data Technology Architecture
Big Data Technology Architecture
Jul 4, 2023 · Databases

Apache Doris 2.0‑beta Release: New Query Optimizer, Pipeline Engine, Workload Management and Performance Enhancements

Apache Doris 2.0‑beta, released on July 3, 2023, introduces a modern Cascades‑based query optimizer, a data‑driven pipeline execution engine, fine‑grained workload groups, enhanced memory management, partial‑column updates, compute nodes, cold‑hot tiering and cross‑cluster replication, delivering up to tenfold speedups and significant cost reductions for real‑time analytics.

Apache DorisPipeline ExecutionSQL Engine
0 likes · 24 min read
Apache Doris 2.0‑beta Release: New Query Optimizer, Pipeline Engine, Workload Management and Performance Enhancements
dbaplus Community
dbaplus Community
Jun 25, 2023 · Big Data

WeChat’s 10× Query Speedup: From 1000ms to 100ms with Druid & Redis

WeChat’s multi‑dimensional monitoring platform faced severe query latency and I/O bottlenecks, so the team analyzed user behavior and Druid architecture, then introduced sub‑query splitting, Redis caching, and segment size reductions, achieving over 85% cache hit rate and reducing average query time to around 100 ms.

Big DataCacheDruid
0 likes · 12 min read
WeChat’s 10× Query Speedup: From 1000ms to 100ms with Druid & Redis
Architect
Architect
Jun 23, 2023 · Big Data

Optimizing Query Performance in WeChat's Multi‑Dimensional Monitoring Platform

This article details how the WeChat multi‑dimensional monitoring platform reduced average query latency from over 1000 ms to around 100 ms by analyzing user query patterns, redesigning the Druid data layer, splitting sub‑queries, introducing Redis caching, and employing sub‑dimension tables, achieving cache hit rates above 85%.

DruidWeChatmonitoring
0 likes · 13 min read
Optimizing Query Performance in WeChat's Multi‑Dimensional Monitoring Platform
Big Data Technology & Architecture
Big Data Technology & Architecture
Jun 13, 2023 · Big Data

Iceberg Data Lake Implementation and Optimization at iQIYI

This article details iQIYI's adoption of Iceberg for its data lake, covering the OLAP architecture, reasons for a data lake, Iceberg's table format advantages over Hive, platform construction, streaming ingestion, query and performance optimizations, real‑world business deployments, and future plans.

Big DataData LakeFlink
0 likes · 21 min read
Iceberg Data Lake Implementation and Optimization at iQIYI
FunTester
FunTester
Jun 7, 2023 · Big Data

Optimizing Query Performance in WeChat's Multi‑Dimensional Monitoring Platform with Druid and Redis

The article details how WeChat's multi‑dimensional metric monitoring platform, which handles billions of data points per minute, reduced average query latency from over 1000 ms to around 140 ms and achieved over 85% cache hit rate by analyzing query behavior, redesigning the data layer architecture, splitting queries into sub‑queries, adding Redis caching, and introducing sub‑dimension tables.

Big DataCacheDruid
0 likes · 13 min read
Optimizing Query Performance in WeChat's Multi‑Dimensional Monitoring Platform with Druid and Redis
Java High-Performance Architecture
Java High-Performance Architecture
Jun 5, 2023 · Databases

Master MySQL Indexes: Types, Usage, and Performance Optimization

This guide explains MySQL indexes—what they are, their data structures, advantages and drawbacks, when to create them, various index types, how to interpret EXPLAIN output, common pitfalls that cause index loss, and practical optimization techniques for queries, joins, ordering, grouping, and slow‑query logging.

Database Performanceexplainindex
0 likes · 19 min read
Master MySQL Indexes: Types, Usage, and Performance Optimization
Top Architect
Top Architect
Jun 1, 2023 · Databases

Comprehensive Guide to MySQL Indexes, Types, and Query Optimization

This article explains the definition, data structures, advantages, disadvantages, and appropriate usage scenarios of MySQL indexes, details primary, unique, single‑column and composite index types, analyzes common performance bottlenecks with EXPLAIN, demonstrates index creation, invalidation cases, query and join optimizations, and shows how to configure and use the slow‑query log.

Database PerformanceSQLexplain
0 likes · 19 min read
Comprehensive Guide to MySQL Indexes, Types, and Query Optimization
DataFunTalk
DataFunTalk
May 30, 2023 · Big Data

Optimizing Chart Query Performance in YouShu BI: Data Query Principles, Intelligent Caching, Query Merging, and Diagnostics

This article explains the data query fundamentals of YouShu BI charts, introduces intelligent caching design, describes query merging and various optimization techniques—including partition filters, value acceleration, and SQL generation—and outlines performance diagnosis methods to improve BI chart responsiveness.

BIBig DataChart Performance
0 likes · 16 min read
Optimizing Chart Query Performance in YouShu BI: Data Query Principles, Intelligent Caching, Query Merging, and Diagnostics
php Courses
php Courses
May 26, 2023 · Databases

Using Indexes to Optimize MySQL Queries

This article explains what indexes are, when they should be applied, and provides step‑by‑step MySQL commands—including single‑column, composite, and index‑hint examples—to improve query performance on large datasets.

Database PerformanceSQLindexing
0 likes · 6 min read
Using Indexes to Optimize MySQL Queries
Programmer DD
Programmer DD
May 24, 2023 · Databases

Why SQL Struggles with Complex Queries and How SPL Offers a Solution

This article examines the inherent difficulties of SQL when handling multi‑step, ordered, and object‑referencing queries, illustrates them with practical examples, and introduces the Structured Process Language (SPL) as a more expressive alternative for data processing tasks.

CTESPLSQL
0 likes · 16 min read
Why SQL Struggles with Complex Queries and How SPL Offers a Solution
Sanyou's Java Diary
Sanyou's Java Diary
May 23, 2023 · Databases

Master MySQL Indexes: From B+ Trees to Index Merge and Optimization

This article provides a comprehensive guide to MySQL indexing, covering index types, data structures, clustering vs. non‑clustering, hash and B+Tree indexes, covering indexes, index push‑down, index merge strategies, cost‑based index selection, common pitfalls, and practical indexing principles with illustrative examples and SQL snippets.

B+TreeInnoDBdatabase
0 likes · 31 min read
Master MySQL Indexes: From B+ Trees to Index Merge and Optimization
iQIYI Technical Product Team
iQIYI Technical Product Team
May 12, 2023 · Operations

Performance Troubleshooting and Optimization of Prometheus Monitoring Queries

The article explains that high metric cardinality in Prometheus causes long query times and timeouts, and demonstrates how using recording rules to pre‑compute aggregates dramatically reduces cardinality and latency, while recommending scrape interval tuning and metric design best practices to keep charts responsive.

PrometheusRecording RulesSRE
0 likes · 10 min read
Performance Troubleshooting and Optimization of Prometheus Monitoring Queries
dbaplus Community
dbaplus Community
May 7, 2023 · Databases

Mastering MySQL Indexes: 15 Essential Q&A to Boost Query Performance

This article provides a comprehensive 15‑question guide on MySQL indexes, covering their definition, various types across data‑structure, physical and logical dimensions, situations where indexes fail or are unsuitable, B+‑tree advantages, search process, covering indexes, left‑most prefix, index push‑down, adding indexes to huge tables, using EXPLAIN to verify index usage, hash versus B+‑tree differences, pros and cons, and the distinction between clustered and non‑clustered indexes.

B+Treeindexmysql
0 likes · 18 min read
Mastering MySQL Indexes: 15 Essential Q&A to Boost Query Performance
Open Source Linux
Open Source Linux
May 5, 2023 · Databases

Mastering Subquery Unnesting: Turn Correlated Queries into Fast Joins

This article explains why correlated subqueries are a performance bottleneck, introduces the Apply (Correlated Join) operator and a systematic set of transformation rules that push Apply down and replace subqueries with efficient joins, covering scalar, existential, and quantified subqueries, aggregation handling, and set operations, all illustrated with TPC‑H examples.

Apply OperatorDatabase EnginesDecorrelate
0 likes · 17 min read
Mastering Subquery Unnesting: Turn Correlated Queries into Fast Joins
ITPUB
ITPUB
May 3, 2023 · Databases

Master MySQL Indexes: Types, Usage, and Optimization Tips

This comprehensive guide explains what MySQL indexes are, details various index types—including B+‑tree, hash, full‑text, and spatial—covers when indexes become ineffective, shows how to design, use, and troubleshoot them, and provides practical steps for large‑scale index management.

B+TreeInnoDBdatabase
0 likes · 19 min read
Master MySQL Indexes: Types, Usage, and Optimization Tips
Liangxu Linux
Liangxu Linux
Apr 25, 2023 · Databases

How to Unnest Correlated Subqueries for Faster SQL Execution

This article explains why correlated subqueries are costly, introduces the Apply (correlated join) operator, and presents a series of systematic transformation rules that convert scalar, existential, and aggregate subqueries into efficient join‑based plans while preserving SQL semantics.

Apply OperatorDecorrelationSQL
0 likes · 17 min read
How to Unnest Correlated Subqueries for Faster SQL Execution
StarRocks
StarRocks
Apr 23, 2023 · Databases

Why Query Performance Optimization Matters and How to Master It

This guide explains the importance of query performance optimization for database products and engineers, outlines latency and throughput goals, shows how to locate bottlenecks with observability tools and Linux profilers, and provides practical high‑level and low‑level optimization techniques along with testing best practices.

BenchmarkingCPU profilingStarRocks
0 likes · 16 min read
Why Query Performance Optimization Matters and How to Master It
ITPUB
ITPUB
Apr 18, 2023 · Databases

Mastering MySQL Index Merge: Principles, Algorithms, and Practical Tips

This article explains MySQL's index‑merge optimization, detailing how the EXPLAIN output indicates its use, the three underlying algorithms (intersect, union, sort‑union), practical query examples, configuration flags, and step‑by‑step guidance for testing with sample tables and indexes.

AlgorithmsSQLdatabase
0 likes · 9 min read
Mastering MySQL Index Merge: Principles, Algorithms, and Practical Tips
Sohu Tech Products
Sohu Tech Products
Mar 29, 2023 · Databases

Understanding the Purpose and Impact of WHERE 1=1 in SQL Queries

This article explains why developers often include the redundant condition “WHERE 1=1” in SQL statements, demonstrating through performance tests that it has no effect on execution time, and describing its practical use for simplifying the addition of further conditions in both static and dynamic queries.

Dynamic SQLSQLWHERE clause
0 likes · 6 min read
Understanding the Purpose and Impact of WHERE 1=1 in SQL Queries
WeiLi Technology Team
WeiLi Technology Team
Mar 29, 2023 · Databases

Why Did MongoDB’s Query Planner Suddenly Slow Down? A Deep Dive into Index Cache Failures

The article explains how null or empty query values caused MongoDB to ignore the intended index, leading to massive slow queries and service timeouts, and details the step‑by‑step investigation, plan‑cache invalidation, and the corrective addition of a compound index that restored normal performance.

MongoDBPerformance debuggingPlan Cache
0 likes · 21 min read
Why Did MongoDB’s Query Planner Suddenly Slow Down? A Deep Dive into Index Cache Failures
ITPUB
ITPUB
Mar 25, 2023 · Big Data

Mastering Efficient SQL in ODPS: Union, Count‑Distinct, and Join Optimizations

This article walks through common SQL development scenarios on ODPS, examining why naïve UNION and COUNT DISTINCT can be slow, how to rewrite queries with GROUP BY, UNION ALL, JSON aggregation, and map‑join techniques, and shows the resulting execution‑plan improvements with concrete code and performance numbers.

Big DataCountDistinctMapJoin
0 likes · 17 min read
Mastering Efficient SQL in ODPS: Union, Count‑Distinct, and Join Optimizations
DataFunTalk
DataFunTalk
Mar 21, 2023 · Databases

Design and Technical Details of Apache Doris for Lakehouse Architecture

This article explains how Apache Doris extends its real‑time OLAP capabilities to support Lakehouse architectures, covering unified metadata, query acceleration, elastic compute, performance benchmarks, and future roadmap for richer data‑source integration and resource isolation.

Apache DorisBig DataData Warehouse
0 likes · 20 min read
Design and Technical Details of Apache Doris for Lakehouse Architecture
dbaplus Community
dbaplus Community
Mar 12, 2023 · Databases

Why MySQL 8.0 Queries Fail After Upgrading from 5.7: Charset & Collation Pitfalls

When upgrading MySQL 5.7 tables that use utf8 or utf8mb3 to MySQL 8.0, mismatched character sets and collations can cause index loss and inefficient joins, but converting all tables to utf8mb4 and adjusting the driver table can restore optimal query performance, as demonstrated with detailed examples and profiling results.

Charsetcollationmigration
0 likes · 7 min read
Why MySQL 8.0 Queries Fail After Upgrading from 5.7: Charset & Collation Pitfalls
Selected Java Interview Questions
Selected Java Interview Questions
Mar 12, 2023 · Big Data

Design and Optimization of Querying 100K Records from Tens of Millions of Data Using ClickHouse, Elasticsearch, HBase, and RediSearch

This article presents a comprehensive design and performance‑optimization study for extracting up to 100 000 records from a pool of tens of millions, comparing multithreaded ClickHouse pagination, Elasticsearch scroll‑scan, ES + HBase, and RediSearch + RedisJSON solutions, and provides practical recommendations based on measured latency and throughput.

ClickHouseHBaseRediSearch
0 likes · 11 min read
Design and Optimization of Querying 100K Records from Tens of Millions of Data Using ClickHouse, Elasticsearch, HBase, and RediSearch
Laravel Tech Community
Laravel Tech Community
Mar 6, 2023 · Databases

SQL Optimization Best Practices: Avoid SELECT *, Use Specific Fields, Index Strategies, and Other Performance Tips

This article presents a comprehensive set of MySQL performance guidelines, including avoiding SELECT *, preferring specific columns, eliminating OR in WHERE clauses, using numeric types, choosing VARCHAR over CHAR, applying proper JOINs, leveraging LIMIT, UNION ALL, composite indexes, and other practical tips to improve query efficiency and resource usage.

SQLdatabaseindexing
0 likes · 17 min read
SQL Optimization Best Practices: Avoid SELECT *, Use Specific Fields, Index Strategies, and Other Performance Tips
DataFunTalk
DataFunTalk
Feb 12, 2023 · Big Data

Optimizing Bilibili Presto Cluster Query Performance with Alluxio and Local Cache

This article presents a comprehensive technical overview of Bilibili's Presto cluster architecture, the challenges of query performance on Hadoop, and the systematic optimizations—including Alluxio integration, local cache mechanisms, multi‑active coordinators, label‑based scheduling, and real‑time penalties—that together improve availability, stability, and latency for large‑scale analytics workloads.

AlluxioBig DataCache
0 likes · 23 min read
Optimizing Bilibili Presto Cluster Query Performance with Alluxio and Local Cache
Architect's Guide
Architect's Guide
Jan 24, 2023 · Databases

Understanding Database Indexes: How They Accelerate Query Performance

This article explains the evolution of data storage, the fundamentals of computer storage devices, how database indexes function like a book's table of contents, the role of binary search, the benefits and drawbacks of indexes, clustered versus non‑clustered indexes, and common SQL optimization techniques.

Clustered IndexSQLdatabase
0 likes · 11 min read
Understanding Database Indexes: How They Accelerate Query Performance
DataFunTalk
DataFunTalk
Jan 5, 2023 · Big Data

Five Optimization Strategies for Improving DataTester Query Performance

This article describes how DataTester, Volcano Engine's A/B testing platform, achieved over four‑fold query speedup by applying five technical optimizations—including pre‑aggregation, join reduction, GroupBy redesign, AU‑metric caching, and asynchronous query handling—targeting both data construction and execution layers.

A/B testingClickHouseDataTester
0 likes · 12 min read
Five Optimization Strategies for Improving DataTester Query Performance
Liangxu Linux
Liangxu Linux
Jan 2, 2023 · Databases

13 Must‑Know MySQL Functions and Tricks for Everyday Development

This guide walks through a collection of practical MySQL functions and commands—including GROUP_CONCAT, CHAR_LENGTH, LOCATE, REPLACE, NOW, INSERT…SELECT, INSERT…IGNORE, SELECT FOR UPDATE, ON DUPLICATE KEY UPDATE, SHOW CREATE TABLE, CREATE TABLE … SELECT, EXPLAIN, SHOW PROCESSLIST and mysqldump—providing clear syntax, example queries, result screenshots and key usage notes to help developers write more efficient and reliable SQL.

Data MigrationSQLdatabase
0 likes · 14 min read
13 Must‑Know MySQL Functions and Tricks for Everyday Development
ITPUB
ITPUB
Dec 28, 2022 · Databases

How TDSQL‑PG Achieves Real‑Time HTAP: Architecture, Storage, and Optimization Insights

This article presents a comprehensive overview of TDSQL‑PG's HTAP capabilities, detailing the evolution of its storage, compute, transaction management, hybrid row‑column storage, cost‑based optimizer, vectorized execution, and resource isolation strategies for handling mixed OLTP and OLAP workloads.

HTAPHybrid storageResource Isolation
0 likes · 18 min read
How TDSQL‑PG Achieves Real‑Time HTAP: Architecture, Storage, and Optimization Insights
Code Ape Tech Column
Code Ape Tech Column
Dec 28, 2022 · Big Data

Design and Optimization of Querying 100k Records from Tens of Millions Using ClickHouse, Elasticsearch, HBase, and Redis

This article presents a comprehensive analysis and multiple design alternatives—including multithreaded ClickHouse pagination, Elasticsearch scroll‑scan, ES+HBase hybrid, and RediSearch+RedisJSON—to efficiently filter, sort, and de‑duplicate up to 100,000 records from a pool of tens of millions, with detailed performance comparisons and code examples.

HBasequery optimizationredis
0 likes · 10 min read
Design and Optimization of Querying 100k Records from Tens of Millions Using ClickHouse, Elasticsearch, HBase, and Redis
Java Backend Technology
Java Backend Technology
Dec 21, 2022 · Databases

Master MySQL: 14 Essential Functions & Commands Every Developer Should Know

This guide walks through fourteen practical MySQL techniques—from aggregating rows with group_concat and measuring string length with char_length, to locating substrings, replacing text, retrieving timestamps, bulk inserting, handling duplicates, using pessimistic locks, inspecting tables, explaining queries, monitoring processes, and exporting data with mysqldump.

SQL functionsdata backupmysql
0 likes · 13 min read
Master MySQL: 14 Essential Functions & Commands Every Developer Should Know
DataFunSummit
DataFunSummit
Dec 20, 2022 · Big Data

JD Retail Big Data OLAP Application and Practice

This talk presents JD Retail’s big‑data OLAP solution, covering the massive, variable and complex traffic data challenges, the custom data‑ingestion and versioned update tools, ClickHouse query‑architecture upgrades, optimization techniques, and future plans for multi‑cluster querying and pre‑computation.

Big DataClickHouseJD Retail
0 likes · 21 min read
JD Retail Big Data OLAP Application and Practice
dbaplus Community
dbaplus Community
Nov 26, 2022 · Databases

How to Perform Fuzzy Searches on Encrypted Data: Methods, Pros & Cons

This article examines why encrypted data hinders fuzzy queries and compares three categories of solutions—naïve, conventional, and advanced—detailing their implementation steps, performance trade‑offs, storage costs, and practical suitability for real‑world systems.

data securityencryptionfuzzy-search
0 likes · 11 min read
How to Perform Fuzzy Searches on Encrypted Data: Methods, Pros & Cons
dbaplus Community
dbaplus Community
Nov 20, 2022 · Databases

Master MySQL Performance: From Tuning Pyramid to Explain Plans

This guide walks through MySQL performance tuning, covering the architectural tuning pyramid, query optimization techniques, slow‑query logging, detailed EXPLAIN execution‑plan analysis, and high‑performance indexing strategies to help you diagnose and accelerate database workloads.

Slow Query Logexplainmysql
0 likes · 43 min read
Master MySQL Performance: From Tuning Pyramid to Explain Plans
Tencent Cloud Developer
Tencent Cloud Developer
Nov 18, 2022 · Databases

How CDW PG Optimizer Finds Optimal Join Order for Multi-Table Queries

CDW PG’s optimizer determines the most efficient join order for multi‑table OLAP queries by combining bottom‑up dynamic programming for smaller joins with a genetic algorithm for larger ones, while jointly selecting scan paths, join algorithms, and data‑distribution strategies to minimize execution cost.

CDW PGJOIN optimizationMPP architecture
0 likes · 9 min read
How CDW PG Optimizer Finds Optimal Join Order for Multi-Table Queries
ITPUB
ITPUB
Nov 13, 2022 · Databases

Inside MySQL: How the Server Handles Connections, Parsing, and Storage Engines

This article explains MySQL's client‑server architecture and walks through the three main stages—connection handling, query parsing and optimization, and storage engine selection—detailing protocols, authentication, cache behavior, optimizer mechanics, and engine characteristics with practical commands and examples.

Connection HandlingDatabase InternalsStorage Engine
0 likes · 21 min read
Inside MySQL: How the Server Handles Connections, Parsing, and Storage Engines
vivo Internet Technology
vivo Internet Technology
Oct 19, 2022 · Big Data

SQL Parser Selection and Implementation: ANTLR vs Apache Calcite for Big Data Applications

The article explains why adding a SQL parser to big‑data platforms such as Hive, Spark, Flink or Kafka simplifies development, compares ANTLR and Apache Calcite implementations, shows code examples, and concludes that Calcite’s lower learning curve and greater flexibility make it the preferred choice for production‑grade SQL layers.

ANTLRCalciteSQL Parser
0 likes · 18 min read
SQL Parser Selection and Implementation: ANTLR vs Apache Calcite for Big Data Applications
Laravel Tech Community
Laravel Tech Community
Oct 10, 2022 · Databases

Optimizing MySQL Pagination for Large Datasets

The article analyzes why MySQL LIMIT pagination becomes slower as the offset grows, demonstrates the performance impact with sample queries on a 500,000‑row table, and presents three optimization strategies—using ordered primary keys, subqueries, and join‑based pagination—to reduce full‑table scans and improve query speed.

indexmysqlpagination
0 likes · 8 min read
Optimizing MySQL Pagination for Large Datasets
dbaplus Community
dbaplus Community
Oct 7, 2022 · Databases

30 Essential SQL Query Optimization Tips to Boost Performance

This guide presents thirty practical SQL Server optimization techniques—including proper indexing, avoiding full table scans, rewriting predicates, and minimizing temporary objects—to help developers write faster, more efficient queries and improve overall database performance.

SQLSQL Serverquery optimization
0 likes · 10 min read
30 Essential SQL Query Optimization Tips to Boost Performance
Bilibili Tech
Bilibili Tech
Sep 30, 2022 · Big Data

Bilibili's Efficient Lakehouse Platform Built on Trino and Iceberg

Bilibili’s new lake‑house platform, built on Trino and Iceberg, replaces Hive‑based pipelines by ingesting logs and DB data into Iceberg tables, applying advanced sorting, Z‑order/Hilbert clustering, bitmap and bloom indexes, virtual join columns and pre‑aggregation, enabling 70 000 daily queries on 2 PB with average scans of 2 GB and sub‑2‑second response times.

Big DataData SkippingIceberg
0 likes · 15 min read
Bilibili's Efficient Lakehouse Platform Built on Trino and Iceberg
HomeTech
HomeTech
Sep 13, 2022 · Big Data

Integrating Heterogeneous Data Sources with openLooKeng and Upgrading the Apache Kylin Connector at AutoHome

This article describes how AutoHome tackled the complexity of managing multiple relational, NoSQL, and Hive data stores by adopting openLooKeng for unified, cross‑source SQL queries, outlines its key features such as ANSI‑SQL support, diverse connectors, and query optimizations, and details the custom enhancements made to the Apache Kylin connector to better serve their commercial data analysis workloads.

Big DataConnectorsData Integration
0 likes · 13 min read
Integrating Heterogeneous Data Sources with openLooKeng and Upgrading the Apache Kylin Connector at AutoHome
ITPUB
ITPUB
Sep 8, 2022 · Databases

Master MySQL Performance: 5 Key Optimization Strategies You Must Know

This guide explores five essential dimensions of MySQL performance tuning—including connection configuration, architectural choices such as caching and read‑write splitting, optimizer settings, storage engine selection, and schema design—providing concrete commands, formulas, and best‑practice recommendations to accelerate query execution and reduce load.

Database Architecturemysqlperformance tuning
0 likes · 20 min read
Master MySQL Performance: 5 Key Optimization Strategies You Must Know
Java High-Performance Architecture
Java High-Performance Architecture
Aug 2, 2022 · Databases

Why MySQL IN Subqueries Can Be So Slow and How to Fix Them

This article examines why a MySQL query that uses an IN subquery on a massive users table becomes extremely slow, analyzes the execution plan revealing materialized temporary tables and semi‑join optimization, and demonstrates how disabling the optimizer or rewriting the query restores index usage and dramatically improves performance.

Semi-Joinexecution planindexing
0 likes · 8 min read
Why MySQL IN Subqueries Can Be So Slow and How to Fix Them
Top Architect
Top Architect
Aug 2, 2022 · Databases

Understanding MySQL Execution Process: Connectors, Permissions, Cache, Parser, Optimizer, and Executor

This article explains the complete MySQL execution flow, covering the connector's duties, permission tables, caching behavior, parsing of SQL statements, optimizer decision‑making, executor interaction with storage engines, processlist states, logical query order, and practical tips for ordering WHERE‑clause conditions.

CacheExecution ProcessPermissions
0 likes · 11 min read
Understanding MySQL Execution Process: Connectors, Permissions, Cache, Parser, Optimizer, and Executor
Top Architect
Top Architect
Jul 15, 2022 · Databases

SQL Query Optimization Tips to Avoid Full Table Scans

This article presents a comprehensive list of practical SQL query optimization techniques, emphasizing how to avoid full table scans by using proper indexing, rewriting WHERE clauses, avoiding certain operators, functions, and expressions, and providing example code snippets to improve database performance.

SQLT-SQLquery optimization
0 likes · 10 min read
SQL Query Optimization Tips to Avoid Full Table Scans