Tagged articles
636 articles
dbaplus Community
Aug 1, 2023 · Operations

Why Is My Elasticsearch Cluster Consuming 15 GB Heap? A Deep Dive into Memory, Sharding, and Performance Bottlenecks

A 7‑node Elasticsearch cluster handling 500 million documents shows excessive heap usage, many deleted documents, high translog size, saturated OS memory, and inefficient sharding, prompting a detailed analysis of stats, root‑cause identification, and concrete recommendations for mapping, shard design, and JVM tuning.

Cluster Monitoring · JVM · Memory Optimization
0 likes · 16 min read
Top Architect
Jul 18, 2023 · Fundamentals

Comprehensive Introduction to Elasticsearch: Core Concepts, Architecture, and Practical Usage

This article provides a detailed overview of Elasticsearch, covering its underlying Lucene technology, data types, indexing mechanisms, cluster architecture, shard and replica management, mapping definitions, installation steps, health monitoring, write and storage processes, and performance optimization techniques for production deployments.

Elasticsearch · Replication · indexing
0 likes · 36 min read
DataFunSummit
Jul 6, 2023 · Big Data

Design and Practice of Alibaba Cloud's Billion‑Scale Real‑Time Log Analysis

This article presents Alibaba Cloud's SLS billion‑scale real‑time log analysis architecture, covering business background, core challenges such as low‑latency queries, massive data scale, high concurrency, and multi‑tenant isolation, and detailing key design solutions like LSM‑based storage, index‑columnar storage, data locality, layered caching, and future directions.

Big Data · distributed storage · high concurrency
0 likes · 17 min read
Top Architect
Jun 30, 2023 · Databases

Optimizing MySQL LIMIT Pagination: Analysis and Solutions

This article examines why MySQL LIMIT pagination becomes slower with deeper offsets on a 500,000‑row table, demonstrates the performance impact with concrete queries, and presents three optimization strategies—including using ordered primary keys, subqueries, and join‑based approaches—to reduce scan range and improve query speed.

Database Optimization · SQL · indexing
0 likes · 8 min read
Bilibili Tech
Jun 30, 2023 · Backend Development

Optimizing Bilibili Video Detail Page Backend with Business Association Index and Data Aggregation Gateway

The article describes how Bilibili tackled exploding fan‑out reads on its video detail page by introducing a Redis‑backed business‑association index and a generic Data Aggregation Gateway, which together cut downstream service traffic and load by over 90% while simplifying aggregation across multiple terminals and scenarios.

BFF · Backend · data aggregation
0 likes · 13 min read
Programmer DD
Jun 26, 2023 · Databases

Unlock Lightning-Fast Search: Proven Elasticsearch Performance Tuning Tips

This article presents comprehensive best‑practice recommendations for optimizing Elasticsearch deployments, covering hardware selection, RAID choices, index and shard planning, query and caching strategies, bulk indexing, refresh intervals, monitoring tools, version upgrades, and lifecycle management to achieve high performance, reliability, and scalability.

Elasticsearch · indexing · performance tuning
0 likes · 18 min read
Big Data Technology & Architecture
Jun 22, 2023 · Databases

Apache Doris 2.0 New Features: High‑Concurrency Data Serving Optimizations

Apache Doris 2.0 introduces a suite of high‑concurrency data‑serving enhancements, including row‑store format, partition‑bucket pruning, advanced indexing, materialized views, runtime filters, TOPN optimization, short‑circuit point‑query paths, prepared statements, and row cache, enabling tens of thousands of QPS on a single node and dramatically reducing query latency.

Apache Doris · Benchmark · Data Serving
0 likes · 23 min read
Bilibili Tech
Jun 20, 2023 · Big Data

Design and Evolution of Bilibili's Billions 3.0 Log Platform: A Lakehouse Architecture with ClickHouse, Iceberg, and Trino

Bilibili evolved its log platform from the ClickHouse‑based Billions 2.0 to Billions 3.0, a lakehouse built on Iceberg, HDFS, and Trino that retains ClickHouse for acceleration. The new design reduces storage cost by over 20%, improves observability, solves the compute‑storage mismatch, adds flexible indexing, and supports complex ETL while staying open‑source.

ClickHouse · Iceberg · Lakehouse
0 likes · 36 min read
Aikesheng Open Source Community
Jun 14, 2023 · Databases

MongoDB Crash Analysis: WT_SESSION Exhaustion Caused by Index Drop During Background Index Build

This case study examines a MongoDB 4.0.14 replica set crash caused by exceeding the WiredTiger session limit: the primary created a background index and then dropped it, which blocked the secondary and led to connection saturation, session exhaustion, and instance failure.

Database operations · MongoDB · WiredTiger
0 likes · 10 min read
政采云技术
Jun 13, 2023 · Databases

MySQL Logical Architecture Design and Performance Optimization

This article explains MySQL's four‑layer logical architecture, describes high‑performance read strategies, details data pages, buffer pool and LRU algorithm, and provides comprehensive guidance on index structures, optimization techniques, and best practices for improving query performance.

Database Architecture · LRU algorithm · buffer pool
0 likes · 14 min read
IT Services Circle
Jun 10, 2023 · Databases

Comprehensive MySQL Interview Guide: Key Concepts and Practices

This article provides a systematic, interview‑focused overview of MySQL fundamentals—including normal forms, DML vs DDL, primary and foreign keys, storage engines, redo and binlog mechanisms, indexing strategies, transaction properties, MVCC, lock types, replication, and common performance pitfalls—helping candidates quickly master essential database knowledge for interview success.

SQL · Transactions · database
0 likes · 32 min read
Liangxu Linux
Jun 6, 2023 · Databases

Why MySQL Chooses B+ Trees Over Skip Lists for Indexing

The article explains the structural differences between B+ trees and skip lists, compares their read‑write performance in MySQL and Redis, and shows why MySQL prefers B+ trees while Redis adopts skip lists for its in‑memory ZSET implementation.

B+Tree · database · indexing
0 likes · 13 min read
php Courses
May 26, 2023 · Databases

Using Indexes to Optimize MySQL Queries

This article explains what indexes are, when they should be applied, and provides step‑by‑step MySQL commands—including single‑column, composite, and index‑hint examples—to improve query performance on large datasets.

Database Performance · SQL · indexing
0 likes · 6 min read
Su San Talks Tech
May 25, 2023 · Databases

Why MySQL Uses B+ Trees Instead of Skip Lists for Indexing

This article compares B+ trees and skip lists, explaining their structures, insertion and search complexities, and why MySQL chooses B+ trees for disk‑based indexing while Redis prefers skip lists for in‑memory sorted sets, highlighting trade‑offs in read/write performance and I/O costs.

B+Tree · database · indexing
0 likes · 12 min read
Java Architect Essentials
Apr 28, 2023 · Backend Development

12 Proven Strategies to Supercharge API Performance in Java

This article outlines twelve practical techniques—including batch processing, asynchronous execution, caching, pre‑processing, pooling, parallelization, indexing, transaction sizing, code refactoring, pagination, SQL tuning, and lock granularity—to dramatically reduce API latency and improve overall backend efficiency.

API performance · Asynchronous · Batch Processing
0 likes · 9 min read
IT Architects Alliance
Apr 23, 2023 · Backend Development

Common Interface Performance Optimization Strategies

This article outlines a comprehensive set of backend interface optimization techniques—including batch processing, asynchronous execution, caching, pre‑processing, pooling, parallelization, indexing, transaction sizing, code restructuring, deep pagination, SQL tuning, and lock granularity—to reduce latency and improve overall system efficiency.

API optimization · Asynchronous Execution · Batch Processing
0 likes · 8 min read
Architect
Apr 20, 2023 · Backend Development

Common Interface Performance Optimization Strategies

This article summarizes practical techniques for reducing API latency, including batch processing, asynchronous execution, caching, pre‑processing, pooling, parallelization, indexing, transaction size control, program structure refactoring, deep pagination handling, SQL tuning, and proper lock granularity.

API optimization · Backend Performance · indexing
0 likes · 9 min read
Sohu Tech Products
Apr 12, 2023 · Databases

MySQL Query Optimization Strategies: Pagination, Joins, Subqueries, Sorting, Group By, and Count

This article presents practical MySQL query optimization techniques—including pagination, index‑based join improvements, subquery replacement with joins, order‑by and group‑by indexing, and count() efficiency—illustrated with real‑world examples, execution‑plan analysis, and ready‑to‑run SQL code.

Database Tuning · SQL Optimization · indexing
0 likes · 11 min read
IT Architects Alliance
Apr 11, 2023 · Databases

MySQL Execution Process and Comprehensive Optimization Best Practices

This article explains MySQL's query execution flow and provides extensive best‑practice guidelines—including naming conventions, storage engine choices, primary‑key design, character set settings, query writing tips, indexing strategies, batch operations, and performance‑enhancing techniques—to help developers write efficient and maintainable SQL code.

Database Performance · Query Tuning · SQL Optimization
0 likes · 24 min read
Top Architect
Apr 7, 2023 · Databases

MySQL Performance Optimization Guidelines from a Senior Architect

This article presents a senior architect’s comprehensive MySQL optimization guide, covering execution process, naming conventions, storage engine choices, indexing strategies, query writing best practices, transaction handling, partitioning, and other performance‑tuning techniques to improve database efficiency and reliability.

DatabaseOptimization · SQL · indexing
0 likes · 24 min read
macrozheng
Apr 3, 2023 · Backend Development

Boost API Performance: 11 Proven Backend Optimization Techniques

This article presents a comprehensive set of backend API optimization strategies—including batch processing, asynchronous execution, caching, pre‑processing, pooling, parallelization, indexing, transaction management, code restructuring, deep pagination, SQL tuning, and lock granularity—to dramatically reduce response times and improve system efficiency.

API optimization · Backend Performance · asynchronous processing
0 likes · 10 min read
Top Architect
Mar 29, 2023 · Databases

MySQL Query Optimization and Index Tuning: From Subqueries to Joins and Composite Indexes

This article demonstrates MySQL query performance issues with large tables, shows how a subquery without indexes can take hours, and walks through step‑by‑step optimizations including adding single‑column indexes, converting to EXISTS and JOIN queries, creating composite indexes, covering indexes, and sorting improvements, achieving speedups of thousands of times.

Composite Index · Database Performance · Query Tuning
0 likes · 12 min read
Aikesheng Open Source Community
Mar 29, 2023 · Databases

Understanding Local and Global Indexes in OceanBase

The article explains the concepts, differences, and practical usage scenarios of local (partition‑aligned) and global indexes in OceanBase, providing SQL examples for queries with and without partition keys and guidance on creating unique indexes using either index type.

Local Index · OceanBase · SQL
0 likes · 7 min read
WeiLi Technology Team
Mar 29, 2023 · Databases

Why Did MongoDB’s Query Planner Suddenly Slow Down? A Deep Dive into Index Cache Failures

The article explains how null or empty query values caused MongoDB to ignore the intended index, leading to massive slow queries and service timeouts, and details the step‑by‑step investigation, plan‑cache invalidation, and the corrective addition of a compound index that restored normal performance.

MongoDB · Performance debugging · Plan Cache
0 likes · 21 min read
Python Crawling & Data Mining
Mar 23, 2023 · Backend Development

How to Exclude Folders in PyCharm to Speed Up Indexing

This article explains how to configure PyCharm to exclude specific folders from indexing, reducing update time and speeding up IDE startup, illustrated with step‑by‑step screenshots and practical tips for Python developers dealing with large projects.

IDE optimization · PyCharm · development tips
0 likes · 3 min read
Laravel Tech Community
Mar 19, 2023 · Databases

How to Diagnose and Resolve MySQL Process CPU Usage Spiking to 900%

This article explains common scenarios that cause MySQL (and related services) to consume 900% CPU, outlines step‑by‑step diagnosis using tools like top and show processlist, and provides practical remediation such as killing offending threads, adding missing indexes, disabling slow‑log, and leveraging Redis caching to bring CPU usage down to acceptable levels.

CPU · SQL · caching
0 likes · 6 min read
Code Ape Tech Column
Mar 13, 2023 · Backend Development

Diagnosing and Resolving 900% CPU Spikes in MySQL and Java Processes

This article explains common scenarios that cause CPU usage to soar above 200% in production, outlines step‑by‑step diagnosis and remediation for MySQL and Java processes, and shares real‑world case studies with command‑line tools, indexing, caching, and code adjustments to bring CPU load back to normal levels.

CPU · Debugging · Java
0 likes · 11 min read
Laravel Tech Community
Mar 6, 2023 · Databases

SQL Optimization Best Practices: Avoid SELECT *, Use Specific Fields, Index Strategies, and Other Performance Tips

This article presents a comprehensive set of MySQL performance guidelines, including avoiding SELECT *, preferring specific columns, eliminating OR in WHERE clauses, using numeric types, choosing VARCHAR over CHAR, applying proper JOINs, leveraging LIMIT, UNION ALL, composite indexes, and other practical tips to improve query efficiency and resource usage.

SQL · database · indexing
0 likes · 17 min read
Alimama Tech
Feb 15, 2023 · Big Data

Dolphin: Alibaba's Hyper‑Converged Multi‑Modal Big Data Engine Overview

Dolphin, Alibaba’s hyper‑converged multi‑modal big‑data engine, unifies OLAP, AI, streaming, and batch workloads on a decoupled compute‑storage MPP foundation, offering a Dolphin SQL layer, advanced bitmap/GroupTable/AFile indexes, intelligent materialization, and one‑write‑multiple‑read storage that cuts costs over 70% while delivering sub‑millisecond queries on trillion‑row datasets.

AI · Big Data · OLAP
0 likes · 14 min read
Aikesheng Open Source Community
Feb 15, 2023 · Databases

SQL Statement Optimization Practices for MySQL

This article explains how DBAs can improve MySQL query performance by applying logical and physical optimizations, analyzing table structures, indexes, and execution plans, and provides a step‑by‑step checklist for diagnosing and rewriting slow or poorly optimized SQL statements.

Database Performance · Query Tuning · SQL Optimization
0 likes · 8 min read
dbaplus Community
Feb 14, 2023 · Databases

10 Common PostgreSQL Mistakes and How to Avoid Them

This article outlines the ten most frequent PostgreSQL pitfalls—from default configurations and poor schema design to inadequate backups and mismanaged extensions—providing concrete SQL queries, tuning tools, and best‑practice recommendations to help DBAs detect, diagnose, and prevent each issue before it harms production workloads.

Backup · Connection Pooling · Database Administration
0 likes · 16 min read
ITPUB
Feb 9, 2023 · Databases

Inside InnoDB: How MySQL Stores Data, Row Formats, and Indexes Explained

This article breaks down MySQL's InnoDB storage engine, detailing where data files reside, the different row formats (compact, redundant, dynamic, compressed), the internal 16 KB page layout, record header fields, overflow handling, and how B+ tree indexes (clustered and secondary) are built and searched.

B+Tree · Database Storage · InnoDB
0 likes · 24 min read
JD Tech
Feb 7, 2023 · Databases

MySQL Database Performance Optimization: Principles, Architecture, Indexing, and Real‑World Cases

This article explains common MySQL database performance optimization techniques, underlying storage architecture, index construction and usage guidelines, and presents several real‑world cases from an internet hospital project, helping developers understand optimization strategies, avoid hidden pitfalls, and improve query efficiency.

Case Study · Performance Optimization · indexing
0 likes · 15 min read
ITPUB
Jan 20, 2023 · Big Data

How Bilibili Supercharged OLAP Queries with Iceberg Lakehouse Optimizations

This article details Bilibili's practical deployment of an Iceberg lake‑warehouse architecture within its OLAP platform, covering the motivations for lake‑warehouse integration, core Iceberg optimizations such as data‑organization sorting, Z‑order and secondary indexes, the Magnus intelligent management platform, and future roadmap plans.

Precomputation · bigdata · indexing
0 likes · 16 min read
Top Architect
Jan 6, 2023 · Databases

Why MySQL Discourages UUID Primary Keys and a Performance Comparison of Auto‑Increment, UUID, and Random Long IDs

This article investigates MySQL's recommendation against using UUIDs as primary keys by creating three tables with auto‑increment, UUID, and random (snowflake) keys, running insertion benchmarks with Spring Boot/JdbcTemplate, analyzing index structures, and discussing the advantages and drawbacks of each key strategy.

Snowflake ID · auto_increment · indexing
0 likes · 11 min read
Java High-Performance Architecture
Jan 4, 2023 · Databases

Why Do Database Indexes Speed Up Queries? A Deep Dive into Storage and Optimization

This article explains how databases store data on physical devices, how indexes work like a book’s table of contents to accelerate queries, covers storage fundamentals, binary search, clustered vs non‑clustered indexes, and practical SQL optimization tips to avoid full‑table scans and index pitfalls.

SQL Optimization · indexing · storage
0 likes · 12 min read
IT Architects Alliance
Jan 3, 2023 · Fundamentals

Performance Optimization Techniques: Indexing, Caching, Compression, Prefetching, Throttling, and Batch Processing

The article explores a wide range of performance‑optimization strategies—including indexing, caching, compression, prefetching, peak‑shaving, and batch processing—explaining their trade‑offs, practical applications, and how they relate to hardware latency and system design in modern computing environments.

batch-processing · caching · compression
0 likes · 34 min read
Architects' Tech Alliance
Dec 25, 2022 · Operations

Mastering Time‑Space Trade‑offs: Indexing, Caching, Compression & More

This article consolidates three performance‑optimization posts, detailing six universal time‑space trade‑off techniques—indexing, caching, compression, prefetching, peak‑valley smoothing, and batch processing—plus four advanced parallelism strategies, illustrated with real‑world analogies and practical guidelines for developers.

Batch Processing · Performance Optimization · Software Engineering
0 likes · 22 min read
Programmer DD
Dec 23, 2022 · Backend Development

Unlock High Performance: 6 Proven Techniques to Supercharge Your Software

This article presents six practical performance‑optimization methods—indexing, compression, caching, prefetching, peak‑shaving, and batch processing—explains their principles, trade‑offs, and real‑world applications, and then dives into four advanced strategies for scaling and parallelism, all illustrated with memorable analogies.

Scalability · caching · compression
0 likes · 36 min read
php Courses
Dec 9, 2022 · Databases

Elasticsearch Index and Document Operations Tutorial

This tutorial explains how to create, query, update, and delete Elasticsearch indices and documents using RESTful HTTP requests, covering basic CRUD operations, various query types, pagination, sorting, aggregations, highlighting, and mapping definitions with practical JSON examples.

Big Data · Elasticsearch · JSON
0 likes · 8 min read
dbaplus Community
Nov 19, 2022 · Databases

How to Speed Up MySQL Queries: Indexing and Execution Plan Optimization

This article walks through a MySQL performance case study, showing how nested subqueries on large tables can take hours, and demonstrates step‑by‑step how creating single‑column, composite, and covering indexes, as well as rewriting queries as joins, reduces execution time from tens of thousands of seconds to milliseconds.

SQL · execution plan · indexing
0 likes · 12 min read
21CTO
Nov 18, 2022 · Big Data

How to Supercharge Elasticsearch for Billion‑Row Queries: Proven Optimization Techniques

This article details a real‑world case study of optimizing Elasticsearch for massive daily data volumes, covering the underlying Lucene architecture, shard routing, index and search performance tweaks, practical configuration settings, and benchmark results that achieve sub‑second query responses on billions of records.

Search · big-data · indexing
0 likes · 13 min read
DeWu Technology
Nov 11, 2022 · Databases

TiDB Overview: Architecture, Storage Model, and Execution Plan

TiDB is an open‑source, MySQL‑compatible distributed database that unifies OLTP and OLAP by using a stateless SQL layer, a Placement Driver for metadata and Raft‑managed TiKV/TiFlash storage, key‑value mapping with RocksDB, and a push‑down execution engine that splits work between coprocessor and root tasks for scalable, highly available workloads.

HTAP · SQL Execution Plan · Storage Engine
0 likes · 17 min read
Architecture Digest
Nov 7, 2022 · Backend Development

Comprehensive Guide to Software Performance Optimization: Indexing, Compression, Caching, Prefetching, Throttling, and Batch Processing

This article presents a thorough, multi‑part exploration of software performance optimization techniques—including indexing, compression, caching, prefetching, peak‑shaving, and batch processing—explaining their principles, trade‑offs, practical applications, and how they relate to hardware constraints and system design.

Batch Processing · Scalability · caching
0 likes · 36 min read
DataFunSummit
Nov 4, 2022 · Big Data

Real-Time Data Lake Practice at ByteDance: Architecture, Challenges, and Solutions

ByteDance’s data platform team explains their real‑time data lake implementation, covering its evolving definition, six core capabilities, challenges such as data management, concurrent updates, performance and log ingestion, and detailed case studies of multi‑stage deployment, indexing, metadata services, and future roadmap.

Hudi · Real-time Data Lake · Streaming
0 likes · 32 min read
Top Architect
Oct 29, 2022 · Databases

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive overview of Elasticsearch, covering its underlying Lucene architecture, data types, cluster components, shard allocation, indexing mechanisms, storage strategies, and performance tuning tips for building scalable, near‑real‑time search solutions.

Distributed Systems · Performance Optimization · indexing
0 likes · 35 min read
Architect's Guide
Oct 27, 2022 · Big Data

Elasticsearch Overview: Data Types, Lucene Foundations, Core Concepts, Cluster Architecture, Indexing, Storage, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering the distinction between structured and unstructured data, Lucene’s inverted index, ES core concepts such as clusters, nodes, shards and replicas, mapping, basic usage, storage mechanisms, and practical performance‑tuning tips for large‑scale search deployments.

Distributed Systems · Elasticsearch · Performance Optimization
0 likes · 39 min read
Programmer DD
Oct 26, 2022 · Databases

Boost MySQL Performance: 20 Proven SQL Optimization Tips

This article presents a comprehensive set of MySQL optimization techniques, covering everything from avoiding SELECT * and OR conditions to proper index usage, join strategies, batch operations, and query planning, all aimed at reducing resource consumption, improving execution speed, and ensuring reliable database design.

Database Performance · Query Tuning · SQL Optimization
0 likes · 19 min read
Ctrip Technology
Oct 20, 2022 · Databases

Practical Slow Query Optimization for MySQL at Ctrip

This article describes Ctrip's practical approach to identifying and optimizing MySQL slow queries, covering background, improved SQL review workflow, execution plan analysis, common index problems, query rewriting, pagination issues, resource contention, and best‑practice recommendations for long‑term performance.

Database Optimization · Query Planning · SQL Performance
0 likes · 10 min read
Top Architect
Oct 19, 2022 · Big Data

Elasticsearch Architecture Overview and Core Concepts

This article provides a comprehensive overview of Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, shard allocation, indexing mechanisms, storage strategies, refresh and translog processes, segment merging, performance tuning, and JVM optimization for building scalable, near‑real‑time search solutions.

Big Data · Cluster · Elasticsearch
0 likes · 37 min read
MaGe Linux Operations
Oct 15, 2022 · Databases

20 Essential MySQL Optimization Tips to Boost Query Performance

This article presents a comprehensive set of MySQL best‑practice guidelines—including avoiding SELECT *, steering clear of OR in WHERE clauses, preferring numeric over string types, using VARCHAR instead of CHAR, limiting DELETE/UPDATE, leveraging proper JOIN types, and many other indexing and query‑execution tricks—to dramatically improve query speed, reduce resource consumption, and maintain healthy database design.

Database design · SQL Optimization · indexing
0 likes · 18 min read
Top Architect
Oct 14, 2022 · Databases

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, node roles, shard and replica mechanisms, mapping, basic usage, installation steps, health monitoring, indexing workflow, storage strategies, refresh and translog handling, segment merging, and practical performance tuning tips.

Cluster · Elasticsearch · Performance Optimization
0 likes · 36 min read
Top Architect
Oct 8, 2022 · Databases

Optimizing MySQL Pagination for Large Datasets: Techniques and Examples

This article analyzes why MySQL LIMIT pagination becomes slower on large tables, demonstrates the performance impact with real query timings, explains the underlying full‑table scan behavior, and presents three practical optimization strategies—including ordered primary‑key scans, sub‑queries, and join‑based approaches—to improve query speed.

Database Optimization · SQL · indexing
0 likes · 7 min read
Bilibili Tech
Sep 30, 2022 · Big Data

Bilibili's Efficient Lakehouse Platform Built on Trino and Iceberg

Bilibili’s new lake‑house platform, built on Trino and Iceberg, replaces Hive‑based pipelines by ingesting logs and DB data into Iceberg tables, applying advanced sorting, Z‑order/Hilbert clustering, bitmap and bloom indexes, virtual join columns and pre‑aggregation, enabling 70 000 daily queries on 2 PB with average scans of 2 GB and sub‑2‑second response times.

Big Data · Data Skipping · Iceberg
0 likes · 15 min read
Su San Talks Tech
Sep 28, 2022 · Databases

Why MySQL Queries Slow Down and How to Speed Them Up

This article explains the common reasons MySQL queries become sluggish—beyond missing indexes—including connection limits, buffer pool size, and query cache, and provides practical solutions such as profiling, adjusting connection pools, tuning InnoDB settings, and using force index.

Connection Pool · buffer pool · indexing
0 likes · 16 min read
Architect
Sep 23, 2022 · Databases

Elasticsearch Index and Search Performance Optimization for Billion‑Scale Data

This article presents a comprehensive case study of optimizing Elasticsearch and its underlying Lucene structures to achieve sub‑second query responses on billions of records, covering architecture basics, index design, doc‑values tuning, bulk‑write strategies, and extensive performance testing.

big-data · indexing · lucene
0 likes · 12 min read
ITFLY8 Architecture Home
Sep 14, 2022 · Databases

Master MySQL: 3 Normal Forms, Engine Differences, Indexing, Transactions & More

This comprehensive guide covers MySQL fundamentals such as the three normal forms, differences between MyISAM and InnoDB, redo log versus binlog, when queries bypass indexes, join types, various index structures, covering indexes, back‑row lookups, transaction properties and isolation levels, common performance bottlenecks, replication mechanics, lag mitigation, storage behavior after deletes, VARCHAR limits, lock types, systematic SQL tuning steps, the purpose and trade‑offs of indexes, B+‑tree advantages, MVCC internals, and the Snowflake distributed ID algorithm.

Replication · Transactions · indexing
0 likes · 24 min read
Java High-Performance Architecture
Sep 6, 2022 · Databases

Top 20 MySQL Optimization Tips to Boost Query Performance

This article presents twenty practical MySQL optimization techniques—from selecting specific columns and avoiding OR in WHERE clauses to proper indexing, join strategies, and using EXPLAIN—helping developers improve query speed, reduce resource consumption, and write more maintainable SQL code.

best-practices · database · indexing
0 likes · 18 min read
政采云技术
Aug 30, 2022 · Fundamentals

Understanding Lucene Document Writing Process: Core Classes, Workflow, and Flush Strategies

This article explains the key Lucene classes involved in document indexing, outlines the end‑to‑end write workflow—including preUpdate, obtainAndLock, updateDocument, exception handling, and post‑update flush logic—and discusses the strategies and thresholds that control when in‑memory buffers are flushed to disk.

Document Writing · Java · concurrency
0 likes · 16 min read
WeChat Client Technology Team
Aug 30, 2022 · Databases

How WeChat Optimized Its Desktop Database for Speed, Size, and Reliability

This article analyzes the performance and storage problems of the Windows version of WeChat, explains why message growth causes slowness, large file size, and corruption risk, and presents a multi‑pronged solution involving sharding, indexing, payload compression, and robustness improvements that halve database size and boost I/O performance by about ten percent.

Database Optimization · SQLite · compression
0 likes · 11 min read
IT Architects Alliance
Aug 25, 2022 · Databases

Relational Database Design: ER Modeling, Normalization, Storage Engine, Charset, Data Types, and Index Design

The article provides a comprehensive guide to relational database design using MySQL, covering ER modeling, normalization (BCNF and 3NF), storage engine selection, character set and data type choices, as well as index design principles illustrated with a student course selection example.

Data Types · Database design · Storage Engine
0 likes · 17 min read
dbaplus Community
Aug 21, 2022 · Databases

Master MySQL Performance: Architecture, Indexes, and Advanced Query Tuning

This comprehensive guide walks you through MySQL's core architecture, explains why SQL optimization matters, details index types and creation methods, demystifies EXPLAIN output, and provides practical examples and best‑practice tips to dramatically improve query performance.

Database Performance · Explain Plan · Query Tuning
0 likes · 34 min read
Architect's Guide
Aug 21, 2022 · Databases

Relational Database Design with MySQL: ER Modeling, Normalization, Storage Engine, Charset, Data Types, and Index Design

This article explains relational database design using MySQL, covering ER modeling, BCNF and 3NF normalization, storage engine and charset choices, data type selection, and index design, illustrated with examples and practical guidelines for building a student course selection system.

Data Types · Database design · ER Model
0 likes · 15 min read
dbaplus Community
Aug 14, 2022 · Databases

Master MySQL Performance: 12 Proven SQL Optimization Techniques

This article explains MySQL's internal architecture and provides twelve practical SQL optimization strategies—including avoiding subqueries, using IN instead of OR, efficient pagination, minimizing ORDER BY, batch inserts, selective column retrieval, proper JOIN handling, and index best practices—to dramatically improve database query performance.

Database Performance · Query Tuning · SQL Optimization
0 likes · 20 min read
dbaplus Community
Aug 6, 2022 · Databases

Mastering MySQL JSON: From Basics to Advanced Partial Updates

This comprehensive guide explains MySQL's native JSON data type, its advantages over text storage, detailed CRUD operations, indexing strategies, migration from TEXT columns, the powerful Partial Updates feature, performance benchmarks, and a wide range of JSON functions with practical examples.

JSON · Partial Updates · functions
0 likes · 39 min read
Java High-Performance Architecture
Aug 2, 2022 · Databases

Why MySQL IN Subqueries Can Be So Slow and How to Fix Them

This article examines why a MySQL query that uses an IN subquery on a massive users table becomes extremely slow, analyzes the execution plan revealing materialized temporary tables and semi‑join optimization, and demonstrates how disabling the optimizer or rewriting the query restores index usage and dramatically improves performance.

Semi-Join · execution plan · indexing
0 likes · 8 min read
Top Architect
Jul 28, 2022 · Databases

SQL Optimization Steps and Common Scenarios: Index Usage, EXPLAIN, Profiling, and Trace

This article explains how to identify and resolve performance bottlenecks in MySQL by locating slow queries, analyzing execution plans with EXPLAIN, focusing on type, rows and extra information, using profiling and trace tools, and applying practical optimization techniques illustrated through multiple real‑world cases.

Database Optimization · SQL · indexing
0 likes · 11 min read
dbaplus Community
Jul 26, 2022 · Backend Development

Unlocking Elasticsearch: Core Concepts, Architecture, and Performance Tips

This comprehensive guide explains Elasticsearch’s role in searching structured and unstructured data, covers Lucene’s inverted index, details cluster components, shard and replica mechanics, mapping types, installation steps, indexing workflow, storage strategies, and practical performance optimizations for real‑world deployments.

Distributed Systems · Elasticsearch · indexing
0 likes · 39 min read
21CTO
Jul 20, 2022 · Backend Development

Boost Your Python Projects with Whoosh: A Lightweight Search Engine Tutorial

This article introduces the lightweight pure‑Python search library Whoosh, outlines its key features, demonstrates how to define a schema, build an index from a CSV of Chinese poems, and perform full‑text queries with example code, making it ideal for small search projects.

Full‑Text Search · example · indexing
0 likes · 7 min read
IT Architects Alliance
Jul 14, 2022 · Big Data

Elasticsearch Overview: Core Concepts, Architecture, and Practical Usage

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, node roles, shard and replica mechanisms, mapping, installation, health monitoring, indexing principles, storage strategies, refresh and translog handling, segment merging, performance tuning, and JVM optimization for large‑scale search applications.

Big Data · Elasticsearch · Performance Optimization
0 likes · 35 min read
Top Architect
Jul 14, 2022 · Big Data

A Comprehensive Introduction to Elasticsearch: Architecture, Core Concepts, and Practical Usage

This article provides a detailed overview of Elasticsearch, covering its data model, Lucene foundation, cluster architecture, shard and replica mechanisms, index mapping, installation steps, health monitoring, write and storage processes, segment management, and performance tuning techniques for large‑scale search applications.

Big Data · Elasticsearch · indexing
0 likes · 35 min read
HelloTech
Jul 6, 2022 · Big Data

Investigation and Resolution of Elasticsearch Write Timeout Issues in a Real-Time Flink Data Sync Pipeline

The team diagnosed intermittent Elasticsearch write‑timeout failures in their real‑time Flink‑to‑Elasticsearch pipeline as lock contention from frequent duplicate updates to the same document IDs, and eliminated the issue by aggregating binlog events in a 5‑second sliding window to deduplicate writes, adjusting refresh intervals, using async translog durability, and disabling non‑essential fields.

Big Data · Elasticsearch · Flink
0 likes · 7 min read
php Courses
Jun 24, 2022 · Databases

Comprehensive List of MySQL Interview Questions

This article compiles 66 MySQL interview questions covering fundamentals such as joins, data types, indexes, storage engines, transaction properties, locking mechanisms, replication, sharding, performance tuning, and high‑availability concepts for interview preparation.

Replication · SQL · database
0 likes · 3 min read
DataFunTalk
Jun 23, 2022 · Big Data

Real‑Time Low‑Latency Log Monitoring and Storage at Ctrip: Architecture, Clog System, CAT Tracing, and TSDB

This article details Ctrip's large‑scale, real‑time log monitoring solution, covering the overall monitoring architecture, the Clog log system, the CAT tracing platform, and the TSDB metric store, and explains design choices such as write‑heavy indexing, segment‑based storage, and migration to ClickHouse for high‑cardinality data.

Distributed Systems · Log Monitoring · Real-time Processing
0 likes · 11 min read
Top Architect
Jun 13, 2022 · Databases

Comprehensive MySQL Performance Optimization Guide

This article provides a detailed guide on optimizing MySQL performance, covering SQL statement tuning, index usage, table design normalization, system configuration, hardware choices, and the internal architecture of MySQL to help developers write faster and more efficient queries.

database · indexing · mysql
0 likes · 19 min read
Tencent Cloud Developer
Jun 7, 2022 · Databases

Mastering MongoDB: From Basics to Advanced Performance Tuning

This comprehensive guide explores MongoDB’s core features—including schema flexibility, high availability, sharding, storage engine internals, indexing, and performance tuning—while providing practical examples, configuration tips, and best‑practice recommendations for developers and architects seeking to efficiently deploy and operate MongoDB in production environments.

MongoDB · Replication · WiredTiger
0 likes · 40 min read
Big Data Technology Architecture
Jun 7, 2022 · Big Data

Multi-Modal Index in Apache Hudi 0.11.0: Design, Implementation, and Performance Benefits

This article explains the motivation, design principles, implementation details, and performance improvements of the new multi‑modal indexing subsystem introduced in Apache Hudi 0.11.0 for Lakehouse architectures, covering scalable metadata, ACID updates, fast lookups, file listing, data skipping, upsert performance, and future work.

Apache Hudi · indexing · metadata
0 likes · 19 min read
Python Programming Learning Circle
May 17, 2022 · Fundamentals

Python Basics: Indexing, Slicing, String Alignment, List Comprehensions, Variable Arguments, and NumPy Linear Algebra

This article introduces essential Python techniques—including list indexing and slicing, string alignment methods, list and dictionary comprehensions, handling variable-length arguments, and using NumPy for linear‑algebra operations—providing clear code examples and explanations for each concept.

NumPy · Python · Slicing
0 likes · 8 min read
IT Xianyu
May 17, 2022 · Databases

SQL Optimization Steps and Common Scenarios

This article explains how to identify slow SQL statements, analyze execution plans with EXPLAIN, use profiling and tracing tools, and apply specific optimization techniques such as index tuning, query rewriting, and alternative storage solutions across a range of real‑world cases.

Profiling · SQL · explain
0 likes · 11 min read
Python Crawling & Data Mining
May 14, 2022 · Fundamentals

Unlock Python’s Hidden Powers: Advanced Indexing, Comprehensions, and NumPy Tricks

Discover essential Python techniques—from powerful list indexing and slicing, string alignment methods, and list/dict comprehensions to handling variable arguments and performing efficient linear algebra with NumPy—illustrated with clear code examples that boost readability and performance for developers.

Advanced Techniques · Comprehensions · indexing
0 likes · 7 min read
ITPUB
May 12, 2022 · Databases

Build a Tiny Shell‑Based Database in Two Commands – How It Works

This article demonstrates how a functional, persistent key‑value store can be created with just two shell commands—one to append data and another to retrieve the latest entry—while explaining the underlying grep and sed techniques, performance trade‑offs, and basic indexing concepts.

Grep · Shell · database
0 likes · 5 min read
StarRocks
May 12, 2022 · Databases

How StarRocks’ Primary Key Model Delivers 3‑5× Faster Real‑Time Queries

This article explains the design and implementation of StarRocks 2.x Primary Key tables, covering real‑time update mechanisms, write and commit workflows, in‑memory primary indexing, compaction, read‑path optimizations, performance benchmarks, and upcoming features such as partial and conditional updates.

OLAP · StarRocks · compaction
0 likes · 19 min read
Top Architect
May 4, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, node roles, shard and replica mechanisms, mapping, basic usage, health monitoring, indexing workflow, storage strategies, and practical performance tuning techniques.

Elasticsearch · Performance Optimization · indexing
0 likes · 36 min read
Laiye Technology Team
Apr 29, 2022 · Artificial Intelligence

Using Faiss for Efficient Vector Similarity Search: Installation, Index Construction, and Performance Optimization

This tutorial explains what Faiss is, how to install it, construct various indexes such as IndexFlatL2, IndexIVFFlat, and IndexIVFPQ, and demonstrates code examples for building and querying vector similarity search pipelines while discussing speed‑accuracy trade‑offs.

AI · FAISS · approximate nearest neighbor
0 likes · 11 min read
Su San Talks Tech
Apr 28, 2022 · Databases

Why MySQL Chooses B+ Trees Over Skip Lists for Indexing

This article explains the structures of B+ trees and skip lists, compares their insertion and search behaviors, and shows why MySQL prefers B+ trees for disk‑based indexing while Redis adopts skip lists for in‑memory operations.

B+Tree · database · indexing
0 likes · 12 min read