Databases 13 min read

Understanding MySQL Architecture: Storage Engine, Buffer Pool, Logs, and Indexes

This article explains how MySQL stores and manages data by introducing the InnoDB storage engine, its buffer pool, data pages, B+Tree and adaptive hash indexes, change buffer, undo/redo logs, binlog, and the server layer that parses, optimizes, and executes SQL statements.

IT Services Circle

Dec 21, 2024

Understanding MySQL Architecture: Storage Engine, Buffer Pool, Logs, and Indexes

What is MySQL?

MySQL is a relational database that acts as a middle layer between applications and data, providing core CRUD (Create, Read, Update, Delete) operations through its storage engines.

InnoDB Storage Engine

The default InnoDB engine stores rows and indexes in .ibd files on disk, while keeping frequently accessed data in memory structures such as the buffer pool, adaptive hash index, and log buffers.

Data Pages

Data is organized into 16 KB pages, similar to Excel sheets, allowing the system to read or write only the required pages instead of the whole file.

Indexes

Pages are linked via a B+Tree index built on primary keys; auxiliary indexes can be created on other columns to speed up lookups.

Buffer Pool

The buffer pool caches data pages and index pages in process memory, reducing disk I/O. It also supports custom caching strategies and locking.

Adaptive Hash Index

Hot pages are added to an in‑memory hash table, providing O(1) lookup for frequently accessed rows.

Change Buffer

Write operations to secondary indexes are first collected in the change buffer and applied later when the corresponding pages are loaded into the buffer pool, further reducing I/O.

Undo Log

Before a row is modified, the original version is recorded in an undo log stored in memory and later flushed to disk, enabling transaction rollback.

Redo Log

All modifications are written to a redo‑log buffer and periodically flushed to a redo‑log file. This Write‑Ahead Logging (WAL) guarantees that committed transactions survive crashes.

Binlog

The server layer records every data‑changing statement in a binary log (binlog). Unlike the circular redo log, the binlog preserves the full history of changes, supporting point‑in‑time recovery and replication.

Server Layer

The server parses SQL, runs the optimizer to choose indexes, generates an execution plan, and then calls InnoDB’s API functions (e.g., write_row(), update_row(), create(), drop()).

Master‑Slave Replication

Because the binlog contains all changes, a replica can replay the log to stay synchronized with the master, allowing read traffic to be offloaded to slaves.

Query and Update Flow

Read: the buffer pool is checked first; if the needed page is missing, it is loaded from disk and optionally added to the adaptive hash index.

Write: data is written to the buffer pool, an undo log entry is created, the change is recorded in the redo‑log buffer, and secondary‑index updates are staged in the change buffer before being flushed.

Summary

MySQL consists of a server layer and interchangeable storage engines (InnoDB is the most common).

InnoDB splits its workload between memory (buffer pool, adaptive hash, log buffers) and disk (data files, ibd, undo/redo logs).

The server’s binlog records all changes for recovery and replication.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

InnoDB mysql Binlog indexes undo log redo log buffer pool

Written by

IT Services Circle

Delivering cutting-edge internet insights and practical learning resources. We're a passionate and principled IT media platform.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.