Databases 17 min read

Understanding MySQL Crash‑Safe Mechanism: WAL, Redo/Undo/Binlog, Two‑Phase and Group Commit, and Recovery Process

MySQL achieves crash‑safe durability by employing a write‑ahead log architecture that records changes in redo, undo, and binary logs, coordinates them through two‑phase and group commit to maintain order, and uses a structured recovery process to replay or roll back transactions after any failure.

vivo Internet Technology

May 20, 2020

Understanding MySQL Crash‑Safe Mechanism: WAL, Redo/Undo/Binlog, Two‑Phase and Group Commit, and Recovery Process

MySQL, as the most popular open‑source relational database, must guarantee that data is never lost. This article explains how MySQL is designed to be crash‑safe, i.e., it can recover to any point in time and ensure that a sudden crash never results in lost committed transactions.

Two key capabilities :

Point‑in‑time recovery using sufficient binary logs (binlog).

Crash‑safe execution where any transaction, even if the server crashes at any stage, will either be fully committed or automatically rolled back. This relies on the redo log and undo log.

The crash‑safe property is implemented by the Write‑Ahead Log (WAL) technique. In InnoDB, the transaction lifecycle is recorded in three core logs:

Redo log – records physical page changes; used to replay committed changes after a crash.

Undo log – records logical inverse operations; provides rollback and MVCC.

Binlog – records the original SQL statements; used for replication and point‑in‑time recovery.

The article walks through the execution flow of an UPDATE statement, showing how MySQL writes to memory, records changes in the redo log, logs the SQL in the binlog, and finally flushes both logs to disk asynchronously.

The article then introduces the WAL mechanism, explaining why MySQL writes to memory first and logs changes before persisting to disk: sequential writes to log files are far faster than random writes to data files.

Two‑Phase Commit (2PC)

To keep redo log and binlog consistent, MySQL splits the redo‑log write into a prepare and a commit phase, inserting the binlog write between them. This ensures that both logs reflect the same transaction order, which is essential for master‑slave consistency.

The article also shows a concrete example where interleaving redo‑log and binlog writes incorrectly would cause data divergence between primary and replica.

When multiple transactions commit concurrently, simple 2PC can still lead to ordering mismatches. The following code snippet illustrates such a scenario:

<pre><code><span>T1 (<span>--prepare--binlog---------------------commit)</span></span>

<span>T2 (<span>-----prepare-----binlog----commit)</span></span>

<span>T3 (<span>--------prepare-------binlog------commit)</span></span>

<span>解析：</span>

<span>    redo log <span>prepare</span>的顺序：T1 <span>--》T2 --》T3</span></span>

<span>    <span>binlog</span>的写入顺序：T1 <span>--》 T2 --》 T3</span></span>

<span>    <span>redo</span> <span>log</span> <span>commit</span>的顺序：T2 <span>--》 T3 --》T1</span></span>

<span>结论：由于<span>binlog</span>写入的顺序和<span>redo</span> <span>log</span>的提交结束的顺序不一致，导致<span>binlog</span>和<span>redo</span> <span>log</span>所记录的事务提交结束的顺序不一样，最终导致的结果就是主从数据不一致。</span>

To mitigate the lock contention of 2PC, MySQL 5.6 introduced Binary Log Group Commit (BLGC) . BLGC groups multiple transactions and flushes their binlogs together, reducing the number of fsync operations and the lock granularity. The process consists of three stages:

Flush Stage : leader acquires Lock_log, writes a batch of binlog entries to the cache.

Sync Stage : leader releases Lock_log, acquires Lock_sync, and performs a single fsync for the whole batch.

Commit Stage : leader releases Lock_sync, acquires Lock_commit, and commits each transaction in InnoDB, clearing undo information.

Group commit dramatically reduces I/O overhead and lock contention, while still preserving the order between redo and binlog.

Data Recovery Process

If MySQL crashes during a transaction, the recovery procedure examines the redo log for prepared transactions, matches them with binlog entries, and either re‑commits the redo log or rolls back using the undo log. The article enumerates four crash moments (A‑D) and explains the recovery actions for each.

Finally, the article summarizes the key takeaways:

WAL is the fundamental technique for consistency and durability in most databases.

MySQL’s redo log, undo log, binlog, two‑phase commit, and group commit together provide robust crash‑safe guarantees.

The recovery workflow validates logs at various crash points to ensure no committed data is lost.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

MySQL Binlog undo log WAL two-phase commit Crash Safety Group Commit Redo Log

Written by

vivo Internet Technology

Sharing practical vivo Internet technology insights and salon events, plus the latest industry news and hot conferences.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.