Tagged articles
14 articles
Page 1 of 1
FunTester
FunTester
Feb 26, 2026 · Backend Development

Why Waiting, Not Computing, Dominates Tail Latency in High‑Concurrency Systems

In high‑concurrency systems, tail latency is driven primarily by waiting on locks, resources, and scheduling rather than raw computation, with phenomena like head‑of‑line blocking, context‑switch overhead, and cache‑coherency costs amplifying unpredictable delays.

backend systemscontext switchhead-of-line blocking
0 likes · 13 min read
Why Waiting, Not Computing, Dominates Tail Latency in High‑Concurrency Systems
dbaplus Community
dbaplus Community
Feb 24, 2026 · Cloud Native

How CPU Architecture Bottlenecks Cripple Netflix’s Container Scaling

Netflix discovered that scaling hundreds of containers on modern CPUs hit severe lock‑contention due to mount‑related kernel locks, with performance varying across AWS instance types, NUMA designs, and hyper‑threading, leading them to redesign containerd mounting and choose hardware‑aware scheduling to restore efficient scaling.

AWSCPU architectureHyper-threading
0 likes · 16 min read
How CPU Architecture Bottlenecks Cripple Netflix’s Container Scaling
Linux Kernel Journey
Linux Kernel Journey
Aug 5, 2025 · Fundamentals

Why Convert madv_dontneed/madv_free to Per‑VMA Locks in Linux

The article explains how the traditional mmap_lock read lock creates severe contention and priority inversion for frequent madvise MADV_DONTNEED/FREE calls, and how a per‑VMA locking redesign eliminates this bottleneck, improves performance, and is slated for the next Linux kernel release.

Linuxkernel memory managementlock contention
0 likes · 5 min read
Why Convert madv_dontneed/madv_free to Per‑VMA Locks in Linux
JD Tech
JD Tech
Jul 2, 2025 · Backend Development

How We Tamed Database Lock Contention in High‑Volume Inventory Allocation

This article examines the severe row‑lock competition in MySQL during hot‑SKU inventory positioning, analyzes its causes and risks, and presents a comprehensive set of mitigation strategies—including DB account isolation, flow‑control, request merging, and message‑queue optimizations—validated by extensive performance testing and successful rollout across multiple clusters.

Message QueuePerformance Testingbackend optimization
0 likes · 14 min read
How We Tamed Database Lock Contention in High‑Volume Inventory Allocation
IT Services Circle
IT Services Circle
Nov 12, 2023 · Databases

Why Many Large Internet Companies Avoid Using MySQL Foreign Keys

The article explains that while MySQL foreign keys ensure data consistency and integrity, they introduce performance overhead, lock contention, and scalability issues—especially in high‑concurrency and sharded environments—leading many large internet firms to forego them in favor of application‑level solutions.

Database PerformanceForeign KeyScalability
0 likes · 5 min read
Why Many Large Internet Companies Avoid Using MySQL Foreign Keys
ITPUB
ITPUB
Nov 23, 2020 · Databases

Eliminating InnoDB Adaptive Hash Index Lock Contention for Faster Queries

The article examines the hidden lock‑contention issue in InnoDB’s Adaptive Hash Index (AHI) observed during high‑concurrency sysbench runs, explains why multiple threads block on the AHI hash table’s exclusive lock, and presents a lightweight fallback optimization that checks the lock before building AHI, reducing lock wait time and stabilizing QPS performance.

Adaptive Hash IndexDatabase InternalsInnoDB
0 likes · 9 min read
Eliminating InnoDB Adaptive Hash Index Lock Contention for Faster Queries
Didi Tech
Didi Tech
Aug 28, 2020 · Operations

Ceph Performance Optimization: Lock-Related Issues and Solutions

The article details how Didi’s large‑scale Ceph deployment suffered from high tail latency due to long‑held and coarse‑grained locks, and describes a series of fixes—including asynchronous read threads, fine‑grained object caches, per‑thread lock‑free logging, and lock‑free filestore apply—that cut latency by up to 90 % and more than doubled read throughput.

BlueStoreCephFileStore
0 likes · 12 min read
Ceph Performance Optimization: Lock-Related Issues and Solutions
Tencent Database Technology
Tencent Database Technology
Jun 27, 2019 · Databases

Analysis and Optimization of InnoDB lock_wait_thread Contention in a Tencent Cloud Database

The article investigates intermittent slow update performance in a Tencent Cloud internal system caused by massive lock_wait_thread contention, analyzes the underlying InnoDB lock mechanisms and thread behavior, implements a fix by disabling lock_wait_suspend_thread triggers, and demonstrates substantial latency reduction through benchmark results.

InnoDBPerformance Optimizationdatabase
0 likes · 8 min read
Analysis and Optimization of InnoDB lock_wait_thread Contention in a Tencent Cloud Database
Aikesheng Open Source Community
Aikesheng Open Source Community
Mar 19, 2019 · Databases

Root Cause Analysis of Slave IO Thread Hang in MySQL Semi‑Sync Replication with rpl_semi_sync_master_wait_for_slave_count=1

An in‑depth investigation reveals that when MySQL semi‑sync replication is configured with rpl_semi_sync_master_wait_for_slave_count=1, starting a second slave can cause the master’s dump thread to fail, leading to slave_io_thread stalls; the analysis includes reproduction steps, status checks, thread stack traces, and a patch using sched_yield to resolve the lock contention.

Dump threadSemi-sync replicationlock contention
0 likes · 8 min read
Root Cause Analysis of Slave IO Thread Hang in MySQL Semi‑Sync Replication with rpl_semi_sync_master_wait_for_slave_count=1