Investigation of Intermittent Redis Timeout Issues Caused by a Kernel Scheduling Bug on Skylake Servers
The article details how Ctrip engineers diagnosed sporadic Redis timeouts in containerized deployments, traced the problem to kernel scheduling delays caused by an APIC‑ID bug that inflated the possible‑CPU count, and resolved it by applying a kernel patch, offering verification steps for affected systems.
