dbaplus Community
dbaplus Community
Sep 7, 2024 · Operations

What Hidden Costs Do You Face When Chasing 5‑Nines Availability?

Achieving five‑nine (99.999%) uptime demands massive capital, operational, and human investments, and this article breaks down the infrastructure, monitoring, testing, staffing expenses and explains why the marginal benefits sharply diminish as availability targets rise.

availability engineeringfive nineshigh availability
0 likes · 8 min read
What Hidden Costs Do You Face When Chasing 5‑Nines Availability?
Baidu Geek Talk
Baidu Geek Talk
Jun 30, 2021 · Operations

How Baidu Achieves 5‑9+ Availability: Inside Its Stability Engineering and Observability

This article dissects Baidu Search's ultra‑large micro‑service architecture, detailing the challenges of maintaining five‑nine‑plus availability, the diverse failure modes, and the step‑by‑step evolution of its observability stack—from early log‑only analysis to the kepler1.0/kepler2.0 tracing, full‑log indexing, custom span‑id generation, and compression techniques that together enable rapid root‑cause diagnosis at massive scale.

Baidu SearchDistributed TracingMetrics
0 likes · 21 min read
How Baidu Achieves 5‑9+ Availability: Inside Its Stability Engineering and Observability