Tagged articles
5 articles
21CTO
Feb 9, 2022 · Operations

Why Roblox’s Three‑Day Outage Happened: Consul Streaming Bug and BoltDB Design Flaw

Roblox’s detailed post‑mortem traces its three‑day outage to two causes: a bug in Consul’s streaming feature and a design flaw in BoltDB’s freelist. Together they caused CPU contention and latency spikes across its massive on‑premises infrastructure. In response, the team disabled streaming, added a second data center, and redesigned its architecture.

BoltDB · Consul · Infrastructure
9 min read
Baidu Intelligent Testing
Nov 16, 2021 · Cloud Native

Service Governance and etcd: Concepts, Raft & BoltDB Implementation, and Large‑Scale Practices at Baidu

This article introduces service governance fundamentals, explains how etcd’s Raft‑based consensus and BoltDB storage work, compares etcd with ZooKeeper and Consul, and describes Baidu’s large‑scale, high‑availability, high‑performance service‑governance platform built on these technologies.

BoltDB · Cloud Native · Raft
20 min read
Baidu Geek Talk
Nov 10, 2021 · Operations

How etcd Powers Scalable Service Governance: Raft, BoltDB, and Real‑World Practices

This article explores service governance fundamentals, examines why etcd’s Raft‑based consensus and BoltDB storage make it ideal for large‑scale systems, compares it with ZooKeeper and Consul, and shares Baidu’s practical architecture, performance tricks, and operational metrics for high‑availability, high‑performance service management.

BoltDB · Distributed Systems · Performance Optimization
23 min read
Alibaba Cloud Native
Sep 19, 2019 · Databases

How etcd’s Fully Concurrent Read Boosts Kubernetes Performance

This article reviews the evolution of etcd’s read‑write mechanisms, explains the “Fully Concurrent Read” feature introduced in etcd 3.4, and presents experimental results showing how it dramatically reduces expensive read latency and improves overall throughput in Kubernetes clusters.

BoltDB · Fully Concurrent Read · Kubernetes
14 min read