Tagged articles
4 articles
Page 1 of 1
DevOps Coach
DevOps Coach
Nov 14, 2022 · Operations

Inside Google’s Retired File Server Backend: Exploring the Main Directory

This case study examines how Google decommissioned its legacy file‑server backend, focusing on the design, management, and migration of the main directory, and highlights the operational lessons and SRE practices that ensured a smooth transition without service disruption.

File ServerGoogle SREOperations
0 likes · 2 min read
Inside Google’s Retired File Server Backend: Exploring the Main Directory
TAL Education Technology
TAL Education Technology
Feb 10, 2022 · Operations

Client‑Side Circuit Breaking Strategies: State Machine, Google SRE Breaker, and Mitigation Techniques

This article explains why client‑side circuit breaking is essential, describes common state‑machine and Google SRE breaker strategies, provides practical pseudocode, and discusses mitigation methods such as Gutter mode, jittered exponential backoff, and graceful degradation to protect system stability.

Circuit BreakingGoogle SREclient-side
0 likes · 14 min read
Client‑Side Circuit Breaking Strategies: State Machine, Google SRE Breaker, and Mitigation Techniques
dbaplus Community
dbaplus Community
Nov 21, 2020 · Operations

What Google’s Debugging Playbook Can Teach Distributed Storage Teams

Drawing on Google’s SRE experience and the author’s work with Filecoin, this article outlines practical strategies for debugging large‑scale distributed systems, covering organizational culture, measurement, blameless postmortems, engineer mindsets, incident response steps, and tooling recommendations.

FilecoinGoogle SREpostmortem
0 likes · 15 min read
What Google’s Debugging Playbook Can Teach Distributed Storage Teams
Efficient Ops
Efficient Ops
Sep 5, 2016 · Operations

Inside Google’s Data Centers: How SRE Manages Hardware, Borg, and Global Services

This article explains how Google’s Site Reliability Engineering team designs and operates uniform hardware in its data centers, uses the Borg cluster manager, implements storage layers, SDN networking, monitoring, and a sample Shakespeare search service to achieve high‑availability, scalable production services.

BorgDistributed SystemsGoogle SRE
0 likes · 21 min read
Inside Google’s Data Centers: How SRE Manages Hardware, Borg, and Global Services