Efficient Ops
Nov 23, 2022 · Operations
How to Diagnose and Fix Node2 Ceph‑Related cgroup Leaks in a Kubernetes Cluster
This article walks through a real‑world Kubernetes incident where a node ran out of space due to Ceph storage inconsistencies and cgroup leaks, detailing step‑by‑step diagnostics, Ceph repair commands, pod eviction, node reboot, and post‑mortem recommendations for cluster operations.
CephCluster OperationsKubernetes
0 likes · 6 min read