Tag

node troubleshooting

0 views collected around this technical thread.

Efficient Ops
Efficient Ops
Nov 23, 2022 · Operations

How to Diagnose and Fix Node2 Ceph‑Related cgroup Leaks in a Kubernetes Cluster

This article walks through a real‑world Kubernetes incident where a node ran out of space due to Ceph storage inconsistencies and cgroup leaks, detailing step‑by‑step diagnostics, Ceph repair commands, pod eviction, node reboot, and post‑mortem recommendations for cluster operations.

CephCluster OperationsKubernetes
0 likes · 6 min read
How to Diagnose and Fix Node2 Ceph‑Related cgroup Leaks in a Kubernetes Cluster