Ops Development Stories
Ops Development Stories
Sep 4, 2025 · Cloud Native

Why Kubernetes HPA Ignores High CPU Usage and How Tolerance Affects Scaling

This article explains the internal architecture and source‑code flow of Kubernetes Horizontal Pod Autoscaler, detailing how components like HorizontalController and ReplicaCalculator compute desired replicas, why a default 10% tolerance can prevent scaling even when CPU exceeds the target, and how behavior policies and scaling limits influence HPA decisions.

HPAHorizontal Pod AutoscalerKubernetes
0 likes · 16 min read
Why Kubernetes HPA Ignores High CPU Usage and How Tolerance Affects Scaling