Tagged articles
25 articles
Page 1 of 1
dbaplus Community
dbaplus Community
Jan 4, 2026 · Cloud Native

Why One in a Million Searches Slowed 100× After Moving to Kubernetes

During Pinterest’s migration of its custom search platform Manas to the PinCompute Kubernetes environment, a rare latency spike—one request per million taking 100 times longer—was traced to cAdvisor’s memory‑intensive smaps scans, revealing hidden resource contention and prompting a targeted fix.

KubernetesMemory ManagementPerformance debugging
0 likes · 13 min read
Why One in a Million Searches Slowed 100× After Moving to Kubernetes
DevOps Coach
DevOps Coach
Sep 20, 2025 · Cloud Native

Why a Tiny Memory‑Intensive Process Caused 100× Latency Spikes After Pinterest’s Search Migration to Kubernetes

During Pinterest’s migration of its high‑traffic Manas search platform to the PinCompute Kubernetes environment, engineers observed an extremely rare latency outlier—one in a million requests took 100 times longer—prompting a deep investigation that traced the root cause to cAdvisor’s memory‑intensive smaps scans interfering with leaf node processing.

Cloud NativeKubernetesMemory Management
0 likes · 14 min read
Why a Tiny Memory‑Intensive Process Caused 100× Latency Spikes After Pinterest’s Search Migration to Kubernetes
Raymond Ops
Raymond Ops
May 9, 2025 · Operations

Build a Complete Prometheus Monitoring Stack with Docker

This tutorial explains Prometheus' core components, shows how to deploy Prometheus Server, Node Exporter, cAdvisor, and Grafana as Docker containers on two hosts, configures scraping and alerting, and demonstrates visualizing metrics with ready‑made Grafana dashboards.

AlertmanagerDockerExporter
0 likes · 8 min read
Build a Complete Prometheus Monitoring Stack with Docker
Alibaba Cloud Native
Alibaba Cloud Native
Jan 14, 2025 · Cloud Native

Unlocking Kubernetes IO Insights with ACK’s New Storage Monitoring Dashboards

This article explains how Alibaba Cloud Container Service for Kubernetes (ACK) has upgraded its storage monitoring dashboards to provide detailed visibility into local, PVC, and cloud‑based volumes, enabling users to detect IO bottlenecks, track real‑time read/write performance, and improve overall container reliability.

ACKCloud NativeDashboard
0 likes · 8 min read
Unlocking Kubernetes IO Insights with ACK’s New Storage Monitoring Dashboards
Raymond Ops
Raymond Ops
Dec 19, 2024 · Operations

How to Auto‑Scale Non‑CPU Apps with cAdvisor Network Metrics in Kubernetes

This guide explains how to use cAdvisor‑provided container network traffic counters as custom metrics for Kubernetes HPA, covering metric collection, Prometheus‑adapter configuration, verification, and a complete HPA testing workflow for elastic scaling of non‑CPU‑intensive workloads.

HPAKubernetesPrometheus
0 likes · 7 min read
How to Auto‑Scale Non‑CPU Apps with cAdvisor Network Metrics in Kubernetes
MaGe Linux Operations
MaGe Linux Operations
Mar 16, 2024 · Cloud Native

Scaling Non‑CPU‑Bound Apps with HPA Using cAdvisor Network Metrics

This guide shows how to enable Horizontal Pod Autoscaling for traffic‑driven workloads by leveraging cAdvisor's container network receive and transmit byte counters, converting them to per‑second rates with Prometheus‑adapter, and validating the custom metric through Kubernetes commands and console views.

Cloud NativeHPAKubernetes
0 likes · 7 min read
Scaling Non‑CPU‑Bound Apps with HPA Using cAdvisor Network Metrics
Efficient Ops
Efficient Ops
Dec 10, 2023 · Cloud Native

How to Build a Complete Kubernetes Monitoring Stack with Prometheus & Grafana

This guide walks through a full Kubernetes monitoring solution using cAdvisor, node_exporter, Prometheus, and Grafana, covering architecture, data collection, service discovery, deployment steps with DaemonSets, and detailed YAML configurations for a production‑ready observability stack.

GrafanaKubernetesPrometheus
0 likes · 6 min read
How to Build a Complete Kubernetes Monitoring Stack with Prometheus & Grafana
Architect
Architect
Sep 7, 2023 · Cloud Native

How Vivo Scaled Container Monitoring with Prometheus, Kafka, and VictoriaMetrics

This article details how Vivo's container platform faced exploding metric volumes, component overload, data gaps, and storage spikes, and explains the step‑by‑step architectural redesign, metric governance, performance tuning, cAdvisor redeployment, and VictoriaMetrics upgrade that restored high‑availability, low‑latency monitoring across a large Kubernetes fleet.

Cloud NativeKubernetesPrometheus
0 likes · 18 min read
How Vivo Scaled Container Monitoring with Prometheus, Kafka, and VictoriaMetrics
Efficient Ops
Efficient Ops
Jun 19, 2023 · Cloud Native

How Do Kubernetes Resource Limits Really Work? A Deep Dive into CPU Throttling

This article explains how Kubernetes resource limits function, how to interpret CPU limits as time slices, the Linux accounting system behind them, relevant Prometheus metrics for detecting throttling, practical examples with multithreaded containers, and guidance on setting alerts and avoiding performance pitfalls.

CPU throttlingKubernetesLinux accounting
0 likes · 12 min read
How Do Kubernetes Resource Limits Really Work? A Deep Dive into CPU Throttling
DevOps Operations Practice
DevOps Operations Practice
Apr 26, 2023 · Cloud Native

Monitoring Docker Containers with cAdvisor and Prometheus

This guide explains how to monitor Docker containers using the open‑source cAdvisor tool, integrate its metrics with Prometheus, and visualize the data in Grafana, providing step‑by‑step commands and configuration examples for a complete container‑monitoring solution.

Cloud NativeGrafanaPrometheus
0 likes · 5 min read
Monitoring Docker Containers with cAdvisor and Prometheus
MaGe Linux Operations
MaGe Linux Operations
Feb 10, 2023 · Cloud Native

How to Detect and Prevent OOM and CPU Throttling in Kubernetes Pods

This article explains why Kubernetes pods encounter out‑of‑memory errors and CPU throttling, how limits and requests influence resource allocation, and provides practical monitoring techniques using Prometheus and cAdvisor to proactively identify and mitigate these issues before they impact performance or cause pod eviction.

CPU throttlingOOMcAdvisor
0 likes · 9 min read
How to Detect and Prevent OOM and CPU Throttling in Kubernetes Pods
Efficient Ops
Efficient Ops
Jan 15, 2023 · Cloud Native

Understanding kubectl top: How Kubernetes Monitors Nodes and Pods

This article explains how the kubectl top command retrieves real‑time CPU and memory metrics for Kubernetes nodes and pods, details the underlying data flow, metric‑server and cAdvisor architecture, and addresses common issues and calculation differences compared to traditional system tools.

KubernetescAdvisorkubectl top
0 likes · 15 min read
Understanding kubectl top: How Kubernetes Monitors Nodes and Pods
Efficient Ops
Efficient Ops
Nov 30, 2022 · Cloud Native

How kubectl top Retrieves Real‑Time Metrics in Kubernetes: A Deep Dive

This article explains how the kubectl top command gathers real‑time CPU and memory usage for nodes and pods, details the underlying data flow and metric API implementation in Kubernetes, compares heapster and metrics‑server, and addresses common troubleshooting scenarios.

HeapsterKubernetescAdvisor
0 likes · 15 min read
How kubectl top Retrieves Real‑Time Metrics in Kubernetes: A Deep Dive
MaGe Linux Operations
MaGe Linux Operations
Sep 4, 2022 · Cloud Native

Understanding kubectl top: How Kubernetes Metrics Are Collected and Interpreted

This article explains how the kubectl top command retrieves real‑time CPU and memory usage for nodes and pods, details the underlying data flow from cAdvisor through heapster or metrics‑server, clarifies metric calculations, compares results with native top and docker stats, and addresses common errors and troubleshooting steps.

KubernetescAdvisorkubectl top
0 likes · 14 min read
Understanding kubectl top: How Kubernetes Metrics Are Collected and Interpreted
Efficient Ops
Efficient Ops
Aug 29, 2022 · Cloud Native

Understanding kubectl top: How Kubernetes Metrics Work and Common Issues

This article explains how kubectl top retrieves real‑time CPU and memory usage for nodes and pods, details the underlying data flow and metric‑server architecture, and addresses frequent errors such as missing components, pause‑container accounting, and differences from host top or docker stats.

KubernetescAdvisorkubectl top
0 likes · 14 min read
Understanding kubectl top: How Kubernetes Metrics Work and Common Issues
G7 EasyFlow Tech Circle
G7 EasyFlow Tech Circle
Dec 30, 2021 · Cloud Native

Why Kubernetes OOM Kills Use WSS, Not RSS – Diagnose & Fix Container Memory

After moving IoT services to Kubernetes, containers were OOM‑killed despite RSS staying below limits because Kubernetes bases OOM decisions on the Working Set Size (WSS) metric, which includes file cache, and the article explains its calculation, reproduces the issue, and offers practical mitigation strategies.

Cache ManagementContainer MemoryKernel Parameters
0 likes · 12 min read
Why Kubernetes OOM Kills Use WSS, Not RSS – Diagnose & Fix Container Memory
Open Source Linux
Open Source Linux
Nov 24, 2021 · Cloud Native

How to Build a Container Monitoring Stack with CAdvisor, InfluxDB, and Grafana

Learn how to set up a comprehensive container monitoring solution using CAdvisor for metrics collection, InfluxDB for time‑series storage, and Grafana for visualization, including deployment steps, integration details, common issues, and best‑practice configurations for reliable Docker‑based environments.

Cloud NativeDockerGrafana
0 likes · 17 min read
How to Build a Container Monitoring Stack with CAdvisor, InfluxDB, and Grafana
dbaplus Community
dbaplus Community
Sep 27, 2021 · Operations

6 Powerful Alternatives to Prometheus for Kubernetes Monitoring

Monitoring ensures Kubernetes applications run smoothly, and while Prometheus is a popular open‑source solution, this article examines six viable alternatives—Grafana, cAdvisor, Fluentd, Jaeger, Telepresence, and Zabbix—detailing their key features, strengths, and use‑cases for effective cluster observability.

FluentdGrafanaKubernetes
0 likes · 10 min read
6 Powerful Alternatives to Prometheus for Kubernetes Monitoring
Efficient Ops
Efficient Ops
Oct 8, 2019 · Operations

Build a Docker Container Monitoring Stack with CAdvisor, InfluxDB, Grafana

To effectively monitor Dockerized services, this guide walks through selecting a monitoring solution, deploying CAdvisor, integrating it with InfluxDB for persistent storage, visualizing metrics via Grafana, and addressing common issues such as missing utilities, memory stats, and network traffic inaccuracies.

GrafanaInfluxDBOperations
0 likes · 15 min read
Build a Docker Container Monitoring Stack with CAdvisor, InfluxDB, Grafana
dbaplus Community
dbaplus Community
Aug 19, 2016 · Operations

Unlocking System Reliability: The Value and Complete Architecture of Monitoring for Containers

This article explains why monitoring is essential for system reliability, outlines the key components of a comprehensive monitoring framework, compares data collection methods, and presents practical container monitoring solutions—from Docker stats to cAdvisor with InfluxDB and Grafana, as well as Kubernetes and Mesos integrations.

GrafanaKubernetesPrometheus
0 likes · 14 min read
Unlocking System Reliability: The Value and Complete Architecture of Monitoring for Containers