Tagged articles
47 articles
Raymond Ops
Feb 2, 2026 · Operations

10 Essential PromQL Queries Every Ops Engineer Should Master

This article presents ten practical PromQL query examples covering CPU, memory, disk, network, database, Kubernetes, and business metrics, explains the underlying concepts, provides alert thresholds and best‑practice tips, and includes advanced optimization and alert‑rule design guidance for reliable monitoring.

Alerting, Metrics, Observability
22 min read
DevOps Coach
Nov 24, 2025 · Operations

10 Essential Grafana Dashboards to Spot Incidents Early

This guide presents ten essential Grafana dashboards—covering SLO burn, user‑journey funnel, infrastructure USE metrics, queue lag, database health, cache hit‑rate, CDN latency, rollout guardrails, trace topology, and a command‑center view—each explained with its purpose, panel layout, and ready‑to‑use PromQL or LogQL queries.

Dashboards, Grafana, Observability
13 min read
Linux Ops Smart Journey
Oct 21, 2025 · Operations

Master Nightingale Dashboards: Build Pie, Gauge, and Honeycomb Charts Step‑by‑Step

This guide walks you through creating effective Nightingale monitoring dashboards by configuring three common chart types—Metric (Gauge), Pie, and Honeycomb—including step‑by‑step PromQL queries, legend settings, panel options, styling, and advanced configurations to turn raw data into actionable visual insights.

Dashboard, PromQL, monitoring
4 min read
Raymond Ops
Oct 12, 2025 · Operations

Master PromQL: From Basics to Advanced Query Techniques

This comprehensive guide walks you through PromQL fundamentals, covering data types, gauge and counter metrics, time‑series concepts, query selectors, offsets, arithmetic and logical operators, vector matching, aggregation functions, and key Prometheus functions such as increase, rate, and histogram_quantile, with practical examples and visual illustrations.

Alerting, Metrics, PromQL
29 min read
MaGe Linux Operations
Sep 14, 2025 · Operations

10 Essential PromQL Queries Every Ops Engineer Must Master

This article presents ten practical PromQL query examples covering CPU, memory, disk, network, application, database, Kubernetes, and business metrics, along with key concepts, alerting thresholds, and best‑practice tips to help operations engineers build a comprehensive monitoring system in cloud‑native environments.

Kubernetes, PromQL
23 min read
Java One
Sep 3, 2025 · Operations

How to Install, Configure, and Run Prometheus: A Step‑by‑Step Guide

This guide walks you through installing Prometheus via binary download, configuring global scrape settings and job definitions, running the server with command‑line options, and using the web UI and PromQL to verify target health and query metrics, illustrated with screenshots and example queries.

Installation, Observability, PromQL
6 min read
Linux Ops Smart Journey
Aug 20, 2025 · Operations

How to Turn Abstract Metrics into Intuitive Gauges with Grafana

This guide explains why Grafana's Gauge panel creates a powerful visual metaphor for system pressure, walks through creating the gauge, configuring PromQL queries, setting panel options, thresholds, and JSON definitions, and shows how to produce clear, boss‑friendly monitoring dashboards.

Gauge panel, Grafana, JSON configuration
5 min read
Alibaba Cloud Observability
Jun 3, 2025 · Cloud Native

How PromQL Copilot Turns Natural Language into Precise Monitoring Queries

PromQL Copilot leverages Alibaba Cloud's observability platform and AI techniques to convert ambiguous natural‑language monitoring requests into accurate PromQL statements, addressing challenges of ambiguity, domain knowledge, and metric coverage while providing generation, explanation, diagnosis, and recommendation features for cloud‑native environments.

AI, Cloud Native, Metrics
12 min read
MaGe Linux Operations
May 7, 2025 · Operations

Master PromQL: From Basics to Advanced Query Techniques for Monitoring

This comprehensive guide walks you through PromQL fundamentals, data types, query expressions, selectors, operators, aggregation, and essential functions, illustrating each concept with real‑world monitoring scenarios and code examples to help you effectively query and analyze time‑series data in Prometheus.

PromQL, Prometheus, Time Series
32 min read
ITPUB
Aug 11, 2024 · Operations

Scaling Bilibili’s Metrics Platform with VictoriaMetrics and Flink Pre‑aggregation

This article details how Bilibili redesigned its monitoring system to overcome explosive metric growth by separating collection and storage, adopting VictoriaMetrics, implementing zone‑based scheduling, automating PromQL query replacement, and using Flink for efficient pre‑aggregation, resulting in dramatically lower latency and higher stability.

Flink, Observability, PromQL
31 min read
Efficient Ops
Mar 3, 2024 · Operations

Mastering Prometheus: From Metrics Collection to Alerting and Visualization

This comprehensive guide explains Prometheus' architecture, metric collection models, storage format, query language (PromQL), alerting workflow, configuration reload methods, metric types, custom exporters, and how to visualise data with Grafana, providing a complete end‑to‑end monitoring solution.

Grafana, Metrics, Observability
21 min read
Efficient Ops
Dec 24, 2023 · Operations

Avoid These 6 Common Prometheus Mistakes When Getting Started

This guide translates and condenses six frequent errors new Prometheus users make: high‑cardinality labels, losing valuable labels during aggregation, using bare selectors, omitting the "for" field in alerting rules, choosing too‑short rate windows, and applying rate‑related functions to the wrong metric types. It offers practical fixes to improve monitoring reliability.

Observability, PromQL, Prometheus
12 min read
Didi Tech
Sep 21, 2023 · Cloud Native

OBC: A Cloud-Native Real-Time Computing Engine for Metrics at Didi

To replace costly, duplicated Flink jobs, Didi built Observe‑Compute (OBC), a cloud‑native, PromQL‑driven real‑time metric engine with centralized policy management, scalable containerized workers, and zero‑downtime scaling, achieving million‑RMB annual savings while handling 10 M points per second.

Flink alternative, OBC, Observability
17 min read
dbaplus Community
Aug 7, 2023 · Operations

Why Prometheus Queries Slow Down and How Recording Rules Speed Them Up

The article examines performance bottlenecks in Prometheus‑Grafana monitoring dashboards caused by high metric cardinality, explains the internal query processing steps, demonstrates how to analyze and reduce cardinality with PromQL and recording rules, and shows concrete command‑line examples that dramatically improve query latency.

Cardinality, PromQL, Recording Rules
10 min read
MaGe Linux Operations
Mar 30, 2023 · Operations

Demystifying PromQL: How Nested Functional Queries Work in Prometheus

This article explores the structure and evaluation of PromQL queries, covering its nested functional language nature, expression types, time handling with instant and range queries, and practical examples using the PromLens visualizer, helping readers grasp how Prometheus processes and types queries.

Observability, PromQL, Time Series
11 min read
Open Source Linux
Dec 8, 2022 · Operations

Master Prometheus: From Metrics Collection to Alerting and Visualization

Prometheus is an open‑source monitoring solution that covers metric exposition, scraping, storage, querying, visualization, and alerting, and this guide walks through its architecture, configuration, custom exporters, PromQL queries, Grafana integration, and alert management, providing a comprehensive introduction for developers and ops engineers.

Alerting, Exporter, Grafana
22 min read
macrozheng
Nov 19, 2022 · Operations

Unlocking Prometheus: Visual Guide to Architecture, Metrics, and Alerts

This article visually explains Prometheus’s architecture, core features, metric collection methods, exporters, PromQL query language, and alerting workflow, helping readers understand how to monitor cloud‑native systems effectively while noting its strengths and limitations.

Alerting, Exporters, Metrics
8 min read
21CTO
Jun 28, 2022 · Operations

Master Prometheus: From Metrics Collection to Alerts and Grafana Visualization

This comprehensive guide walks you through Prometheus fundamentals, including metric exposure, scraping, storage, querying with PromQL, custom exporter creation in Go, dynamic configuration reloading, and visualizing data with Grafana, while also covering alerting with Alertmanager and best practices for accurate histogram bucket design.

Alerting, Grafana, Metrics
20 min read
IT Architects Alliance
Jun 27, 2022 · Operations

Comprehensive Guide to Prometheus: Metrics Collection, Storage, Querying, Alerting and Visualization

This article provides a detailed overview of Prometheus, covering its architecture, metric exposure, scraping models, storage format, metric types, custom exporter implementation in Go, PromQL query language, built‑in functions, Grafana integration, and alerting with Alertmanager, offering practical code examples throughout.

Alerting, Go, Grafana
20 min read
Architecture Digest
Jun 11, 2022 · Operations

Comprehensive Introduction to Prometheus: Architecture, Metrics, Configuration, PromQL, Exporters, Visualization, and Alerting

This article provides a thorough overview of Prometheus, covering its ecosystem, how metrics are exposed and scraped, storage and query mechanisms, metric types, PromQL usage, exporter implementation, dynamic configuration reload, Grafana visualization, and Alertmanager alerting, with practical code examples throughout.

Exporters, Grafana, PromQL
21 min read
Tencent Cloud Developer
May 30, 2022 · Cloud Native

An Introduction to Prometheus: Metrics Collection, Storage, Querying, Visualization and Alerting

Prometheus is an open‑source monitoring system that scrapes metrics from services or exporters, stores them in a time‑series database, lets users query with PromQL, visualizes data via its web UI or Grafana, and sends alerts through Alertmanager, supporting custom Go metrics, various discovery methods, and four metric types.

Alerting, Go, Grafana
21 min read
Shopee Tech Team
Sep 23, 2021 · Big Data

Design and Architecture of the Boussole Real-Time Multi-Dimensional Data Analysis Engine

Boussole is Shopee’s real‑time analytics engine that transforms each dimension into key‑value pairs stored primarily in HBase, pre‑aggregates selected dimension combos, hashes metrics and tags, executes distributed PromQL queries with a CockroachDB‑inspired executor, applies Delta‑of‑Delta compression and point‑capping, and continues to evolve with adaptive pre‑aggregation and new storage models to maintain millisecond latency for massive multi‑dimensional analysis.

Distributed Query, Pre-aggregation, PromQL
24 min read
Code Ape Tech Column
Jun 19, 2021 · Operations

Master Prometheus: From Installation to Advanced Monitoring with Grafana

This comprehensive guide walks you through Prometheus' origins, core features, installation methods, configuration files, PromQL basics, exporter setup, Grafana integration, alerting with Alertmanager, and advanced topics like service discovery, providing a complete roadmap for building a production‑grade monitoring system.

Alertmanager, Docker, Grafana
34 min read
Open Source Linux
Jun 3, 2021 · Operations

Master Kubernetes Capacity Planning: Detect & Optimize Unused Resources

This guide explains Kubernetes capacity planning, showing how to detect idle CPU and memory, identify wasteful namespaces, use open‑source tools like kube‑state‑metrics and cAdvisor, and apply PromQL queries to optimize resource requests and measure the impact of your improvements.

Kubernetes, PromQL, Resource Optimization
10 min read
Big Data Technology & Architecture
Apr 26, 2021 · Operations

Comprehensive Guide to Prometheus: Installation, Configuration, PromQL, Exporters, Grafana, and Alerting

This article provides a complete tutorial on Prometheus, covering its origins, core features, installation methods (binary and Docker), configuration file structure, PromQL basics, HTTP API usage, Grafana integration, various exporters for metrics collection, and alerting with Alertmanager, all within a cloud‑native monitoring context.

Alerting, Exporters, Grafana
32 min read
Architect
Feb 26, 2021 · Operations

Comprehensive Guide to Prometheus: Overview, Installation, Configuration, PromQL, Exporters, Grafana Integration, and Alerting

This article provides a detailed introduction to Prometheus, covering its history, core features, installation methods, configuration file structure, PromQL basics, various exporters, Grafana visualization, alerting with Alertmanager, service discovery, and best‑practice recommendations for building a production‑grade monitoring system.

Alertmanager, Exporters, Grafana
34 min read
Aikesheng Open Source Community
Aug 24, 2020 · Operations

Prometheus Data Query Basics and Practical Usage Guide

This article introduces Prometheus' query language PromQL, explains instant and range vector selectors, label matching, offset handling, storage design, common functions and aggregation operators, and provides practical advice for efficient querying and avoiding performance issues.

Operations, PromQL, Prometheus
13 min read
Programmer DD
Sep 20, 2019 · Operations

Master Prometheus: Key Features, Architecture, and Query Essentials

This article introduces Prometheus, an open‑source cloud‑native monitoring and alerting system, covering its main characteristics, core components, architecture diagram, typical use cases, query language syntax, built‑in functions, time‑series types, and practical tips for reliable operation.

Alerting, Operations, PromQL
9 min read
360 Zhihui Cloud Developer
Aug 30, 2017 · Operations

Mastering Prometheus: From Metrics Basics to High‑Availability Monitoring

This article shares practical experiences of using Prometheus for monitoring complex services, covering metric types, PromQL query techniques, naming conventions, service discovery with file‑based configs, high‑availability sharding, alerting via Alertmanager, and visualisation with Grafana, providing actionable guidance for reliable observability.

Grafana, PromQL, Prometheus
15 min read