Tagged articles
368 articles
Page 1 of 4
Java Architect Essentials
Java Architect Essentials
Apr 26, 2026 · Backend Development

15 SpringBoot Performance Tweaks to Handle Million-Scale Concurrency

This guide walks through exposing metrics, integrating Prometheus and Grafana, using async‑profiler flame graphs, tuning Tomcat/Undertow, optimizing JVM flags, applying SkyWalking tracing, and applying layer‑wise code, cache, and thread‑pool improvements so a SpringBoot service can reliably serve millions of concurrent requests.

GrafanaNGINXPrometheus
0 likes · 20 min read
15 SpringBoot Performance Tweaks to Handle Million-Scale Concurrency
Raymond Ops
Raymond Ops
Apr 18, 2026 · Operations

How to Build a Lightweight Log Platform with Grafana and Loki in 3 Simple Steps

This guide walks you through replacing a heavyweight ELK stack with a minimal Grafana‑Loki logging solution, covering environment requirements, installation of Loki and Promtail, configuration details, best‑practice tips, troubleshooting, and backup strategies for reliable log aggregation.

GrafanaLokiObservability
0 likes · 25 min read
How to Build a Lightweight Log Platform with Grafana and Loki in 3 Simple Steps
Ops Community
Ops Community
Apr 18, 2026 · Operations

Master Linux Host Monitoring: Prometheus, Node Exporter, Thresholds & Scripts

This comprehensive guide walks you through building a robust Linux host monitoring system with Prometheus and node_exporter, covering CPU, memory, disk, and network metrics, practical threshold formulas, ready‑to‑run Bash scripts, Alertmanager rules, Grafana dashboards, and best‑practice recommendations for reliable operations.

AlertmanagerGrafanaLinux monitoring
0 likes · 49 min read
Master Linux Host Monitoring: Prometheus, Node Exporter, Thresholds & Scripts
Lobster Programming
Lobster Programming
Apr 8, 2026 · Big Data

How to Implement Real‑Time API Traffic Counting at Scale

This article compares three practical approaches—direct database storage, a Flink‑Kafka‑Redis‑Grafana pipeline, and an ELK stack—to achieve real‑time API request counting for high‑concurrency scenarios, outlining their architectures, advantages, and trade‑offs.

API analyticsELKFlink
0 likes · 6 min read
How to Implement Real‑Time API Traffic Counting at Scale
Linux Tech Enthusiast
Linux Tech Enthusiast
Apr 7, 2026 · Operations

Top 10 Essential Tools Every Ops Engineer Uses Daily

This article enumerates ten widely used operations tools—Shell scripts, Git, Ansible, Prometheus, Grafana, Docker, Kubernetes, Nginx, ELK Stack, and Zabbix—detailing each tool's function, suitable scenarios, advantages, and concrete usage examples for daily sysadmin tasks.

AnsibleDockerELK
0 likes · 8 min read
Top 10 Essential Tools Every Ops Engineer Uses Daily
Ops Community
Ops Community
Apr 2, 2026 · Operations

Build a Production‑Ready Prometheus + Grafana Monitoring Stack in Minutes

Learn how to quickly set up a complete, production‑grade monitoring system using Prometheus 3.x and Grafana 11, covering installation, service discovery, PromQL queries, recording rules, Alertmanager routing, Grafana dashboards, best‑practice configurations, and troubleshooting for environments of any size.

AlertingGrafanacloud-native
0 likes · 55 min read
Build a Production‑Ready Prometheus + Grafana Monitoring Stack in Minutes
Raymond Ops
Raymond Ops
Feb 24, 2026 · Cloud Native

Master Enterprise Monitoring: Build a Prometheus + Grafana Observability Platform

This guide details how to design and implement an enterprise‑grade cloud‑native observability platform using Prometheus for metrics collection and Grafana for visualization, covering architecture, high‑availability deployment, alerting, dashboard automation, case studies, best‑practice recommendations, and future trends.

Cloud NativeGrafanaObservability
0 likes · 24 min read
Master Enterprise Monitoring: Build a Prometheus + Grafana Observability Platform
Woodpecker Software Testing
Woodpecker Software Testing
Feb 12, 2026 · Operations

How to Build a Full‑Chain JMeter Load Test for an E‑Commerce Mega‑Sale

This article walks through designing and implementing a complete JMeter load‑testing solution for an e‑commerce platform's big‑sale scenario, covering business‑flow mapping, request correlation, multi‑stage stress testing, real‑time monitoring with InfluxDB + Grafana, bottleneck identification, and practical optimization tips.

GrafanaInfluxDBJMeter
0 likes · 7 min read
How to Build a Full‑Chain JMeter Load Test for an E‑Commerce Mega‑Sale
Big Data Tech Team
Big Data Tech Team
Jan 22, 2026 · Industry Insights

Top 10 Open‑Source Data Visualization Platforms You Should Know

This article presents a concise overview of ten popular open‑source data visualization tools—including Echarts, D3.js, Grafana, Plotly, Redash, Metabase, Superset, Kibana, AntV, and Pyecharts—highlighting their main features, typical use cases, and visual examples to help readers choose the right solution for their needs.

Big DataD3.jsData visualization
0 likes · 6 min read
Top 10 Open‑Source Data Visualization Platforms You Should Know
DevOps Coach
DevOps Coach
Jan 21, 2026 · Operations

How I Turned a 3‑Day Latency Nightmare into a 30‑Second Debugging Tool

After a late‑night PagerDuty alert revealed a p95 latency over 5 seconds despite normal CPU, memory, and database metrics, the author spent three days tracing the issue to a tiny thread‑pool configuration, then built an open‑source CLI that automates the entire diagnosis in seconds.

Connection PoolGrafanalatency debugging
0 likes · 17 min read
How I Turned a 3‑Day Latency Nightmare into a 30‑Second Debugging Tool
Raymond Ops
Raymond Ops
Jan 12, 2026 · Operations

Build a Real-Time Linux Performance Alert System with Prometheus & Grafana

This guide walks you through designing a layered Linux monitoring architecture, selecting a Prometheus‑Grafana stack, defining key CPU, memory and disk metrics, crafting smart alert rules, visualizing dashboards, and adding automation and AI‑driven predictive techniques for reliable, business‑focused operations.

GrafanaLinuxOps
0 likes · 13 min read
Build a Real-Time Linux Performance Alert System with Prometheus & Grafana
Ray's Galactic Tech
Ray's Galactic Tech
Dec 2, 2025 · Operations

Build an End‑to‑End AIOps Solution: Log Alerts and Automated Self‑Healing Ops

This guide walks through designing and implementing an intelligent operations workflow that transforms passive log monitoring into proactive alerting and automated remediation, covering core concepts, tech‑stack selection, step‑by‑step configuration of log collection, alert rules, webhook integration, Ansible automation, and best‑practice considerations for scaling and security.

AlertingAnsibleGrafana
0 likes · 7 min read
Build an End‑to‑End AIOps Solution: Log Alerts and Automated Self‑Healing Ops
MaGe Linux Operations
MaGe Linux Operations
Nov 28, 2025 · Operations

10 Essential Linux Ops Tools Every Engineer Should Master

This article presents a curated list of ten widely used Linux operations tools, detailing each tool's core functions, typical use cases, key advantages, and real‑world examples, while also providing practical shell and Ansible code snippets to help engineers apply them immediately.

AnsibleDockerGrafana
0 likes · 9 min read
10 Essential Linux Ops Tools Every Engineer Should Master
DevOps Coach
DevOps Coach
Nov 24, 2025 · Operations

10 Essential Grafana Dashboards to Spot Incidents Early

This guide presents ten essential Grafana dashboards—covering SLO burn, user‑journey funnel, infrastructure USE metrics, queue lag, database health, cache hit‑rate, CDN latency, rollout guardrails, trace topology, and a command‑center view—each explained with its purpose, panel layout, and ready‑to‑use PromQL or LogQL queries.

DashboardsGrafanaObservability
0 likes · 13 min read
10 Essential Grafana Dashboards to Spot Incidents Early
Liangxu Linux
Liangxu Linux
Nov 6, 2025 · Operations

Top 6 Free Open‑Source Network Monitoring Tools You Should Know

This article introduces six free open‑source network monitoring solutions—Zabbix, Prometheus, Cacti, Grafana, OpenNMS, and Nagios—explaining their key features, how they collect and visualize metrics, and why they are valuable for maintaining system stability and security.

GrafanaNagiosNetwork Monitoring
0 likes · 5 min read
Top 6 Free Open‑Source Network Monitoring Tools You Should Know
JakartaEE China Community
JakartaEE China Community
Nov 4, 2025 · Operations

How Logs, Traces, and Metrics Differ—and Why It Matters

Logs, tracing, and metrics each serve distinct monitoring goals—logs capture discrete events for debugging and audit, traces map request flows to pinpoint performance bottlenecks, and metrics provide time‑series health data; understanding their differences and integrating tools like ELK, OpenTelemetry, Prometheus, and Grafana enables robust observability.

ELKGrafanaObservability
0 likes · 7 min read
How Logs, Traces, and Metrics Differ—and Why It Matters
DevOps Coach
DevOps Coach
Oct 22, 2025 · Cloud Native

Simplify Scalable Kubernetes Pod Logging with Grafana podLogs

This guide explains how Grafana's podLogs feature, powered by Vector.dev, transforms raw Kubernetes pod logs into enriched, searchable, cluster‑wide observability data, covering why pod‑level logs matter, configuration steps, advanced custom log paths, and practical examples.

Cloud NativeGrafanaKubernetes
0 likes · 14 min read
Simplify Scalable Kubernetes Pod Logging with Grafana podLogs
MaGe Linux Operations
MaGe Linux Operations
Sep 29, 2025 · Cloud Native

How to Build an Enterprise‑Grade Monitoring & Alerting System with Prometheus and Grafana

This article explains how to design and implement a cloud‑native observability platform using Prometheus and Grafana, covering architecture evolution, core Prometheus concepts, high‑availability cluster deployment, storage tuning, sharding, alert rule design, Grafana dashboard automation, multi‑cluster monitoring, and best‑practice recommendations for modern enterprises.

Cloud Native MonitoringGrafanaKubernetes
0 likes · 26 min read
How to Build an Enterprise‑Grade Monitoring & Alerting System with Prometheus and Grafana
Code Ape Tech Column
Code Ape Tech Column
Sep 12, 2025 · Operations

Master Grafana & Prometheus: Step‑by‑Step Guide to Build a Full‑Featured Monitoring System

This comprehensive tutorial walks you through installing and configuring Grafana, Prometheus, and related exporters, setting up dashboards, enabling email alerts, and extending monitoring to MySQL, RabbitMQ, Redis, and TiDB, all while providing clear code snippets and practical tips for a robust observability stack.

AlertingDevOpsGrafana
0 likes · 24 min read
Master Grafana & Prometheus: Step‑by‑Step Guide to Build a Full‑Featured Monitoring System
php Courses
php Courses
Aug 29, 2025 · Operations

How to Build a Real‑Time PHP Log Event Pipeline for Instant Insights

Learn how to transform PHP logs into real‑time, structured events by implementing a log event pipeline that includes JSON logging, lightweight collectors like Filebeat, streaming platforms such as Kafka or Flink, enrichment, and visualization with Grafana, enabling instant monitoring, alerting, and data‑driven decisions.

FlinkGrafanaKafka
0 likes · 7 min read
How to Build a Real‑Time PHP Log Event Pipeline for Instant Insights
Linux Ops Smart Journey
Linux Ops Smart Journey
Aug 20, 2025 · Operations

How to Turn Abstract Metrics into Intuitive Gauges with Grafana

This guide explains why Grafana's Gauge panel creates a powerful visual metaphor for system pressure, walks through creating the gauge, configuring PromQL queries, setting panel options, thresholds, and JSON definitions, and shows how to produce clear, boss‑friendly monitoring dashboards.

Gauge panelGrafanaJSON configuration
0 likes · 5 min read
How to Turn Abstract Metrics into Intuitive Gauges with Grafana
MaGe Linux Operations
MaGe Linux Operations
Aug 16, 2025 · Operations

How to Build a Real-Time Linux Performance Alert System

Discover why conventional monitoring often fails and learn to construct a robust, three‑layer Linux performance alert system using Prometheus, Grafana, and Alertmanager, with detailed metric definitions, smart alert rules, visual dashboards, predictive capacity planning, automation scripts, and best‑practice guidelines for reliable operations.

GrafanaLinux monitoring
0 likes · 13 min read
How to Build a Real-Time Linux Performance Alert System
MaGe Linux Operations
MaGe Linux Operations
Jul 23, 2025 · Cloud Native

Build a Real‑Time eBPF‑Based Kubernetes Network Anomaly Detector

This article walks through designing and implementing a zero‑intrusion, real‑time network anomaly detection system for Kubernetes using eBPF, covering architecture, kernel‑space eBPF programs, Go user‑space collectors, deployment via DaemonSet, performance optimizations, alerting integration with Prometheus/Grafana, and real‑world case studies.

GoGrafanaKubernetes
0 likes · 16 min read
Build a Real‑Time eBPF‑Based Kubernetes Network Anomaly Detector
Efficient Ops
Efficient Ops
Jul 2, 2025 · Operations

Master Grafana: Key Features, Installation on Linux & Docker

This guide introduces Grafana, outlines its multi‑source monitoring features, and provides step‑by‑step installation instructions for Linux using systemd and for Docker Compose, including required commands, configuration files, and how to create and save a basic dashboard.

DockerGrafanaInstallation
0 likes · 4 min read
Master Grafana: Key Features, Installation on Linux & Docker
Architect's Tech Stack
Architect's Tech Stack
Jun 21, 2025 · Operations

Quickly Visualize Nginx Access Logs with Loki and Grafana

This guide shows how to collect Nginx access logs, convert them to JSON, store them in Loki, and create real‑time dashboards in Grafana, covering installation, configuration, Docker deployment, and required plugins for a complete monitoring solution.

GrafanaLog MonitoringLoki
0 likes · 8 min read
Quickly Visualize Nginx Access Logs with Loki and Grafana
Liangxu Linux
Liangxu Linux
Jun 10, 2025 · Cloud Native

Why Loki Is the Ideal Cloud‑Native Log Aggregator for Prometheus & Grafana

Loki, an open‑source log aggregation system from Grafana Labs, integrates tightly with Prometheus and Grafana, stores logs efficiently using object storage, offers a simple label‑based model, and provides cost‑effective, high‑performance logging for cloud‑native environments while outlining its architecture, usage, configuration, advantages, limitations, and retention policies.

Cloud NativeGrafanaLoki
0 likes · 10 min read
Why Loki Is the Ideal Cloud‑Native Log Aggregator for Prometheus & Grafana
Architect's Tech Stack
Architect's Tech Stack
May 20, 2025 · Operations

Visualizing Nginx Access Logs with Loki and Grafana

This guide explains how to collect Nginx access logs, convert them to JSON, store them in Loki using Promtail, and visualize the data with Grafana dashboards, including installation of required modules, Docker deployment, and world‑map panel configuration.

GrafanaJSONLoki
0 likes · 8 min read
Visualizing Nginx Access Logs with Loki and Grafana
Raymond Ops
Raymond Ops
May 9, 2025 · Operations

Build a Complete Prometheus Monitoring Stack with Docker

This tutorial explains Prometheus' core components, shows how to deploy Prometheus Server, Node Exporter, cAdvisor, and Grafana as Docker containers on two hosts, configures scraping and alerting, and demonstrates visualizing metrics with ready‑made Grafana dashboards.

AlertmanagerDockerExporter
0 likes · 8 min read
Build a Complete Prometheus Monitoring Stack with Docker
Alibaba Cloud Observability
Alibaba Cloud Observability
Apr 14, 2025 · Cloud Native

How to Connect Grafana to Large Language Models with MCP (Model Context Protocol)

This guide shows how to use the Model Context Protocol (MCP) to build a lightweight server that links Grafana dashboards to large language models, covering MCP concepts, FastMCP setup, Python client implementation, environment preparation, and integration with Cherry Studio for seamless AI-driven data access.

AI integrationCloud NativeGrafana
0 likes · 12 min read
How to Connect Grafana to Large Language Models with MCP (Model Context Protocol)
Ops Development Stories
Ops Development Stories
Mar 4, 2025 · Operations

Master Process Exporter: Deploy, Integrate with Prometheus & Grafana in Kubernetes

This guide walks Kubernetes administrators through the full lifecycle of Process Exporter—from lightweight deployment and RBAC setup, through Prometheus Operator integration and Grafana dashboard creation, to detailed configuration and alerting—enabling precise process‑level monitoring and rapid root‑cause analysis.

DaemonSetGrafanaKubernetes
0 likes · 15 min read
Master Process Exporter: Deploy, Integrate with Prometheus & Grafana in Kubernetes
Infra Learning Club
Infra Learning Club
Feb 16, 2025 · Operations

GPUprobe: Using eBPF to Monitor CUDA Memory Leaks

The article introduces GPUprobe, an eBPF‑based tool that provides lightweight, continuous, application‑level monitoring of CUDA memory allocation, leaks, and kernel launches, compares it with NSight Systems and DCGM, and demonstrates near‑zero overhead integration with Prometheus and Grafana through detailed code examples and real‑world output analysis.

GPU monitoringGrafanaObservability
0 likes · 13 min read
GPUprobe: Using eBPF to Monitor CUDA Memory Leaks
Architect
Architect
Dec 31, 2024 · Operations

Integrating Prometheus with Spring Boot and Visualizing Metrics Using Grafana

This guide explains how to monitor a Spring Boot application using Prometheus, configure Spring Boot Actuator, run Prometheus (including Docker deployment), set up Grafana for visualizing metrics, and create custom metrics with Micrometer, providing step‑by‑step instructions and code examples.

ActuatorDockerGrafana
0 likes · 10 min read
Integrating Prometheus with Spring Boot and Visualizing Metrics Using Grafana
Linux Ops Smart Journey
Linux Ops Smart Journey
Dec 27, 2024 · Cloud Native

How to Enable Ceph Enterprise Monitoring with Prometheus & Grafana

Learn step‑by‑step how to activate Ceph’s monitoring modules, configure Prometheus to collect Ceph metrics, verify data collection, and integrate Grafana dashboards, including tips on required dependencies and troubleshooting, to ensure reliable, secure storage management in enterprise cloud‑native environments.

CephGrafanaPrometheus
0 likes · 4 min read
How to Enable Ceph Enterprise Monitoring with Prometheus & Grafana
Linux Ops Smart Journey
Linux Ops Smart Journey
Dec 20, 2024 · Cloud Native

How to Set Up MinIO Enterprise Monitoring with Prometheus & Grafana

This guide walks you through configuring MinIO's enterprise monitoring panel, generating Prometheus metrics for clusters, nodes, buckets, and resources, integrating them into Grafana dashboards, and verifying successful data collection to enhance data management and operational efficiency.

GrafanaPrometheusmonitoring
0 likes · 7 min read
How to Set Up MinIO Enterprise Monitoring with Prometheus & Grafana
Linux Ops Smart Journey
Linux Ops Smart Journey
Dec 3, 2024 · Cloud Native

How to Set Up Harbor Monitoring with Prometheus and Grafana

Learn step‑by‑step how to deploy the harbor‑exporter, configure Prometheus to scrape Harbor metrics, verify data collection, and add official Grafana dashboards, enabling real‑time monitoring of your Harbor registry for improved stability, security, and performance in cloud‑native environments.

GrafanaHarborKubernetes
0 likes · 6 min read
How to Set Up Harbor Monitoring with Prometheus and Grafana
Linux Kernel Journey
Linux Kernel Journey
Nov 14, 2024 · Artificial Intelligence

Deep Dive: How DeepFlow Collects Business Metrics for Large‑Model Services

This article explains how China Mobile built a hybrid‑cloud production environment for its customer‑service LLM, using eBPF and WebAssembly plugins from DeepFlow to achieve zero‑intrusion observability, automatically capture full‑stack topology, application/network metrics, and key LLM business indicators such as TTFT, TPOT, and token throughput.

DeepFlowGrafanaLLM
0 likes · 19 min read
Deep Dive: How DeepFlow Collects Business Metrics for Large‑Model Services
Linux Ops Smart Journey
Linux Ops Smart Journey
Nov 12, 2024 · Databases

Master PostgreSQL Monitoring with Grafana: Step-by-Step Guide

Learn how to deploy postgres_exporter, configure PostgreSQL extensions, set up Prometheus scraping, and create Grafana dashboards for comprehensive PostgreSQL performance monitoring, complete with command-line instructions and tips for verifying data collection and visualizing metrics.

GrafanaPostgreSQLPrometheus
0 likes · 6 min read
Master PostgreSQL Monitoring with Grafana: Step-by-Step Guide
Java Architect Essentials
Java Architect Essentials
Oct 27, 2024 · Operations

Integrating Prometheus with Spring Boot for Real‑time Monitoring and Grafana Visualization

This article explains how to use Prometheus together with Spring Boot Actuator and Micrometer to collect, expose, and visualize application metrics, including step‑by‑step dependency configuration, YAML settings, Docker deployment of Prometheus and Grafana, and adding custom metrics for comprehensive monitoring.

ActuatorGrafanaPrometheus
0 likes · 10 min read
Integrating Prometheus with Spring Boot for Real‑time Monitoring and Grafana Visualization
ITPUB
ITPUB
Oct 6, 2024 · Operations

Mastering Prometheus Metrics: Practical Best‑Practice Guide for Effective Monitoring

This guide explains how to design and implement Prometheus metrics for application monitoring, covering the selection of monitoring targets, the four golden metrics, system‑specific metric groups, vector and label choices, naming conventions, histogram bucket design, and useful Grafana visualization tips.

GrafanaOperationsPrometheus
0 likes · 9 min read
Mastering Prometheus Metrics: Practical Best‑Practice Guide for Effective Monitoring
DevOps Operations Practice
DevOps Operations Practice
Sep 17, 2024 · Operations

Comparison of ELK, EFK, and PLG Log Systems

This article introduces three major log management stacks—ELK, EFK, and PLG (Promtail + Loki + Grafana)—explaining their components, deployment scenarios, and provides a detailed comparison of their storage, indexing, and visualization approaches, especially in Kubernetes environments.

EFKELKGrafana
0 likes · 5 min read
Comparison of ELK, EFK, and PLG Log Systems
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 5, 2024 · Artificial Intelligence

Deploying NVIDIA NIM on Alibaba Cloud ACK with Cloud‑Native AI Suite: A Step‑by‑Step Guide

This guide explains how to quickly build a high‑performance, observable, and elastically scalable LLM inference service by deploying NVIDIA NIM on an Alibaba Cloud ACK cluster using the Cloud‑Native AI Suite, KServe, Prometheus, Grafana, and custom autoscaling based on request‑queue metrics.

Alibaba Cloud ACKGrafanaKServe
0 likes · 15 min read
Deploying NVIDIA NIM on Alibaba Cloud ACK with Cloud‑Native AI Suite: A Step‑by‑Step Guide
Top Architect
Top Architect
Aug 29, 2024 · Operations

Setting Up Nginx Log Monitoring with Loki, Promtail, and Grafana

This article walks through a complete, step‑by‑step solution for collecting Nginx access logs, converting them to JSON, shipping them with Promtail to Loki, and visualizing the data in Grafana, including Docker deployment, dashboard import, and world‑map plugin installation.

GrafanaLokiOperations
0 likes · 10 min read
Setting Up Nginx Log Monitoring with Loki, Promtail, and Grafana
Sohu Tech Products
Sohu Tech Products
Aug 21, 2024 · Operations

Building Dynamic Grafana Dashboards for Push System Monitoring

By instrumenting each node of ZuanZuan’s push system with a Prometheus counter labeled by node name and traceId, and visualizing these metrics in a Grafana Flowcharting dashboard that dynamically highlights the trace path, developers can instantly pinpoint failures, cutting troubleshooting time from minutes to near‑zero.

Dynamic DashboardGrafanaJava
0 likes · 11 min read
Building Dynamic Grafana Dashboards for Push System Monitoring
Zhuanzhuan Tech
Zhuanzhuan Tech
Aug 7, 2024 · Operations

Building a Dynamic Grafana Dashboard for Push System TraceId Visualization

This article describes how to use Grafana's Flowcharting plugin and Prometheus metrics to create a dynamic, interactive dashboard that visualizes each logical node of a push notification pipeline, enabling rapid trace‑ID based troubleshooting and reducing manual investigation effort.

GrafanaOperationsdynamic-view
0 likes · 11 min read
Building a Dynamic Grafana Dashboard for Push System TraceId Visualization