Topic: Monitoring

Collection size: 1711 articles
Practical DevOps Architecture
Jun 13, 2024 · Operations

Comprehensive Data Center Operations Training Course Overview

This extensive training program covers everything a data center operations engineer needs—from foundational infrastructure management and server hardware maintenance to advanced network configuration, security hardening, monitoring, fault handling, and practical hands‑on skills for real‑world challenges.

Monitoring · Network · Operations
6 min read
Practical DevOps Architecture
May 9, 2024 · Operations

Monitoring SSL Certificate Expiration with Zabbix Using a Shell Script

This guide explains how to create a shell script that checks SSL certificate expiration dates and integrates it with Zabbix by configuring a user parameter, testing the script, and setting up monitoring items, triggers, graphs, and alerts to ensure services remain available.

Automation · Monitoring · Operations
3 min read
Practical DevOps Architecture
May 22, 2024 · Operations

SRE & Linux Operations Course Outline

This article presents a detailed curriculum covering fundamental infrastructure, cluster architecture, automation, log collection, Linux system administration, containerization, monitoring, security, and related DevOps tools across multiple phases and daily modules for comprehensive SRE training.

Automation · Containers · Linux
8 min read
Practical DevOps Architecture
Sep 30, 2022 · Operations

Resolving Filebeat Startup Failure: EOF Error in Registrar State

This guide explains how to troubleshoot Filebeat failing to start due to an EOF error while loading registrar state, by inspecting logs, resetting the registry directory, and restarting the service on a Linux host.

Filebeat · Linux · Logstash
4 min read
DevOps Cloud Academy
May 31, 2024 · Cloud Native

Optimizing RabbitMQ Performance on Kubernetes

This guide explains how to deploy RabbitMQ on Kubernetes and improve its performance through Helm installation, resource tuning, monitoring, scaling, security hardening, and advanced configuration techniques, providing practical code examples for each step.

Helm · Kubernetes · Monitoring
9 min read
DevOps Cloud Academy
Feb 2, 2024 · Operations

DevOps Tools for 2024: A Comprehensive Overview

An extensive overview of essential DevOps tools for 2024, covering categories such as version control, CI/CD, container orchestration, configuration management, infrastructure as code, monitoring, collaboration, artifact repositories, testing, security, deployment automation, serverless platforms, and database management to guide effective tool selection.

Automation · CI/CD · DevOps
7 min read
DevOps Cloud Academy
Feb 27, 2023 · Operations

Understanding GitOps: History, Principles, Benefits, and Practical Implementation

This article explains the origins of GitOps, defines its core principles of declarative infrastructure, versioned desired state, automated approval, and compliance monitoring, and outlines its benefits and a concrete practice using tools such as GitLab, ArgoCD, Kubernetes, Terraform, Prometheus, and Grafana.

CI/CD · Cloud Native · GitOps
18 min read
DevOps Cloud Academy
Mar 2, 2022 · Operations

Promoter: Rendering AlertManager Graphs for DingTalk Notifications Using Go

The article introduces Promoter, a Go‑based webhook that fetches Prometheus metrics, renders alert graphs with gonum/plot, stores the images in S3‑compatible object storage, and embeds them in DingTalk notifications, providing deployment instructions, template customization, and core implementation details.

Alertmanager · DingTalk · Go
10 min read
DevOps Cloud Academy
Jan 25, 2021 · Cloud Native

Blackbox Monitoring with Prometheus Blackbox Exporter in Kubernetes

This guide explains how to complement Prometheus white‑box monitoring with black‑box probes by deploying the Blackbox Exporter in a Kubernetes cluster, configuring ConfigMaps, Deployments, Services, and Prometheus scrape jobs for HTTP, DNS, TCP, and ICMP checks, and using annotations for automatic service discovery.

Blackbox Exporter · Kubernetes · Monitoring
10 min read
Top Architect
Dec 5, 2024 · Databases

Database Monitoring and Slow Query Log Management Guide

This article explains how database administrators can monitor system resource usage with commands like top, iostat, and vmstat, and configure MySQL slow query logging, including enabling the log, setting thresholds, viewing logs, and applying best‑practice recommendations for analysis and issue resolution.

Database Administration · Linux Commands · Monitoring
8 min read
Top Architect
Sep 14, 2024 · Operations

Centralized Log Collection with Filebeat and Graylog: Installation, Configuration, and Usage

This article explains why centralized log collection is essential for multi‑environment services, introduces Graylog as a lightweight alternative to ELK, details Filebeat's role and workflow, provides configuration examples, shows how to deploy both Filebeat and Graylog via Docker or packages, and demonstrates integration with Spring Boot and log search techniques.

Docker · ELK · Filebeat
20 min read
Top Architect
Aug 29, 2024 · Operations

Setting Up Nginx Log Monitoring with Loki, Promtail, and Grafana

This article walks through a complete, step‑by‑step solution for collecting Nginx access logs, converting them to JSON, shipping them with Promtail to Loki, and visualizing the data in Grafana, including Docker deployment, dashboard import, and world‑map plugin installation.

Logging · Monitoring · Nginx
10 min read
Top Architect
May 5, 2023 · Backend Development

Using Redis Sentinel for High Availability: Design and Implementation

This article introduces Redis Sentinel as the official high‑availability solution for Redis, explains its core functions, provides configuration examples, compares three ways to receive failover notifications (script, client subscription, and indirect service), and offers design recommendations for robust production deployments.

DevOps · Failover · Monitoring
10 min read
Top Architect
Jul 9, 2022 · Operations

Introduction to Loki: Architecture, Deployment, and Usage

This article introduces Loki, Grafana Labs' open‑source, horizontally scalable log aggregation system, compares it with ELK/EFK, explains its architecture and components, and provides step‑by‑step deployment instructions with Promtail configuration and Grafana integration for Kubernetes environments.

Container · Kubernetes · Logging
15 min read
Top Architect
Aug 25, 2020 · Operations

Prometheus Monitoring in Kubernetes: Principles, Exporters, Configuration, Capacity Planning, and Best Practices

This comprehensive guide explores Prometheus as a cloud‑native monitoring solution for Kubernetes, covering core principles, exporter selection, configuration snippets, Grafana dashboard creation, capacity planning, high‑cardinality challenges, rate calculations, prediction functions, high‑availability designs, and integration with Alertmanager and other operational tools.

Alertmanager · Exporter · Kubernetes
38 min read
DataFunSummit
Mar 22, 2024 · Artificial Intelligence

Risk Control Model Construction for Online Small Loans: Pre‑loan, In‑loan, Post‑loan and Monitoring

This article presents a comprehensive overview of risk control model building for online small‑loan scenarios, covering pre‑loan, in‑loan and post‑loan stages, the associated data pipelines, model deployment strategies, optimization attempts, and monitoring frameworks to ensure accuracy, stability and effectiveness.

Monitoring · credit scoring · data pipeline
16 min read
DataFunTalk
Jul 9, 2023 · Operations

Building High‑Performance Observability Data Pipelines with Vector and Honghu

This article explains the concepts and importance of observability, introduces the Vector data‑pipeline tool and its architecture, demonstrates how to configure sources, transforms and sinks, and shows how to integrate Vector with the Honghu platform to build a complete, real‑time monitoring solution for modern distributed systems.

Big Data · Honghu · Monitoring
33 min read
Mike Chen's Internet Architecture
Aug 27, 2024 · Operations

Comprehensive Guide to Installing, Configuring, and Using Grafana for Monitoring

This article provides a step‑by‑step tutorial on what Grafana is, its common monitoring scenarios, how to install it on Linux, Windows, macOS or via Docker, configure data sources such as Prometheus, and create or import dashboards for system and business metric visualization.

Dashboard · Docker · Installation
5 min read
Mike Chen's Internet Architecture
Jun 25, 2024 · Cloud Native

Eight Essential Practices for Microservice Governance

This article outlines eight fundamental microservice governance techniques—including service registration and discovery, load balancing, circuit breaking, security, monitoring, configuration management, and API gateway—to help architects design reliable, secure, and maintainable cloud‑native systems.

Configuration · Load Balancing · Monitoring
8 min read