Topic

monitoring

Collection size
1767 articles
DevOps
Oct 17, 2023 · Databases

Fundamental Methods for Service Troubleshooting and Redis Performance Optimization

Starting from latency spikes in a service, the article walks through systematic troubleshooting steps (examining module performance, resource metrics, and network latency), followed by detailed Redis diagnostics using metrics, latency commands, and profiling tools, and closes by recommending scaling, rate limiting, and caching strategies to resolve the issue.

Backend · Monitoring · Operations
0 likes · 22 min read
DevOps
Mar 14, 2023 · Operations

15 Essential DevOps and SRE Tools to Watch in 2023

This guide outlines fifteen key DevOps and SRE tools for 2023—including monitoring, application platforms, chat‑ops, incident management, diagramming, and CI/CD solutions—explaining their core features, benefits, and how they help teams maintain reliable, observable, and automated software delivery pipelines.

Monitoring · Observability · SRE
0 likes · 11 min read
DevOps
Aug 31, 2022 · Operations

Key Software Performance Metrics for Successful Development

This article explains why performance testing is essential before large‑scale deployment and outlines fourteen critical software performance metrics—such as response time, request rate, error rate, CPU utilization, and concurrent users—to help development teams measure, analyze, and improve their products.

Monitoring · QA · devops
0 likes · 7 min read
DevOps
Apr 7, 2022 · Operations

Top DevOps Tools: Comprehensive List, Features, and Selection Guide

This article provides an in‑depth overview of DevOps tools, explaining their role in automating software development and operations, and presents a curated list of popular tools with key features, download links, and guidance on choosing the right solution for your team.

CI/CD · Monitoring · automation
0 likes · 18 min read
DevOps
Apr 8, 2020 · Operations

Bilibili DevOps Case Study: Culture, Community, User‑Driven Demand Management, High‑Performance Microservices, and Data Operations

This article presents a comprehensive DevOps case study of Bilibili, covering its cultural background, community ecosystem, user‑centric demand management, migration to high‑performance microservices, and the implementation of logging, monitoring, and real‑time data platforms to support rapid, reliable delivery.

Bilibili · Data Platform · Monitoring
0 likes · 17 min read
DevOps
Aug 13, 2019 · Operations

Comprehensive DevOps Toolset Overview

This article presents a detailed, categorized list of DevOps tools—including version control, automated build and testing, CI/CD, container platforms, configuration management, micro‑service platforms, logging, and monitoring solutions—providing concise descriptions for each to help teams select appropriate utilities for modern software delivery pipelines.

CI/CD · Configuration Management · Containers
0 likes · 14 min read
DevOps
Jul 12, 2017 · Cloud Native

Container Monitoring: Challenges, Metrics Collection, and Best Practices

This article examines the unique challenges of monitoring containers, outlines three categories of metrics to collect, compares host‑centric and layered monitoring architectures, provides detailed methods for gathering CPU, memory, I/O and network data via cgroup files and Docker commands, and shares practical insights, tooling recommendations, and a Q&A session for effective container observability.

Containers · Docker · Metrics
0 likes · 18 min read
DevOps
Apr 25, 2017 · Operations

Analyzing and Visualizing Docker Logs with the ELK Stack (Part Two)

This article explains how to analyze and visualize Docker container logs using the ELK stack, covering preparation, parsing tips, Kibana query techniques, and example visualizations to help monitor Dockerized environments effectively in production.

Docker · ELK · Kibana
0 likes · 7 min read
DevOps
Apr 24, 2017 · Operations

Collecting and Processing Docker Logs with ELK: Installation and Configuration Guide

This article explains the challenges of Docker log collection and provides a step‑by‑step guide for installing ELK components, configuring Logstash, Kibana, and various log shippers such as Filebeat, logging drivers, Logspout, and Logz.io to reliably gather and visualize container logs.

Docker · ELK · Filebeat
0 likes · 10 min read
DevOps
Jun 20, 2016 · Operations

A Comprehensive Overview of Popular DevOps Tools for IT Operations

This article provides a detailed overview of widely used DevOps tools—including monitoring solutions like Microsoft SCOM, Vistara, SolarWinds, Nimsoft, ServiceNow, automation platforms Chef and Puppet, container platform Docker, orchestration systems Apache Mesos and Kubernetes, as well as performance monitoring tools New Relic and Graphite/Grafana—highlighting their features, typical use cases, and important considerations.

Containers · IT Operations · Monitoring
0 likes · 10 min read
Practical DevOps Architecture
May 15, 2024 · Databases

Identifying and Killing High‑CPU MySQL Queries with top and a Python Helper Script

This article explains how to quickly locate the MySQL statements that cause sudden CPU spikes by inspecting per‑thread usage with top, correlating threads to SQL via performance_schema, and using a Python script to list and optionally kill the offending queries.

Monitoring · MySQL · Operations
0 likes · 8 min read
Practical DevOps Architecture
Feb 1, 2024 · Operations

Installing and Configuring Prometheus MySQL Exporter on Kubernetes with Alert Rules

This guide walks through installing the Prometheus MySQL exporter on a Kubernetes cluster via Helm, creating Prometheus alert rules for MySQL health, and testing the alerts by scaling the MySQL deployment.

Alerting · Helm · Kubernetes
0 likes · 6 min read
Practical DevOps Architecture
Jan 10, 2024 · Operations

Monitoring Domain Expiration with Prometheus, blackbox_exporter, and Grafana

This guide demonstrates how to use Docker, Prometheus, blackbox_exporter, and Grafana to monitor website status codes, response times, and especially certificate expiration dates by configuring the exporter, Prometheus scrape jobs, and alerting rules for domain health.

Alerting · Docker · Domain Expiration
0 likes · 3 min read
Practical DevOps Architecture
May 26, 2023 · Cloud Native

Comprehensive Cloud Native DevOps Course Curriculum

This document presents a detailed curriculum for a cloud-native DevOps training program, covering Docker, Kubernetes, Helm, Prometheus, Grafana, Loki, CI/CD pipelines, GitOps, Istio, containerd, and related operational tools across more than ninety structured lessons.

CI/CD · Cloud Native · Docker
0 likes · 7 min read
Practical DevOps Architecture
Feb 8, 2022 · Operations

Extending Zabbix Monitoring with Custom Scripts and Handling Stale NFS Handles

This article explains how Zabbix monitoring can be extended with custom shell or Python scripts to gather business-specific metrics, demonstrates a sample script that checks disk usage, and provides three methods to resolve a stale NFS file handle error: using fuser, inspecting processes, and forcing an unmount.

Monitoring · Operations · Zabbix
0 likes · 3 min read
Practical DevOps Architecture
Jan 8, 2022 · Operations

Prometheus Service Discovery: File‑Based, API, and DNS Methods

The article explains how Prometheus uses service discovery to automatically locate monitoring targets in large‑scale environments such as Kubernetes, describing file‑based, API‑query, and DNS‑record approaches, and provides example configuration snippets and validation commands.

Kubernetes · Monitoring · Operations
0 likes · 3 min read
Practical DevOps Architecture
Jul 18, 2021 · Cloud Native

Deploying Prometheus and Grafana on a Test Kubernetes Cluster

This guide walks through setting up a Prometheus monitoring stack and Grafana visualization on a test Kubernetes cluster, covering pod inspection, directory preparation, YAML deployments for node-exporter, RBAC, ConfigMap, Prometheus and Grafana, and final service exposure.

Cloud Native · Deployment · Grafana
0 likes · 5 min read
Practical DevOps Architecture
Jul 5, 2021 · Operations

Introduction to Zabbix and How to Configure Automatic Discovery

This article introduces Zabbix, an open‑source enterprise monitoring system, outlines its key features such as rich visualization, high availability, and diverse data collection methods, and provides a step‑by‑step guide to setting up server‑side automatic discovery and related actions.

Automatic Discovery · Monitoring · Operations
0 likes · 10 min read