Topic

Monitoring

Collection size
1711 articles
Page 77 of 86
Java Architecture Diary
Java Architecture Diary
Oct 20, 2020 · Backend Development

How to Integrate Druid Monitor with Spring Cloud: A Step‑by‑Step Guide

This article explains what Druid Monitor and Druid Admin are, why cluster‑level monitoring is needed in microservice architectures, and provides a complete Spring Cloud starter implementation with configuration examples, code snippets, and usage limitations.

DruidJavaMicroservices
0 likes · 6 min read
How to Integrate Druid Monitor with Spring Cloud: A Step‑by‑Step Guide
Efficient Ops
Efficient Ops
Sep 24, 2024 · Operations

Master Linux Performance in 60 Seconds: 10 Essential Commands

When a Linux server shows performance issues, the first minute is critical; this guide walks you through ten standard command‑line tools—uptime, dmesg, vmstat, mpstat, pidstat, iostat, free, sar, and top—explaining what each metric means and how to interpret the output for quick troubleshooting.

CLILinuxMonitoring
0 likes · 19 min read
Master Linux Performance in 60 Seconds: 10 Essential Commands
Efficient Ops
Efficient Ops
Jul 1, 2024 · Cloud Native

How to Monitor Business Metrics with Prometheus in Kubernetes

This article explains the concept of observability, details Prometheus metric definitions and types, and provides Go code examples for exposing, defining, generating, and scraping business‑level metrics in a Kubernetes‑based cloud‑native environment.

GoKubernetesMonitoring
0 likes · 11 min read
How to Monitor Business Metrics with Prometheus in Kubernetes
Efficient Ops
Efficient Ops
Feb 18, 2024 · Operations

What Does IT Operations Involve and How to Automate It?

This article outlines the core responsibilities of IT operations, examines the current state and management goals, and provides a detailed roadmap for automating tasks such as server provisioning, environment definition, deployment, monitoring, and version release across multiple maturity stages.

DevOpsIT OperationsInfrastructure
0 likes · 11 min read
What Does IT Operations Involve and How to Automate It?
Efficient Ops
Efficient Ops
Feb 19, 2024 · Operations

Mastering Prometheus: Practical Tips for Effective Application Monitoring

This article explains how to design and implement Prometheus metrics for application monitoring, covering the selection of monitoring targets, golden metrics, label conventions, naming rules, histogram bucket choices, and Grafana visualization tricks to help engineers build reliable observability pipelines.

GrafanaMonitoringOperations
0 likes · 10 min read
Mastering Prometheus: Practical Tips for Effective Application Monitoring
Efficient Ops
Efficient Ops
Jan 22, 2024 · Operations

Mastering Monitoring: Black‑Box vs White‑Box, Metrics, and Prometheus in Practice

This guide explains monitoring fundamentals, clears common misconceptions, compares black‑box and white‑box approaches, outlines key metrics such as latency, traffic, errors and saturation, and provides a deep dive into Prometheus architecture, data model, query language, and practical examples for CPU, memory, and disk monitoring.

Cloud NativeMonitoringOperations
0 likes · 15 min read
Mastering Monitoring: Black‑Box vs White‑Box, Metrics, and Prometheus in Practice
Efficient Ops
Efficient Ops
Oct 24, 2023 · Operations

How to Monitor Business Metrics with Prometheus in Kubernetes

This article explains how to use Prometheus to monitor business‑level metrics in a Kubernetes environment, covering observability fundamentals, metric definitions, metric types, exposing metrics via a /metrics endpoint, and practical Go code examples for defining, recording, and scraping custom metrics.

GoKubernetesMonitoring
0 likes · 11 min read
How to Monitor Business Metrics with Prometheus in Kubernetes
Efficient Ops
Efficient Ops
Oct 15, 2023 · Databases

How to Diagnose and Fix Slow Redis Responses: A Step-by-Step Guide

This article walks through practical methods for troubleshooting slow service alerts, diagnosing Redis performance bottlenecks, and reproducing issues with local demos and load simulations, offering concrete metrics, command‑line checks, and mitigation strategies such as scaling, rate‑limiting, and pipeline optimization.

MonitoringOperationsRedis
0 likes · 22 min read
How to Diagnose and Fix Slow Redis Responses: A Step-by-Step Guide
Efficient Ops
Efficient Ops
Sep 12, 2023 · Operations

Understanding Prometheus Metric Types: Counters, Gauges, Histograms & Summaries

This article explains how metrics are used to monitor software performance, introduces basic metric components and dimensional metrics, compares Prometheus, OpenMetrics and OpenTelemetry standards, and provides detailed guidance on Prometheus metric types—Counter, Gauge, Histogram, and Summary—with code examples and query patterns.

Cloud NativeMonitoringPrometheus
0 likes · 18 min read
Understanding Prometheus Metric Types: Counters, Gauges, Histograms & Summaries
Efficient Ops
Efficient Ops
Aug 21, 2023 · Operations

Mastering Application Monitoring with Prometheus: Practical Tips and Best Practices

This guide explains how to design effective Prometheus metrics, choose appropriate monitoring objects, labels, and buckets, and leverage Grafana visualizations to gain deep insight into application performance across online services, offline processing, and batch jobs.

DevOpsGrafanaMonitoring
0 likes · 10 min read
Mastering Application Monitoring with Prometheus: Practical Tips and Best Practices
Efficient Ops
Efficient Ops
Jul 3, 2023 · Operations

Mastering Application Monitoring with Prometheus: Practical Metrics and Best Practices

This article explains how to design effective Prometheus metrics for various application types, covering golden metrics, label selection, naming conventions, bucket choices, and Grafana visualization tips to help engineers build reliable observability solutions.

GrafanaMonitoringPrometheus
0 likes · 9 min read
Mastering Application Monitoring with Prometheus: Practical Metrics and Best Practices
Efficient Ops
Efficient Ops
Jul 5, 2023 · Big Data

How ByteDance Built a Cloud‑Native Big Data Ops Platform for Unified Logging & Alerts

ByteDance’s cloud‑native big data operations platform consolidates logging, monitoring, and alerting across heterogeneous environments, using unified log collection (intrusive and Filebeat), dynamic alert rules, customizable notification plugins, and scalable monitoring pipelines, thereby reducing operational complexity, shielding users from infrastructure differences, and enhancing multi‑tenant efficiency.

AlertingBig DataCloud Native
0 likes · 10 min read
How ByteDance Built a Cloud‑Native Big Data Ops Platform for Unified Logging & Alerts
Efficient Ops
Efficient Ops
Jun 14, 2023 · Artificial Intelligence

How AIOps Transforms IT Operations: From Early Risk Detection to Intelligent Management

This article outlines the background, objectives, and implementation framework of AIOps at a major bank, detailing data consolidation, analysis engines, scenario ecosystems, practical case studies, and future directions for intelligent, proactive IT operations.

AIOpsData PlatformIT Operations
0 likes · 15 min read
How AIOps Transforms IT Operations: From Early Risk Detection to Intelligent Management
Efficient Ops
Efficient Ops
Apr 18, 2023 · Databases

Mastering MongoDB Clusters: Setup, Monitoring, Migration, and Optimization

This comprehensive guide explains MongoDB cluster architecture, component roles, common use cases, monitoring commands, essential maintenance operations, data migration steps, troubleshooting of typical production issues, and practical optimization recommendations for high‑performance deployments.

ClusterMongoDBMonitoring
0 likes · 20 min read
Mastering MongoDB Clusters: Setup, Monitoring, Migration, and Optimization
Efficient Ops
Efficient Ops
Jan 16, 2023 · Operations

How China Mobile’s Centralized AIOps Platform Achieved Top‑Tier Evaluation

This article details China Mobile Information's interview about their centralized AIOps platform, the recent excellent‑level assessment by the China Academy of Information and Communications Technology, the system's key modules, future plans, and the broader significance of AI‑driven IT operations.

AIOpsIT OperationsMonitoring
0 likes · 11 min read
How China Mobile’s Centralized AIOps Platform Achieved Top‑Tier Evaluation
Efficient Ops
Efficient Ops
Dec 18, 2022 · Operations

Mastering Application Monitoring with Prometheus: Practical Tips and Best Practices

This article explains how to design effective Prometheus metrics, choose appropriate vectors, labels, buckets, and naming conventions, and offers Grafana usage tricks to help engineers monitor online services, batch jobs, and offline processing systems with clear, actionable insights.

GrafanaMonitoringOperations
0 likes · 9 min read
Mastering Application Monitoring with Prometheus: Practical Tips and Best Practices
Efficient Ops
Efficient Ops
Nov 29, 2022 · Operations

How to Retrieve and Process Prometheus Metrics via Its API

This article explains how to use the Prometheus HTTP API to query instant and range metrics, interpret the JSON responses, and fetch data programmatically with Python, providing code examples and details on request parameters, error handling, and practical usage.

APIDevOpsMonitoring
0 likes · 8 min read
How to Retrieve and Process Prometheus Metrics via Its API
Efficient Ops
Efficient Ops
Nov 15, 2022 · Operations

Master Linux Performance: Key Metrics, Tools, and Optimization Strategies

This comprehensive guide explains Linux performance optimization by defining key metrics such as throughput and latency, interpreting average load, analyzing CPU context switches, memory management, and I/O behavior, and recommending practical tools and techniques—including vmstat, pidstat, perf, and dstat—to identify and resolve bottlenecks.

CPULinuxMemory
0 likes · 45 min read
Master Linux Performance: Key Metrics, Tools, and Optimization Strategies
Efficient Ops
Efficient Ops
Nov 7, 2022 · Operations

Essential Redis Monitoring Metrics and Commands for Effective Operations

This guide details key Redis monitoring metrics—including performance, memory, activity, persistence, and error indicators—along with practical commands, configuration settings, and code snippets to help operators efficiently track and troubleshoot Redis instances.

LinuxMonitoringOperations
0 likes · 6 min read
Essential Redis Monitoring Metrics and Commands for Effective Operations
Efficient Ops
Efficient Ops
Oct 13, 2022 · Operations

Essential Guide to Effective Monitoring in Operations: Goals, Methods, and Tools

This article outlines the essential components of operational monitoring, covering monitoring objectives, methods, core processes, key tools, metrics for hardware, system, application, network, and business layers, as well as alerting, handling, and best practices for building a comprehensive, reliable monitoring solution.

AlertingMonitoringOperations
0 likes · 7 min read
Essential Guide to Effective Monitoring in Operations: Goals, Methods, and Tools