Tag

Alert Convergence

1 views collected around this technical thread.

vivo Internet Technology
vivo Internet Technology
Nov 17, 2021 · Operations

Design and Architecture of a Unified Alert Convergence System for Monitoring

The paper presents a unified alert convergence system that centralizes metric calculation, detection, and alarm handling across monitoring subsystems, employing mechanisms such as convergence, claiming, silencing, escalation, and a Redis‑based delayed queue integrated via Kafka or REST to reduce alarm fatigue, improve MTTA/MTTR, and enable future AI‑driven AIOps.

Alert ConvergenceMTTAMTTR
0 likes · 18 min read
Design and Architecture of a Unified Alert Convergence System for Monitoring
Qunar Tech Salon
Qunar Tech Salon
Feb 20, 2020 · Operations

Design and Implementation of Business‑Driven Monitoring Systems at JD Cloud

This article explains why monitoring is essential for operations, outlines the four‑layer monitoring standard (infrastructure, liveliness, performance, business), breaks down functional modules and data flows, and showcases JD Cloud's practical design, alarm‑convergence project, and future AI‑driven observability directions.

Alert ConvergenceData ProcessingJD Cloud
0 likes · 12 min read
Design and Implementation of Business‑Driven Monitoring Systems at JD Cloud
360 Tech Engineering
360 Tech Engineering
Jul 12, 2019 · Operations

StackStorm‑Based Monitoring Alert Auto‑Remediation Solution

This article introduces a StackStorm‑driven monitoring and alert auto‑remediation architecture that converges alarms, performs root‑cause analysis, and executes self‑healing actions, detailing its components, workflow, configuration examples, and real‑world deployment outcomes.

Alert ConvergenceAuto‑RemediationStackStorm
0 likes · 7 min read
StackStorm‑Based Monitoring Alert Auto‑Remediation Solution
Efficient Ops
Efficient Ops
Jun 13, 2018 · Operations

Designing an Effective CMDB: Boost Ops Efficiency, Alert Convergence & Self‑Healing

This article explains how a well‑designed CMDB abstracts and models operational objects, categorizes business, hardware, application and custom data, and enables alert convergence and automated fault‑healing, dramatically improving DevOps efficiency and reliability.

Alert ConvergenceCMDBDevOps
0 likes · 7 min read
Designing an Effective CMDB: Boost Ops Efficiency, Alert Convergence & Self‑Healing