
How to Build a Self‑Service Cloud Monitoring Governance System with Alibaba Cloud

This guide explains how Alibaba Cloud's monitoring governance feature helps enterprises assess, improve, and automate their cloud monitoring setup. It covers the detection categories, practical steps for enabling and reviewing checks, and tips to avoid alert fatigue while ensuring full resource coverage.

Alibaba Cloud Observability

Overview

As enterprises pursue digital transformation, cloud computing has become essential to agility and innovation, and Alibaba Cloud plays a key role in managing cloud resources efficiently.

As cloud environments grow in complexity and scale, enterprises face challenges in managing and monitoring resources securely, efficiently, and compliantly. Alibaba Cloud Monitoring offers a governance detection function that evaluates a user's monitoring capability, suggests one‑click fixes, and guides the construction of a complete, customized monitoring system.

Detection Item Classification

Governance detection groups 13 items into four categories: Monitoring Coverage, Platform Configuration, Usage Status, and Optimization Suggestions. This article uses two of them, Cloud Product Resource Monitoring and Continuous Alarm, as examples.

Cloud Product Resource Monitoring

Full coverage of cloud product resources is fundamental for business continuity. Setting alarm rules for these resources is a basic requirement; any resource without an alarm rule is flagged for governance. The detection covers 17 core cloud products such as ECS, RDS, Redis, SLB, MongoDB, and OSS.

If a resource lacks alarm coverage, the system marks it as a governance target, allowing users to apply a one‑click remediation that automatically creates appropriate alarm rules.
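The coverage check described above can be pictured as a simple set comparison. The following is a minimal sketch, not the service's actual implementation: the `Resource` class, the `find_uncovered` helper, and the sample resource IDs are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resource:
    """Hypothetical record for one cloud resource."""
    product: str       # e.g. "ECS", "RDS", "OSS"
    resource_id: str


def find_uncovered(resources, covered_ids):
    """Return the resources whose ID has no alarm rule attached."""
    return [r for r in resources if r.resource_id not in covered_ids]


# Illustrative inventory; the IDs are made up.
resources = [
    Resource("ECS", "i-001"),
    Resource("RDS", "rm-002"),
    Resource("OSS", "bucket-003"),
]

# IDs that already have at least one alarm rule (assumed input).
covered_ids = {"i-001"}

targets = find_uncovered(resources, covered_ids)
for r in targets:
    print(f"{r.product} {r.resource_id}: no alarm rule, governance target")
```

In this sketch, every resource returned by `find_uncovered` corresponds to what the article calls a governance target, which is the set a one-click remediation would create alarm rules for.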

Continuous Alarm

While occasional alarms are normal, prolonged alarm states cause alert fatigue, leading users to ignore critical alerts and increasing the risk of system failures or security incidents. The system flags any alarm rule that remains in an alarm state for more than 24 hours as a governance target, prompting users to resolve the issue or adjust thresholds.
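The 24-hour rule can be expressed as a duration comparison. This is a rough sketch under stated assumptions: the rule names, the `find_continuous_alarms` helper, and the input shape (rule name mapped to the time the rule entered the alarm state, or `None` if it is not alarming) are hypothetical, not part of any Cloud Monitoring API.

```python
from datetime import datetime, timedelta

# Threshold taken from the article: more than 24 hours in the alarm state.
CONTINUOUS_ALARM_THRESHOLD = timedelta(hours=24)


def find_continuous_alarms(alarm_started_at, now):
    """Return names of rules that have been alarming for more than 24 hours.

    alarm_started_at maps a rule name to the datetime it entered the alarm
    state, or None when the rule is currently not alarming.
    """
    return [
        name
        for name, started in alarm_started_at.items()
        if started is not None and now - started > CONTINUOUS_ALARM_THRESHOLD
    ]


now = datetime(2024, 1, 3, 12, 0)
rules = {
    "cpu-high": datetime(2024, 1, 1, 9, 0),   # alarming for 51 hours
    "disk-full": datetime(2024, 1, 3, 8, 0),  # alarming for 4 hours
    "mem-high": None,                         # not alarming
}

flagged = find_continuous_alarms(rules, now)
print(flagged)
```

Only `cpu-high` exceeds the threshold here, so it is the one rule that would need either a fix to the underlying issue or a threshold adjustment.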

Detection Items Overview

Monitoring Coverage

Cloud Product Resource Monitoring [1]

Cloud Monitoring Plug‑in Installation Coverage [2]

Platform Configuration

Invalid Alarm Rules [3]

Alarm Rules Linked to Expired Resources [4]

Non‑Recommended Metric Rules [5]

Old‑Version System Event Subscription Rules [6]

Usage Status

Callback Failures [7]

Continuous Alarms [8]

Non‑Recommended Cloud Monitoring Plug‑in Versions [9]

Non‑Recommended Cloud Monitoring API Calls [10]

Optimization Suggestions

Regularly Monitor Resource Load [11]

Use Efficient Methods to Retrieve Metric Data [12]

Continuously Monitor Public Service Availability [13]
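For readers who want to track these items in their own tooling, the classification above can be captured as a plain mapping. This is only an illustrative sketch: the `DETECTION_ITEMS` structure and the `category_of` helper are hypothetical names, and the item names are copied from the list above.

```python
# Category -> detection items, as listed in this article.
DETECTION_ITEMS = {
    "Monitoring Coverage": [
        "Cloud Product Resource Monitoring",
        "Cloud Monitoring Plug-in Installation Coverage",
    ],
    "Platform Configuration": [
        "Invalid Alarm Rules",
        "Alarm Rules Linked to Expired Resources",
        "Non-Recommended Metric Rules",
        "Old-Version System Event Subscription Rules",
    ],
    "Usage Status": [
        "Callback Failures",
        "Continuous Alarms",
        "Non-Recommended Cloud Monitoring Plug-in Versions",
        "Non-Recommended Cloud Monitoring API Calls",
    ],
    "Optimization Suggestions": [
        "Regularly Monitor Resource Load",
        "Use Efficient Methods to Retrieve Metric Data",
        "Continuously Monitor Public Service Availability",
    ],
}


def category_of(item):
    """Return the category a detection item belongs to, or None if unknown."""
    for category, items in DETECTION_ITEMS.items():
        if item in items:
            return category
    return None


total_items = sum(len(items) for items in DETECTION_ITEMS.values())
print(total_items)
print(category_of("Continuous Alarms"))
```

The counts per category (2, 4, 4, and 3) add up to the 13 items mentioned earlier.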

Enabling Governance Detection

To start using the detection feature, log in to the Cloud Monitoring console and follow these steps:

In the left navigation, click Overview.

On the Overview page, select the Governance Detection tab.

Click Start Detection and wait for the scan to complete.

Viewing the Detection Report

After detection finishes, view each item’s results on the report page. Clicking a problematic item reveals details, the objects needing governance, and recommended actions.

Conclusion

The governance detection feature lets users apply Cloud Monitoring best practices on a self‑service basis, building a complete monitoring system that reduces alert fatigue, ensures full resource coverage, and continuously improves operational efficiency.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contact admin@besthub.dev and we will review it promptly.
