Tagged articles
7 articles
Page 1 of 1
转转QA
转转QA
Oct 31, 2025 · Operations

Boosting Service Quality with Intelligent Inspection, Notification, and Automation Engines

This article outlines the design and value of an automated service quality monitoring platform, detailing its core benefits—intelligent detection, automated execution, data‑driven decisions, and precise notifications—along with functional architecture, key modules, code examples, technical requirements, and practical recommendations.

AI MonitoringBackendOperations
0 likes · 10 min read
Boosting Service Quality with Intelligent Inspection, Notification, and Automation Engines
Tencent Advertising Technology
Tencent Advertising Technology
Oct 30, 2025 · Operations

How Tencent Ads Achieved Full‑Automation CI/CD with AI‑Driven Monitoring

This article details Tencent Ads' systematic transformation of its CI/CD pipeline into a fully automated, data‑driven process, highlighting the challenges of manual labor, release quality, and iteration speed, and describing the AI‑powered monitoring and intelligent interception mechanisms that enable zero‑human deployment and multi‑release per day.

AI MonitoringDevOpsTencent Ads
0 likes · 31 min read
How Tencent Ads Achieved Full‑Automation CI/CD with AI‑Driven Monitoring
Efficient Ops
Efficient Ops
May 20, 2025 · Operations

How AI Is Transforming Enterprise Monitoring and Automated Operations

This article outlines a comprehensive AI‑driven framework for upgrading intelligent monitoring, automating operations, enhancing log analysis, optimizing cloud costs, strengthening security, and improving disaster recovery, showcasing practical techniques such as unified data platforms, dynamic baselines, smart ticket routing, and self‑healing infrastructure.

AI Monitoringcloud optimizationsecurity
0 likes · 9 min read
How AI Is Transforming Enterprise Monitoring and Automated Operations
Volcano Engine Developer Services
Volcano Engine Developer Services
Apr 1, 2025 · Artificial Intelligence

Taming High Cardinality in AI Model & Autonomous Driving Monitoring with Prometheus

This article explores how high cardinality in Prometheus metrics impacts AI large‑model and autonomous‑driving observability, explains the underlying concepts, outlines the performance and cost challenges, and presents practical design, collection, and query‑side solutions—including metric modeling, pre‑aggregation, and remote‑read pushdown—to keep monitoring efficient and scalable.

AI MonitoringCardinalityPrometheus
0 likes · 12 min read
Taming High Cardinality in AI Model & Autonomous Driving Monitoring with Prometheus
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 13, 2025 · Artificial Intelligence

How to Master LLM Observability: End-to-End Monitoring with Alibaba Cloud

This article outlines Alibaba Cloud’s comprehensive LLM observability solution, covering challenges, key metrics, component architecture, data collection, tracing, performance analysis, and practical integration steps—including Python agent setup and Dify demo—to help developers monitor and optimize large language model applications.

AI MonitoringCloud NativeLLM Observability
0 likes · 19 min read
How to Master LLM Observability: End-to-End Monitoring with Alibaba Cloud
360 Tech Engineering
360 Tech Engineering
Oct 31, 2019 · Operations

AIOps Implementation Practice at 360: Architecture, Models, and Automation

The article details 360's AIOps deployment, covering external speaker insights, internal architecture, data collection pipelines, AI models for resource recycling, alarm reduction, and correlation, as well as visualization dashboards, labeling platforms, and self‑healing mechanisms, illustrating a comprehensive AI‑driven operations framework.

AI MonitoringOperations Automationaiops
0 likes · 14 min read
AIOps Implementation Practice at 360: Architecture, Models, and Automation
Efficient Ops
Efficient Ops
Apr 19, 2018 · Operations

How Alibaba Prevents Release Failures in Billion‑Dollar Transactions

Alibaba’s experts share how they boost release speed and stability for trillion‑dollar transactions by combining P2P file distribution, automated monitoring, AI‑driven anomaly detection, and an unattended release system that automatically pauses risky deployments, reducing faults while handling massive e‑commerce workloads.

AI MonitoringDeploymentOperations
0 likes · 25 min read
How Alibaba Prevents Release Failures in Billion‑Dollar Transactions