Tagged articles
307 articles
Page 4 of 4
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 27, 2017 · Operations

How Alibaba Scales DevOps with StarOps: Inside Their Operations Platform

This article explains how Alibaba has evolved its DevOps practice over a decade, detailing the layered architecture of its StarOps suite—including the foundational StarAgent, the Fortress (jump server), the Qingting file‑distribution system, and intelligent AIOps features—showing how automation, scalability, and AI‑driven monitoring enable stable, low‑cost operations for massive workloads such as Double 11.

AutomationInfrastructureaiops
0 likes · 17 min read
How Alibaba Scales DevOps with StarOps: Inside Their Operations Platform
Efficient Ops
Efficient Ops
Sep 10, 2017 · Operations

How We Built a Scalable, High‑Availability Monitoring Platform with Service Trees

This article details the challenges of traditional monitoring systems, the design and implementation of a custom high‑availability monitoring platform using a Golang‑based service tree, Raft‑backed storage, InfluxDB for time‑series data, and a modular architecture that supports Windows agents, third‑party reporting, and AI‑driven future enhancements.

InfluxDBOpsaiops
0 likes · 13 min read
How We Built a Scalable, High‑Availability Monitoring Platform with Service Trees
Efficient Ops
Efficient Ops
Aug 22, 2017 · Artificial Intelligence

How Smart Operations (AIOps) Can Bridge Industry and Academia

At APMCon 2017, Tsinghua professor Pei Dan outlined the research challenges of intelligent operations, emphasizing the need to define clear algorithmic problems, foster open‑source collaboration between industry and academia, and build a problem‑library to accelerate AIOps adoption across enterprises.

Industry-Academia CollaborationIntelligent Operationsaiops
0 likes · 25 min read
How Smart Operations (AIOps) Can Bridge Industry and Academia
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
May 11, 2017 · Operations

How Heka + ELK Supercharges Mobile Game Log Management

Discover how integrating Heka with the ELK stack—Elasticsearch, Logstash, and Kibana—enables mobile game developers to efficiently collect, store, analyze, and visualize logs, offering rapid troubleshooting, automated alerts, and scalable AIOps capabilities that boost operational responsiveness and game performance.

ELKHekaKibana
0 likes · 9 min read
How Heka + ELK Supercharges Mobile Game Log Management