Tag

capacity scaling


AntTech
Mar 12, 2022 · Operations

Evolution of Large‑Scale Distributed System Stability at Ant Group

The article outlines Ant Group's multi‑stage journey of building large‑scale distributed system stability, describing architectural evolutions, risk‑inspection mechanisms, high‑availability solutions such as LDC and fine‑grained traffic scheduling, and intelligent risk‑defense products that together enable resilient, cost‑effective operations.

High Availability · Operations · capacity scaling
0 likes · 15 min read
Efficient Ops
Jul 4, 2015 · Operations

From Xiaomi to a Trading Exchange: Real‑World Automation Ops Case Studies

This article presents two practical automation operations case studies—Xiaomi's three‑year journey to platform‑managed, self‑scheduling services and a trading exchange's step‑by‑step build from zero automation—highlighting standards, tooling, and cultural challenges for modern ops teams.

Deployment · DevOps · Operations
0 likes · 9 min read