Chaos Mesh: A Cloud‑Native Chaos Engineering Platform for Kubernetes
Chaos Mesh, a CNCF‑hosted cloud‑native chaos engineering platform, orchestrates fault injection experiments in Kubernetes through components like the Chaos Operator and Dashboard, supporting various CRD types such as DNSChaos, PodChaos, and NetworkChaos to simulate failures ranging from pod kills to network partitions.
Chaos Mesh is a CNCF‑hosted cloud‑native chaos engineering platform that enables the orchestration of fault‑injection experiments within Kubernetes clusters. The platform consists of two main components: the open‑source Chaos Operator, which handles the core orchestration logic, and the Chaos Dashboard, a web UI for designing, managing, and monitoring experiments.
The Chaos Operator injects chaos into applications and Kubernetes infrastructure using CustomResourceDefinitions (CRDs). It comprises a controller‑manager for scheduling and managing CRD lifecycles and a privileged daemon (Chaos‑daemon) that runs on each node with access to network, cgroup, and other system resources.
Supported CRD types include DNSChaos, PodChaos, PodIOChaos, PodNetworkChaos, NetworkChaos, IOChaos, TimeChaos, StressChaos, and KernelChaos. These enable a wide range of failure scenarios such as pod‑kill, pod‑failure, container‑kill, network latency/reordering (netem), network partitions, I/O delays or errors, clock skew, CPU and memory stress, and DNS errors.
To get started, users are directed to the official Chaos Mesh documentation for installation and usage instructions.
Numerous organizations have adopted Chaos Mesh for reliability testing, including Authzed (using TimeChaos to fake vDSO time calls), ByteDance, DataStax, DigitalChina, KingNet, NetEase Fuxi Lab, Percona, PingCAP, Prudential, Qiniu Cloud, RabbitMQ, Tencent, Xpeng Motors, and Maycur. These adopters employ the platform to validate service resilience, simulate network issues, test distributed systems, and generate fault‑injection datasets.
Vendors such as Civo, KubeSphere, and Microsoft integrate Chaos Mesh into their Kubernetes marketplaces or cloud services, allowing users to deploy the platform easily and run chaos experiments on managed clusters like Azure AKS.
The article concludes with references to the original source and various community channels for further discussion and resources.
Architects Research Society
A daily treasure trove for architects, expanding your view and depth. We share enterprise, business, application, data, technology, and security architecture, discuss frameworks, planning, governance, standards, and implementation, and explore emerging styles such as microservices, event‑driven, micro‑frontend, big data, data warehousing, IoT, and AI architecture.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.