Cloud Native 12 min read

Why OpenKruise v1.0 Is a Game-Changer for Cloud‑Native Workloads

OpenKruise v1.0, the CNCF‑hosted Kubernetes extension, expands beyond basic workloads with enhanced Workload types, in‑place upgrades, high‑availability safeguards, and advanced operational tools, drawing on Alibaba’s massive container experience and now adopted by over 35 global adopters.

Alibaba Cloud Native
Alibaba Cloud Native
Alibaba Cloud Native
Why OpenKruise v1.0 Is a Game-Changer for Cloud‑Native Workloads

Introduction

In December 2021, the CNCF‑incubated open‑source project OpenKruise announced its v1.0 release. Built as an extension suite for Kubernetes, OpenKruise focuses on cloud‑native application automation, covering deployment, release, operations, and availability protection.

Background and Origin

OpenKruise originates from Alibaba’s decades‑long practice of large‑scale container deployment and management. Alibaba migrated its legacy T4 containers to the Alibaba Serverless Infrastructure (ASI), which extends native Kubernetes with standardized APIs for complex enterprise scenarios. In June 2019, the first preview of OpenKruise was released and announced at KubeCon.

Open Source Governance

The Alibaba Cloud team designed an open collaboration model: the core repository lives entirely in the community on GitHub, while an internal fork holds only a few adaptation interfaces. All development, pull‑requests, and reviews happen openly, keeping internal‑external code consistency above 95%.

v1.0 Feature Highlights

1. Enhanced Workload Types

OpenKruise adds several advanced Workload resources such as CloneSet , Advanced StatefulSet , and SidecarSet . These address the 40‑60% coverage gap of native Kubernetes workloads in large‑scale production environments, offering better scaling elasticity and rollout capabilities.

CloneSet vs Deployment scaling and rollout comparison
CloneSet vs Deployment scaling and rollout comparison

2. In‑Place Upgrade Improvements

The v1.0 release dramatically strengthens in‑place upgrades, allowing Pods to retain their identity (IP, volume, node) while updating container images or environment variables. Two main mechanisms are supported:

Image‑based in‑place upgrade: the Kruise controller updates the image field, triggering kubelet to replace only the container image.

Downward‑API‑based upgrade: changes to labels/annotations propagate via kruise‑daemon, which restarts the affected containers without recreating the Pod.

In‑place upgrade workflow diagram
In‑place upgrade workflow diagram

3. High‑Availability Protection

OpenKruise introduces a safeguard against cascading deletions. By labeling CRDs, namespaces, or workloads, Kruise validates delete operations and blocks those that would unintentionally remove active Pods. It also provides an enhanced Pod Disruption Budget called Pod Unavailable Budget (PUB) , protecting against any operation that could make Pods unavailable, including deletions and in‑place upgrades.

Deleting a CRD removes all its CRs; deleting a namespace removes all resources inside; deleting a workload removes its Pods.

4. Operational Extensions

To overcome Kubernetes’ limited runtime control, OpenKruise adds a node‑level component kruise‑daemon. This daemon interprets custom resources and talks to the Container Runtime Interface (CRI), enabling features such as image pre‑warming, container restart, cross‑namespace ConfigMap/Secret distribution, and ordered container startup.

Adoption and Community Impact

More than 35 organizations—including Alibaba, Ant Group, Meituan, Ctrip, NetEase, Xiaomi, OPPO, Suning, Lyft, Bringg, and Shopee—have deployed OpenKruise in production. The project was donated to the CNCF in November 2020 and is slated for CNCF incubation in early 2022.

Future Directions

OpenKruise is exploring a “ControllerMesh” project that proxies operator‑to‑apiserver traffic, enabling multi‑tenant deployment, dynamic isolation, gray‑scale upgrades, fault injection, and client‑side rate limiting without modifying the operators themselves.

Overall, OpenKruise has matured its workload management capabilities and is now expanding into runtime extensions, positioning itself as a comprehensive solution for cloud‑native application automation.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Cloud NativeKubernetesOpenKruiseWorkload ManagementIn‑Place Upgrade
Alibaba Cloud Native
Written by

Alibaba Cloud Native

We publish cloud-native tech news, curate in-depth content, host regular events and live streams, and share Alibaba product and user case studies. Join us to explore and share the cloud-native insights you need.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.