Cloud Native 9 min read

Graceful Shutdown in Kubernetes and Integration with AWS Load Balancer

This article explains how to perform graceful shutdown of Pods within a Kubernetes cluster, details the internal shutdown workflows, and describes strategies to ensure that AWS Load Balancer updates are synchronized to avoid traffic loss, including using IP mode and sidecar approaches.

DataFunTalk

Jun 9, 2021

Graceful Shutdown in Kubernetes and Integration with AWS Load Balancer

In Q1 2020 SmartNews began promoting Kubernetes usage internally and encountered frequent complaints that during Deployment rolling upgrades the external AWS Load Balancer would lose traffic, prompting the need for graceful termination that first detaches traffic from Pods before deletion.

The article first discusses how to achieve graceful shutdown inside a Kubernetes cluster, referencing the official Kubernetes documentation and showing a diagram of the Pod deletion process.

When a Pod deletion request reaches the API server, two asynchronous workflows are triggered: one to close network connections and another to terminate the application. The network workflow removes the Pod from Endpoints, notifies the API server, and causes kube‑proxy and CoreDNS to update their rules. The application workflow has kubelet observe the event, execute any preStop hook, send a SIGTERM to the process, allow it to exit gracefully, and finally send SIGKILL if the process does not exit in time.

To guarantee graceful termination, the application workflow must run longer than the network workflow; common solutions include configuring a preStop hook that sleeps for a period after receiving SIGTERM so that connection cleanup can finish.

The article notes that if controller‑manager, CoreDNS, or kube‑proxy malfunction, traffic loss can still occur despite a long graceful shutdown period.

Section 02 addresses integration with AWS Load Balancer. Unlike the internal shutdown, external termination requires the AWS Load Balancer Controller to synchronize Ingress state with the AWS Load Balancer, which is subject to API latency and rate limiting, often causing traffic loss.

An architecture diagram of the AWS Load Balancer Controller is presented, followed by a list of its components:

Controller watches Ingress resources and syncs state to the AWS Load Balancer.

Ingress creates an ALB that can be configured via annotations.

Target Groups are created for each Service defined in the Ingress.

Listeners handle ports, also configurable via annotations.

Rules map paths to Services.

Two main solutions are proposed:

Use IP mode instead of instance mode for Ingress target type to reduce the number of AWS API calls caused by node additions or removals.

Deploy a sidecar container that, during the preStop hook, polls target group status and only exits when the target is draining, ensuring the correct order of traffic removal and application shutdown.

The article concludes that graceful shutdown is a universal requirement for Kubernetes platforms, emphasizing the declarative nature of the Kubernetes API and the need for applications to handle SIGTERM properly.

SmartNews mitigates AWS API rate‑limit issues by reducing unnecessary calls and converting asynchronous problems into synchronous ones to preserve execution order.

References: [1] Elastic Load Balancer: https://aws.amazon.com/cn/elasticloadbalancing/ [2] Graceful shutdown and zero‑downtime deployments in Kubernetes: https://learnk8s.io/graceful-shutdown [3] How AWS Load Balancer controller works: https://kubernetes-sigs.github.io/aws-load-balancer-controller/v2.2/how-it-works/

For further discussion, readers are invited to comment on the SmartNews CN public account.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Kubernetes Ingress Graceful Shutdown AWS Load Balancer

Written by

DataFunTalk

Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.