Limitations and Challenges of Kubernetes in Cluster Management and Application Scenarios
The article examines Kubernetes' widespread adoption, outlines its scalability and multi‑cluster management constraints, discusses practical application scenarios such as deployment models, batch scheduling, and hard multi‑tenancy, and highlights the gaps that still limit its use in large‑scale production environments.
Kubernetes, released in 2014, has become the de‑facto standard for container orchestration, with most developers and about 75% of production environments now using it.
Despite its popularity, Kubernetes has several limitations, and understanding them is essential to using it effectively. This article analyzes these limits from two perspectives: cluster management and application scenarios.
Cluster Management
A cluster is a set of computers that work together as a single resource pool for scheduling containers. The following discusses complex issues Kubernetes faces in managing clusters.
Horizontal Scalability
Cluster size is a key metric for evaluating resource management systems, yet Kubernetes can manage far fewer nodes than comparable systems. Scale matters economically: an 8‑CPU, 16‑GB VM on AWS costs about $150 per month (≈¥1,000), so a 5,000‑node cluster of such machines costs roughly $750,000 per month (≈¥5,000,000), and a 1% improvement in utilization saves about $7,500 (≈¥50,000) every month.
Kubernetes officially supports up to 5,000 nodes, 150,000 Pods, 300,000 containers, and 110 Pods per node, an order of magnitude smaller than Apache Mesos (tens of thousands of nodes) or Hadoop YARN (50,000 nodes). Even with community optimizations, scaling beyond a few thousand nodes often hits bottlenecks in etcd, the API server, the scheduler, and the controllers.
Large enterprises must limit certain Kubernetes features or add caches to the API server to achieve stable scaling; community‑driven changes are possible but require coordinated effort.
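For illustration only, the kind of API‑server tuning involved can be sketched as kube-apiserver flags in a static‑Pod manifest; the flag values below are assumptions for a large cluster, not recommendations, and must be sized against real workloads:

```yaml
# Fragment of a static-Pod manifest for kube-apiserver (illustrative values).
# --watch-cache-sizes enlarges the per-resource watch cache so heavy
# LIST/WATCH traffic from kubelets and controllers is served from memory
# rather than hitting etcd; the inflight limits bound concurrent requests.
apiVersion: v1
kind: Pod
metadata:
  name: kube-apiserver
  namespace: kube-system
spec:
  containers:
    - name: kube-apiserver
      image: registry.k8s.io/kube-apiserver:v1.28.0
      command:
        - kube-apiserver
        - --watch-cache-sizes=pods#10000,nodes#5000
        - --max-requests-inflight=3000
        - --max-mutating-requests-inflight=1000
```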
Multi‑Cluster Management
Even a massive single cluster cannot solve all enterprise problems; managing multiple clusters is essential. The SIG Multi‑Cluster group works on solutions, but challenges include resource imbalance, cross‑cluster access difficulty, and higher operational cost.
kubefed
kubefed provides a federated control plane for managing resources and networking across clusters. It creates federated objects such as FederatedDeployment that are translated into regular Deployment objects in each member cluster.
```yaml
kind: FederatedDeployment
...
spec:
  ...
  overrides:
    # Apply overrides to cluster1
    - clusterName: cluster1
      clusterOverrides:
        # Set the replicas field to 5
        - path: "/spec/replicas"
          value: 5
        # Set the image of the first container
        - path: "/spec/template/spec/containers/0/image"
          value: "nginx:1.17.0-alpine"
        # Ensure the annotation "foo: bar" exists
        - path: "/metadata/annotations"
          op: "add"
          value:
            foo: bar
        # Ensure an annotation with key "foo" does not exist
        - path: "/metadata/annotations/foo"
          op: "remove"
        # Add an argument `-q` at index 0 of the args list;
        # this shifts the existing arguments, if any
        - path: "/spec/template/spec/containers/0/args/0"
          op: "add"
          value: "-q"
```

kubefed also supports more advanced strategies via ReplicaSchedulingPreference, which distributes replicas across clusters based on weight and capacity.
```yaml
apiVersion: scheduling.kubefed.io/v1alpha1
kind: ReplicaSchedulingPreference
metadata:
  name: test-deployment
  namespace: test-ns
spec:
  targetKind: FederatedDeployment
  totalReplicas: 9
  clusters:
    A:
      minReplicas: 4
      maxReplicas: 6
      weight: 1
    B:
      minReplicas: 4
      maxReplicas: 8
      weight: 2
```

Cluster API (SIG Cluster Lifecycle) offers a declarative API for provisioning, updating, and operating multiple clusters. Its key resource, Machine, represents a node that is created, updated, or deleted by provider‑specific controllers.
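To make the Machine abstraction concrete, here is a hedged sketch of a Cluster API Machine object, assuming the AWS infrastructure provider; all names and versions are illustrative:

```yaml
# A Machine declares a desired node; the referenced bootstrap and
# infrastructure objects are reconciled by provider-specific controllers.
apiVersion: cluster.x-k8s.io/v1beta1
kind: Machine
metadata:
  name: worker-0
  namespace: default
spec:
  clusterName: my-cluster
  version: v1.28.0
  bootstrap:
    configRef:
      apiVersion: bootstrap.cluster.x-k8s.io/v1beta1
      kind: KubeadmConfig
      name: worker-0-bootstrap
  infrastructureRef:
    apiVersion: infrastructure.cluster.x-k8s.io/v1beta1
    kind: AWSMachine      # depends on the provider in use
    name: worker-0-infra
```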
Application Scenarios
The following sections discuss interesting Kubernetes application scenarios, including deployment models, batch scheduling, and hard multi‑tenancy, which are current community focus areas and also notable blind spots.
Application Distribution
Kubernetes core provides three basic workload resources: Deployment (stateless services), StatefulSet (stateful services), and DaemonSet (node‑level daemons). While they cover ~90% of cases, more complex workloads rely on CRDs and SIG Apps contributions.
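As a quick illustration of the simplest of the three, a minimal Deployment for a stateless service; the name, labels, and image are placeholders:

```yaml
# Minimal stateless workload: 3 identical, interchangeable replicas.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: nginx
spec:
  replicas: 3
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
        - name: nginx
          image: nginx:1.17.0-alpine
          ports:
            - containerPort: 80
```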
Batch Scheduling
Machine‑learning, batch, and streaming workloads have never been Kubernetes' strong suit; many organizations still use Hadoop YARN for batch processing. The scheduler framework now supports advanced concepts like PodGroup for co‑scheduling, useful for Spark or TensorFlow jobs.
```yaml
# PodGroup CRD spec
apiVersion: scheduling.sigs.k8s.io/v1alpha1
kind: PodGroup
metadata:
  name: nginx
spec:
  scheduleTimeoutSeconds: 10
  minMember: 3
---
# Label a Pod to mark that it belongs to the group
labels:
  pod-group.scheduling.sigs.k8s.io: nginx
```

Volcano, a Kubernetes‑native batch system, supports frameworks such as TensorFlow, Spark, PyTorch, and MPI, but Kubernetes still lags behind dedicated batch systems like YARN.
Hard Multi‑Tenancy
Hard multi‑tenancy—isolating tenants so they do not affect each other—is still difficult for Kubernetes. Namespaces provide logical separation, but they cannot guarantee resource isolation for CPU, I/O, network, or cache. The community’s multi‑tenancy working group has produced limited results so far.
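Namespaces can at best bound a tenant's aggregate consumption. A sketch using a standard ResourceQuota (values are illustrative) shows why this is soft rather than hard isolation: it caps CPU and memory totals but does nothing for I/O, network, or cache contention:

```yaml
# Per-namespace quota for one tenant; caps totals, not interference.
apiVersion: v1
kind: ResourceQuota
metadata:
  name: tenant-a-quota
  namespace: tenant-a
spec:
  hard:
    requests.cpu: "20"
    requests.memory: 64Gi
    limits.cpu: "40"
    limits.memory: 128Gi
    pods: "100"
```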
Kubernetes incurs high overhead for small clusters because a stable control plane requires at least three etcd nodes.
Containers share underlying hosts, leading to potential interference when CPU, memory, I/O, or network resources are not fully isolated.
Conclusion
Every technology has a lifecycle; lower‑level technologies tend to last longer. Kubernetes dominates container orchestration today, but its limitations mean that future tools may eventually replace it. Understanding both strengths and weaknesses helps practitioners use it wisely and stay prepared for the next generation of orchestration platforms.
Sohu Tech Products
A knowledge-sharing platform for Sohu's technology products. As a leading Chinese internet brand with media, video, search, and gaming services and over 700 million users, Sohu continuously drives tech innovation and practice. We’ll share practical insights and tech news here.