Tagged articles
Qunar Tech Salon
Nov 29, 2022 · Cloud Native

Qunar’s Experience Replacing Prometheus with VictoriaMetrics for Cloud‑Native Container Monitoring

This article details Qunar’s migration from a traditional Prometheus‑based monitoring stack to VictoriaMetrics, describing the challenges of large‑scale container metrics collection, the architectural redesign using VM‑Cluster, vmagent, and vmalert, and the performance improvements achieved after full replacement.

Kubernetes · Prometheus · Time Series Database
14 min read
Alibaba Cloud Native
Nov 27, 2022 · Cloud Native

How HeLian Health Scaled Its Low‑Fault Medical Platform with Cloud‑Native Microservices

HeLian Health’s journey from a fast‑iteration monolith to a stable, cloud‑native microservice architecture illustrates how modular design, Dubbo‑Nacos integration, Kubernetes, and Alibaba Cloud MSE can meet the medical check‑up domain’s low tolerance for faults while boosting release efficiency and reducing operational costs.

Cloud Native · Kubernetes · MSE
10 min read
ITPUB
Nov 25, 2022 · Big Data

How Berserker’s Big Data Platform Solved Scheduling, State and Scaling Challenges

This article details the architecture, evolution, and technical solutions of the Berserker big‑data platform—including component design, state‑management problems, release strategies, two‑phase commit, RPC handling, routing, message queuing, containerized execution, dependency model redesign, and future roadmap—demonstrating how the system achieved high availability, low latency, and scalable operations.

Data Platform · Docker · Kubernetes
19 min read
Tencent Cloud Developer
Nov 24, 2022 · Cloud Native

Large‑Scale Cost Optimization for Kubernetes/TKE: Data Collection, Measures, and Implementation

The article details a Tencent‑led, end‑to‑end cost‑optimization project for large‑scale Kubernetes/TKE clusters that collected extensive workload metrics, applied VPA/HPA enhancements, custom scheduling and node‑downscaling via the open‑source Crane platform, ultimately delivering up to 70% CPU and 50% memory savings with zero‑fault deployments.

HPA · Kubernetes · Resource Management
29 min read
dbaplus Community
Nov 23, 2022 · Operations

Choosing the Right Kubernetes Monitoring Stack: Tools & Best Practices

Monitoring Kubernetes clusters is essential for visibility and scalability, but selecting the right tools can be complex; this article outlines best‑practice approaches and compares popular open‑source solutions such as Prometheus, Grafana, Thanos, Elasticsearch, Logstash, and Kibana, helping you build an effective monitoring stack.

Grafana · Kubernetes · Prometheus
8 min read
MaGe Linux Operations
Nov 22, 2022 · Operations

Why Kubernetes Pods Fail with “Resource temporarily unavailable” – Understanding PID Limits

This article analyzes a Kubernetes‑Docker environment where Java pods encounter “fork: Resource temporarily unavailable” errors, tracing the issue through kernel event logs, ulimit settings, PID limits, and related sysctl parameters, and provides detailed recommendations for kernel and user‑level configuration to prevent such failures.

Docker · Kubernetes · Linux kernel
23 min read
Alibaba Cloud Native
Nov 21, 2022 · Cloud Computing

What’s New in Open Cluster Management v0.9.0 and Future Roadmap?

The Open Cluster Management (OCM) project releases v0.9.0 with enhanced security, service exposure, plugin configurability, API updates, and outlines upcoming features like independent control‑plane deployment, ArgoCD pull‑mode integration, placement strategies, and edge‑scenario support.

ArgoCD · Cloud Native · Cluster‑Proxy
10 min read
Tencent Cloud Developer
Nov 21, 2022 · Operations

DSP Algorithm Principles and Application in Crane Predictive Autoscaling

The article details how Tencent Cloud’s Crane predictive autoscaling leverages a digital signal processing pipeline—transforming Prometheus time‑series data via Fourier analysis, using DFT/FFT to identify periodic patterns, predict load with IFFT, and configure margins, thresholds, and spectrum parameters to overcome HPA’s reactive limits.

Cloud Native Autoscaling · Crane · Fast Fourier Transform
16 min read
Top Architect
Nov 21, 2022 · Cloud Native

Understanding Underlay and Overlay Network Models in Kubernetes

This article provides a comprehensive overview of Kubernetes networking by explaining the concepts of underlay and overlay network models, describing their implementations such as flannel host‑gw, Calico BGP, IPVLAN/MACVLAN, and tunneling technologies like VxLAN and IPIP, and offering practical references for each approach.

CNI · Calico · Flannel
14 min read
AsiaInfo Technology: New Tech Exploration
Nov 21, 2022 · Cloud Native

Zero‑Touch Kubernetes Deployment: A Minimalist, Automated Solution

This article analyzes the challenges of traditional Kubernetes cluster installation, proposes a fully automated, zero‑touch deployment framework that abstracts and packages all required resources, and explores its architecture, implementation using Go, cloud‑edge collaboration with KubeEdge, and future extensions toward CRD‑based operators.

CRD · Cloud Native · Deployment
22 min read
Cloud Native Technology Community
Nov 21, 2022 · Cloud Native

How to Replace Istio Ingress Gateway with Kubernetes Gateway API – Step‑by‑Step Guide

This tutorial explains the background of the Kubernetes Gateway API, compares it with the traditional Ingress API, and provides a hands‑on walkthrough for installing the Gateway API CRDs, configuring Istio to use a Gateway resource, exposing a sample httpbin service, updating routes, and cleaning up the environment.

Cloud Native · Gateway API · Ingress
12 min read
Full-Stack DevOps & Kubernetes
Nov 18, 2022 · Cloud Native

Top 10 Common Kubernetes Issues and How to Fix Them

This guide walks through ten frequent Kubernetes problems—from service access failures and port‑mapping errors to image pull issues and CrashLoopBackOff—explaining root causes, showing exact error messages, and providing concise command‑line solutions to restore cluster health.

DevOps · Kubernetes · cloud-native
6 min read
Open Source Linux
Nov 18, 2022 · Cloud Native

Understanding Kubernetes Underlay and Overlay Network Models: A Deep Dive

This article provides a comprehensive overview of Kubernetes networking, explaining the concepts of underlay and overlay network models, their implementations with Flannel, Calico, IPVLAN, MACVLAN, Multus, DANM, and common tunneling protocols such as VxLAN, IPIP, and GRE, while highlighting practical configuration notes and architectural differences.

CNI · Calico · Flannel
14 min read
360 Smart Cloud
Nov 17, 2022 · Databases

Exploring StarRocks Applications, Performance Tests, and Cloud‑Native Integration at 360

This article reviews the practical applications and experimental explorations of StarRocks at 360, describing the cloud‑native lake‑warehouse product Yunzhou, its three‑tier architecture, performance comparisons with Trino on the TPC‑H 100 GB benchmark, challenges of Kubernetes integration, and future directions for storage‑compute separation.

Big Data · Cloud Native · Kubernetes
7 min read
Tencent Cloud Developer
Nov 16, 2022 · Cloud Native

Prometheus Monitoring Practices for Tencent Happy Dou Dizhu Game

Tencent transformed its popular Happy Dou Dizhu game’s monitoring by migrating to Tencent Cloud Managed Prometheus and Grafana, unifying metric naming, consolidating ServiceMonitors, defining dashboards as code, and avoiding high‑cardinality labels, which cut labor costs by over 30% and greatly improved operational efficiency.

Grafana · Kubernetes · Prometheus
11 min read
Cloud Native Technology Community
Nov 16, 2022 · Cloud Native

How Do Packets Travel Inside and Outside Kubernetes? A Deep Dive into Pods, Network Namespaces, and CNI

This article explains how Kubernetes forwards packets from the initial web request through container networking, covering the network model, pod creation steps, the role of the pause container, same‑node and cross‑node pod‑to‑pod traffic, service IP translation, and the underlying CNI, iptables, and conntrack mechanisms.

CNI · Kubernetes · Network Namespace
28 min read
Alibaba Cloud Big Data AI Platform
Nov 16, 2022 · Cloud Native

SREWorks v1.3 Unveiled: Plugin Architecture, UI Enhancements & Cloud‑Native Ops

Version 1.3 of SREWorks introduces a modular plugin system with independent plugin packages, visual configuration UI, standardized component definition workflow, upgraded application mechanisms separating enterprise and ops apps, offline app packages, remote UMD component loading, new front‑end widgets, and a Grafana‑based streaming job monitoring dashboard.

Grafana · Kubernetes · SREWorks
12 min read
DevOps Cloud Academy
Nov 13, 2022 · Cloud Native

Grafana Phlare: Open‑Source Continuous Profiling Database – Architecture, Features, and Kubernetes Deployment Guide

Grafana Phlare is an open‑source, horizontally scalable continuous profiling database that integrates with Grafana, offering easy installation, multi‑tenant support, and object‑storage‑backed long‑term storage, with detailed deployment instructions for both monolithic and micro‑service modes on Kubernetes using Helm.

Continuous Profiling · Grafana · Kubernetes
11 min read
MaGe Linux Operations
Nov 13, 2022 · Cloud Native

Mastering Kubernetes CNI: Choose the Right Network Plugin for Your Cloud‑Native Apps

This article introduces common container network scenarios and explains how various Kubernetes CNI plugins—Kube‑OVN, Antrea, Cilium, Calico, Flannel, Submariner, and others—implement these functions, guiding users on selecting and deploying the appropriate plugin with tools like Kubekey for multi‑cluster and policy needs.

CNI · CloudNative · Kubekey
20 min read
Tencent Cloud Middleware
Nov 10, 2022 · Cloud Native

How We Scaled Apache Pulsar on Kubernetes for WeChat’s Billion‑User Real‑Time Recommendations

This article details the WeChat engineering team’s practical experience deploying and optimizing Apache Pulsar on Kubernetes for massive real‑time recommendation workloads, covering cloud‑native advantages, non‑persistent topics, load‑balancing tweaks, broker cache improvements, COS offloader development, and future roadmap.

Apache Pulsar · Cloud Native · Kubernetes
13 min read
Alibaba Cloud Native
Nov 9, 2022 · Cloud Native

13 Common Kubernetes Pod Failures and How to Diagnose Them

This article outlines the Kubernetes pod lifecycle, describes the five pod phases, and enumerates 13 typical failure scenarios (including scheduling, image pull, dependency, init container, probe, and OOM issues), giving the error states, root causes, and step‑by‑step kubectl commands for diagnosing and remediating each.

Cloud Native · Kubernetes · Operations
22 min read
Cloud Native Technology Community
Nov 9, 2022 · Cloud Native

How Sidecars Are Revolutionizing Load Testing in Kubernetes

The article explains why traditional load testing struggles with modern, scalable infrastructures and shows how integrating sidecar containers into Kubernetes can simplify traffic capture, enrich replay data, reduce operational overhead, and enable more realistic, automated load‑testing pipelines.

Cloud Native · DevOps · Kubernetes
8 min read
Ops Development Stories
Nov 9, 2022 · Operations

How Grafana Phlare Enables Scalable Continuous Profiling for Cloud‑Native Environments

Grafana Phlare is an open‑source, horizontally scalable continuous profiling database that integrates with Grafana, offering easy installation, multi‑tenant support, cheap object‑storage persistence, and both monolithic and microservice deployment modes, with detailed Helm‑based Kubernetes setup and usage instructions.

Continuous Profiling · Kubernetes · cloud-native
14 min read
Open Source Linux
Nov 7, 2022 · Cloud Native

Unlock Scalable Cloud‑Native Alerting with Grafana Mimir: Architecture & Setup

This article explains the current state of cloud‑native alerting, introduces Grafana Mimir as a horizontally scalable, multi‑tenant storage for Prometheus, details its architecture and components, and provides step‑by‑step guidance for installing, configuring, and operating Mimir in Kubernetes environments.

Alerting · Cloud Native · Kubernetes
24 min read
Cloud Native Technology Community
Nov 7, 2022 · Cloud Computing

How Edge Computing Is Transforming Automotive Manufacturing

This article explores how edge computing, combined with cloud-native technologies, 5G, and GPU acceleration, enables real‑time data processing, intelligent inspection, digital twins, and autonomous driving in the automotive industry, outlining practical architectures, hardware choices, and deployment patterns.

5G · Cloud Native · Edge Computing
19 min read
Programmer DD
Nov 7, 2022 · Cloud Native

Kubernetes Multi‑Tenant Isolation: Strategies, Tools, and How to Choose

This article explains how Kubernetes can be configured for multi‑tenant isolation using control‑plane mechanisms like namespaces, RBAC, and quotas, data‑plane techniques such as container runtimes, storage policies, and network policies, and compares namespace‑based tenancy with virtual control‑plane and multi‑cluster approaches.

Cloud Native · Isolation · Kubernetes
11 min read
Top Architect
Nov 7, 2022 · Cloud Native

Step‑by‑Step Deployment of a Highly Available Kubernetes Cluster with Nginx/Keepalived Load Balancer, Flannel CNI, IPVS, Dashboard, and Harbor Registry

This comprehensive guide walks you through installing Docker and containerd, configuring yum repositories, setting up kubeadm/kubelet/kubectl, initializing a multi‑master Kubernetes cluster, enabling Flannel CNI and IPVS, building a Nginx‑Keepalived HA load balancer, deploying the Kubernetes dashboard, configuring NFS storage with a dynamic provisioner, and installing a secure Harbor image registry for private images.

Flannel · Harbor · Kubernetes
44 min read
MaGe Linux Operations
Nov 6, 2022 · Cloud Native

How to Safely Shut Down and Restart a Kubernetes Cluster

This guide walks you through the essential steps, commands, and precautions for safely draining nodes, backing up applications, CRDs, and etcd, then shutting down and later restarting a Kubernetes cluster while avoiding common pitfalls.

Backup · Cluster Maintenance · Kubernetes
6 min read
Alibaba Cloud Native
Nov 4, 2022 · Cloud Native

How Koordinator Revolutionizes Cloud‑Native Mixed‑Workload Scheduling

Koordinator, an open‑source cloud‑native scheduler launched in April 2022, unifies heterogeneous workloads on Kubernetes through zero‑intrusion plugins, fine‑grained resource oversubscription, QoS‑aware scheduling, and a flexible descheduler framework, dramatically improving resource utilization and latency‑sensitive service performance.

Cloud Native · Kubernetes · Mixed Workload
22 min read
Top Architect
Nov 4, 2022 · Cloud Native

Step-by-Step Guide to Deploy a High‑Availability Kubernetes Cluster with Dashboard, Nginx/Keepalived, NFS, Harbor, and Ingress

This comprehensive tutorial walks through preparing hosts, installing Docker and containerd, setting up Kubernetes components, initializing a HA master cluster, configuring networking, deploying the Kubernetes dashboard, NFS storage, Harbor registry, and an Nginx/Keepalived load balancer, all with detailed commands and configuration files.

Cloud Native · Docker · Harbor
41 min read
Architecture Digest
Nov 4, 2022 · Cloud Native

Containerization Journey at Ximalaya: Practices, Tools, and Lessons Learned

This article recounts Ximalaya’s evolution from early Docker adoption to a mature cloud‑native deployment platform, detailing principles, custom tools such as barge and k8s‑sync, health‑check strategies, multi‑process management, and integration with existing middleware to achieve reliable, zero‑downtime service releases.

Containerization · Deployment · DevOps
18 min read
Open Source Linux
Nov 3, 2022 · Cloud Native

Why kt-connect Beats Telepresence for Kubernetes Local Debugging

This article traces the evolution of local debugging from simple Tomcat setups to Kubernetes, compares proxy tools like port‑forwarding, VPN, Telepresence and kt‑connect, and explains kt‑connect's four operation modes with practical command examples for developers.

Docker · Kubernetes · Spring Cloud
13 min read
Architect
Nov 3, 2022 · Cloud Native

Step-by-Step Guide to Deploying a High‑Availability Kubernetes Cluster with Dashboard and Harbor

This comprehensive tutorial walks through preparing multiple Linux nodes, installing Docker and containerd, setting up kubeadm, kubelet and kubectl, initializing a HA Kubernetes control plane, configuring Flannel networking, deploying the Kubernetes dashboard, installing Nginx + Keepalived for load balancing, setting up NFS with rsync, provisioning storage via Helm, and finally installing a secure Harbor image registry, all with detailed commands and configuration snippets.

Dashboard · HA · Harbor
39 min read
DataFunSummit
Nov 3, 2022 · Artificial Intelligence

Applying NVIDIA MPS to Boost GPU Utilization for Recommendation Inference

This article explains why traditional CPU inference and naïve GPU usage are inefficient for recommendation workloads, introduces NVIDIA Multi‑Process Service (MPS) technology, describes VIVO's custom Rust‑based inference engine and deployment strategies, and presents performance and cost benefits along with practical deployment considerations.

GPU inference · Kubernetes · MPS
13 min read
Efficient Ops
Nov 2, 2022 · Cloud Native

Why Your HPA Isn’t Scaling: 3 Common Misconceptions and How to Fix Them

This article explains three frequent misunderstandings about Kubernetes Horizontal Pod Autoscaler—dead zones, misuse of utilization calculations, and perceived lag in scaling—while detailing HPA’s inner workings, metric sources, calculation methods, and behavior configuration to help you avoid scaling pitfalls.

HPA · Kubernetes · autoscaling
12 min read
DevOps Cloud Academy
Nov 2, 2022 · Cloud Native

An Overview of CI/CD Pipelines with Kubernetes

This article examines CI/CD approaches within a Kubernetes ecosystem, detailing pull- and push-based pipeline architectures, their advantages and disadvantages, key considerations such as automated testing, validation and rollbacks, and reviews popular open-source tools like Spinnaker, Jenkins X and Argo CD.

Cloud Native · DevOps · Kubernetes
19 min read
Alibaba Cloud Native
Nov 1, 2022 · Cloud Native

How BizWorks Harnesses KubeVela for Cloud‑Native Continuous Delivery

This article details how BizWorks integrates KubeVela to build a cloud‑native PaaS platform that streamlines application deployment, resolves fragmented configuration challenges, and delivers continuous integration and delivery capabilities across multi‑cloud environments.

BizWorks · Cloud Native · Continuous Delivery
12 min read
AI Cyberspace
Nov 1, 2022 · Cloud Native

10 Essential Cloud‑Native Tools Every Agile Team Should Use

This article outlines ten indispensable cloud‑native tools—from Docker and Kubernetes to Serverless, Helm, Ansible, and Wireshark—explaining how each supports modern agile development, improves infrastructure stability, and accelerates digital transformation in post‑pandemic enterprises.

Cloud Native · DevOps · Docker
16 min read
Cloud Native Technology Community
Nov 1, 2022 · Cloud Native

Essential kubectl Commands and Concepts for Managing Kubernetes Clusters

This article introduces the role of the Kubernetes API server, explains kubectl syntax and concepts, and provides detailed explanations of eight essential kubectl commands—including listing resources, describing objects, creating and modifying resources, deleting objects, handling PersistentVolumes, configuring security contexts, managing deployments, and executing commands inside pods—helping readers master everyday Kubernetes operations.

CLI · Cloud Native · Kubernetes
11 min read
Architect
Oct 31, 2022 · Cloud Native

Building a Cloud‑Native Container Platform at Ximalaya: Practices, Principles, and Evolution

This article recounts Ximalaya's journey from a simple Docker‑based project template to a full‑featured Kubernetes‑driven container platform, describing the principles, tools, health‑check strategies, deployment patterns, middleware integration, and lessons learned for building reliable cloud‑native services.

Cloud Native · Containerization · Deployment
13 min read
DevOps Cloud Academy
Oct 31, 2022 · Operations

Rolling Deployment Strategy: Advantages, Disadvantages, and Considerations

The rolling deployment strategy incrementally replaces old application instances with new ones, allowing users to encounter both versions during rollout, and is praised for ease of implementation, low risk, and default support in platforms like Kubernetes, though it can be slow, costly for large infrastructures, and may affect user experience.

Deployment Strategy · Kubernetes · Operations
2 min read
Open Source Linux
Oct 30, 2022 · Operations

Unlock Kubernetes Insights: Master Event Types, Monitoring, and Alerting

This guide explains what Kubernetes events are, how to list and filter them, categorizes common event types, and shows practical ways to collect, store, and alert on events using native commands and open‑source tools, helping teams reduce alert fatigue and improve cluster observability.

Alerting · Events · Kubernetes
11 min read
DataFunSummit
Oct 30, 2022 · Big Data

Integrating Apache Spark with Cloud‑Native Technologies: Principles, Kubernetes Deployments, EMR on ACK, and Serverless Spark on DLF

This article examines the challenges of traditional Spark clusters and explains how integrating Spark with cloud‑native platforms—through Kubernetes deployment modes, EMR on ACK practices, Remote Shuffle Service, and serverless Spark on DLF—provides elastic scaling, lower operational costs, and advanced features such as executor rolling and custom scheduler support.

Big Data · DLF · Kubernetes
18 min read
Cloud Native Technology Community
Oct 27, 2022 · Cloud Native

Boost Kubernetes Pod‑to‑Pod Performance with istio‑tcpip‑bypass and eBPF

This article explains how the Intel istio‑tcpip‑bypass plugin leverages eBPF to bypass the host TCP/IP stack for intra‑node pod communication, provides environment setup, step‑by‑step deployment commands, and shows benchmark results that demonstrate 40‑80% latency reduction and throughput improvement.

Kube-OVN · Kubernetes · Performance Testing
6 min read
Java Architect Essentials
Oct 25, 2022 · Cloud Native

Comprehensive Guide to Deploying a Highly Available Kubernetes Cluster with Nginx+Keepalived, Ingress, NFS, and Harbor

This step‑by‑step tutorial walks through setting up a multi‑node Kubernetes cluster, configuring high availability with Nginx and Keepalived, installing Docker and containerd, deploying an NFS provisioner, configuring Ingress, and installing a secure Harbor registry using Helm and custom TLS certificates.

Cloud Native · Docker · Harbor
39 min read
Efficient Ops
Oct 25, 2022 · Cloud Native

How Guangdong Mobile Built a Resilient Container Cloud from Scratch

This article details Guangdong Mobile's end‑to‑end journey of designing, constructing, and operating a production‑grade container cloud platform, covering architecture decisions, monitoring, logging, high‑availability, scaling, network optimization, upgrade challenges, and lessons learned for cloud‑native practitioners.

Cloud Native · DevOps · Kubernetes
26 min read
Cloud Native Technology Community
Oct 25, 2022 · Cloud Native

Harbor: Cloud‑Native Docker Registry Authentication, Installation, and Image Push Guide

This article introduces Harbor, a CNCF‑hosted cloud‑native Docker registry, explains its token‑based authentication flow, details the required Kubernetes and Helm prerequisites, provides step‑by‑step installation and configuration instructions, and demonstrates how to push and pull images using containerd and Kubernetes.

Authentication · Docker Registry · Harbor
17 min read
ByteDance Cloud Native
Oct 24, 2022 · Cloud Native

How KubeGateway Solves kube‑apiserver Load‑Balancing and Traffic Governance

KubeGateway, a custom Layer 7 gateway built by ByteDance, eliminates kube‑apiserver load imbalance and adds comprehensive request governance (including routing, rate limiting, and degradation) by parsing HTTP/2 traffic, supporting flexible policies, and transparently proxying requests without client changes.

Authentication · Kubernetes · gateway
14 min read
Top Architect
Oct 24, 2022 · Backend Development

Evolution of the Tianyi Account Gateway System: From Zuul‑Based 1.0 to Kong‑Based 3.0

The article chronicles the architectural evolution of China Telecom's Tianyi Account gateway—from its 2017 Zuul‑based 1.0 implementation, through a 2.0 redesign using Kong with custom plugins, to the current 3.0 version featuring CP/DP separation, multi‑language plugins, and cloud‑native deployment, highlighting performance gains, scalability, and operational improvements.

Cloud Native · Kong · Kubernetes
14 min read
Full-Stack DevOps & Kubernetes
Oct 24, 2022 · Cloud Native

GitOps on Kubernetes: Secure, Scalable Continuous Deployment Best Practices

GitOps, a cloud‑native continuous deployment approach that uses Git as the single source of truth, enables secure, auditable, and multi‑tenant Kubernetes management by automating state reconciliation, supporting multi‑cluster and multi‑environment deployments, and integrating tools like Argo CD, Flux, Helm, and Kustomize.

Argo CD · Continuous Deployment · DevOps
12 min read
MaGe Linux Operations
Oct 22, 2022 · Cloud Native

How Do Packets Flow Inside and Outside Kubernetes? A Deep Dive into Pod Networking

This article explains how Kubernetes forwards packets from the initial web request through container networking, covering the Kubernetes network model, Linux network namespaces, the role of the pause container, pod‑to‑pod communication on the same and different nodes, CNI plugins, and how services use Netfilter and iptables to rewrite traffic.

CNI · Kubernetes · Pod Networking
25 min read
21CTO
Oct 21, 2022 · Cloud Native

How Ford is Driving Cloud‑Native Transformation: Tools, Culture, and Lessons Learned

Ford’s tech leaders Beckie Riss and Satish Puranam discuss how the automaker’s six‑year cloud‑native journey—spanning Kubernetes, Tekton, Knative, and a culture of curiosity—has reshaped development, accelerated delivery, and tackled the challenges of upskilling engineers and simplifying platform adoption.

DevOps · Digital Transformation · Kubernetes
0 likes · 9 min read
Programmer DD
Oct 21, 2022 · Cloud Native

How Grafana Mimir Transforms Cloud‑Native Monitoring and Alerting

This article explains how Grafana Mimir provides a scalable, highly‑available, multi‑tenant long‑term storage for Prometheus, details its architecture and core components such as compactor, distributor, ingester, querier, query‑frontend and store‑gateway, and shows step‑by‑step installation, status checking, and Alertmanager configuration for cloud‑native environments.

Alertmanager · Cloud Native Monitoring · Grafana Mimir
0 likes · 22 min read
Bilibili Tech
Oct 21, 2022 · Big Data

Kyuubi at Bilibili: Architecture, Enhancements, and Production Practices for Large‑Scale Data Processing

Bilibili adopted the open‑source Kyuubi proxy to replace its unstable STS layer, enabling multi‑tenant, multi‑engine (Spark, Presto, Flink) SQL/Scala processing with Hive Thrift compatibility, fine‑grained queue isolation, UI monitoring, stability safeguards, and Kubernetes/YARN deployment, while planning further cloud‑native extensions.

Big Data · Kubernetes · Kyuubi
0 likes · 20 min read
Hulu Beijing
Oct 21, 2022 · Big Data

How Hulu Scales Spark on Kubernetes: Cloud‑Native Big Data at Disney‑Scale

Hulu’s data platform team describes how they migrated large‑scale Spark workloads from YARN to native Spark on Kubernetes, leveraging AWS services such as EKS and S3 along with custom operators to achieve dynamic scaling, unified monitoring, cost‑effective resource management, and improved stability for search, recommendation, and advertising pipelines.

AWS · Big Data · Cloud Native
0 likes · 18 min read
Alibaba Cloud Developer
Oct 20, 2022 · Cloud Native

Why Kubernetes Remains Complex and How Serverless Designs Aim to Simplify It

The article examines the inherent and accidental complexities of Kubernetes as a distributed cluster manager, discusses challenges in resource scheduling, infrastructure diversity, and operational overhead, and explores how cloud‑native solutions such as managed services, nodeless and serverless Kubernetes architectures attempt to reduce these complexities while introducing new trade‑offs.

Cloud Native · Kubernetes · Operations
0 likes · 18 min read
DevOps Cloud Academy
Oct 20, 2022 · Cloud Native

Comparative Analysis of KubeSphere and Rainbond Cloud‑Native Application Platforms

This article provides a detailed comparative analysis of the cloud‑native application platforms KubeSphere and Rainbond, evaluating their product positioning, community activity, installation experience, application deployment, micro‑service architecture, marketplace features, multi‑cluster management, and operational capabilities to help readers choose the most suitable solution for their needs.

Cloud Native · KubeSphere · Kubernetes
0 likes · 17 min read
Huolala Tech
Oct 20, 2022 · Cloud Native

How Huolala Cuts Cloud Costs with Kubernetes: Spot Instances, Smart Autoscaling, and Predictive Scaling

This presentation details Huolala's end‑to‑end cloud‑native cost‑optimization strategy, covering the company's infrastructure basics, Kubernetes‑based server cost‑saving techniques, a tailored optimization roadmap, practical Spot Instance usage, and a custom CronHPA‑driven scheduled scaling solution to boost resource utilization.

Cloud Native · Cost Optimization · HPA
0 likes · 23 min read
Efficient Ops
Oct 19, 2022 · Big Data

Master Prometheus Monitoring for Big Data on Kubernetes: Design & Alerting

This article explains how to design and implement a Prometheus‑based monitoring system for big‑data components running on Kubernetes, covering metric exposure methods, scrape configurations, exporter deployment, and dynamic alert rule management with Alertmanager.

Alert Rules · Alertmanager · Big Data Monitoring
0 likes · 17 min read
Xiaolei Talks DB
Oct 19, 2022 · Databases

How to Deploy TiDB on Private or Public Cloud with TiDB Operator

This article explains how TiDB, a cloud‑native distributed database, can be deployed on private or public clouds using the TiDB Operator, covering Kubernetes basics, operator concepts, TiDB Operator features, architecture, and upcoming deployment guides.

Database Deployment · Kubernetes · Operator
0 likes · 10 min read
Software Development Quality
Oct 19, 2022 · Cloud Native

Unlocking Developer Speed: Why Internal Development Platforms Matter

Internal Development Platforms (IDPs) unify existing tools and workflows to provide developers with self‑service capabilities, reduce cognitive load, and standardize deployments, while enabling ops teams to automate repetitive tasks, improve productivity, and deliver measurable gains in MTTR, change‑failure rate, deployment frequency, and lead time.

Developer Experience · Internal Development Platform · Kubernetes
0 likes · 11 min read
Cloud Native Technology Community
Oct 18, 2022 · Cloud Native

The Benefits of Containerization and Its Role in Modern DevOps

This article explains what containers are, outlines their advantages such as scalability, portability and DevOps integration, describes popular container images and tools like Docker and Kubernetes, and concludes with a summary of why containerization is essential for agile cloud‑native development.

Containerization · DevOps · Docker
0 likes · 10 min read
Cloud Native Technology Community
Oct 17, 2022 · Cloud Native

A Three‑Step Approach to Understanding, Managing, and Preventing Kubernetes Failures

This article presents a practical three‑step methodology—understanding, managing, and preventing—to troubleshoot Kubernetes deployments, explains how to leverage monitoring, observability, and incident‑response tools, and offers guidance on fostering team collaboration and building resilient, self‑healing cloud‑native systems.

Cloud Native · Kubernetes · Operations
0 likes · 7 min read
Open Source Linux
Oct 16, 2022 · Cloud Native

Mastering nsenter: Debug Container Networks and Linux Namespaces

This article explains how the nsenter command from util‑linux lets you enter a container's network or other namespaces to run host tools for debugging, details its syntax, provides practical examples with Docker and Kubernetes, and covers the underlying concepts of Linux namespaces, clone, and setns.

Kubernetes · Linux Namespaces · clone
0 likes · 9 min read
Alibaba Cloud Native
Oct 13, 2022 · Cloud Native

How OpenSergo Enables Unified Traffic Routing for Cloud‑Native Microservices

This article explains the concept of traffic routing in microservice governance, outlines the challenges of heterogeneous environments, introduces the OpenSergo project as a unified, multi‑language solution, and provides detailed CRD examples and demo steps for implementing tag, canary, and full‑link gray routing in Kubernetes.

CRD · Kubernetes · Microservices
0 likes · 16 min read
Programmer DD
Oct 12, 2022 · Cloud Native

Istio Joins CNCF: Exploring the New Ambient Mesh and Its Impact

Google donated Istio to the CNCF, where it was accepted as an incubating project. The article covers the experimental Ambient Mesh mode, which is introduced alongside continued sidecar support, and outlines Istio's architecture, history, community contributions, and industry perspectives on this cloud‑native milestone.

Ambient Mesh · CNCF · Cloud Native
0 likes · 6 min read
Open Source Linux
Oct 11, 2022 · Cloud Native

How to Build a Fully HA Kubernetes Cluster with Nginx, Keepalived, and Harbor

This step‑by‑step guide walks you through deploying a production‑grade Kubernetes environment, covering node preparation, Docker and containerd setup, kubeadm initialization, high‑availability configuration with Nginx and Keepalived, installing the dashboard, and setting up a private Harbor registry with NFS storage, all using cloud‑native best practices.

Docker · Harbor · Kubernetes
0 likes · 41 min read
Efficient Ops
Oct 11, 2022 · Cloud Native

How Vivo Built a Scalable Karmada Operator with Ansible for Multi‑Cluster Management

Vivo’s engineering team shares their practical experience creating a Karmada‑Operator using the Operator SDK and Ansible, detailing background, deployment challenges, design choices, API and architecture, etcd management, member cluster handling, CI pipeline, and performance testing to enable robust multi‑cloud Kubernetes orchestration.

Ansible · Cloud Native · Karmada
0 likes · 15 min read
MaGe Linux Operations
Oct 10, 2022 · Cloud Native

Unlock Scalable Cloud‑Native Alerting with Grafana Mimir: Architecture, Components, and Setup

This article explains how Grafana Mimir extends Prometheus and Alertmanager to provide a horizontally scalable, highly available, multi‑tenant monitoring solution for Kubernetes, covering its architecture, key components, compression mechanisms, deployment steps, and configuration of Alertmanager and multi‑tenant support.

Alertmanager · Cloud Native Monitoring · Grafana Mimir
0 likes · 23 min read
Alibaba Cloud Native
Oct 10, 2022 · Cloud Native

What’s New in Koordinator v0.7? Enhanced Coscheduling, ElasticQuota, and Fine‑Grained GPU Sharing

Koordinator v0.7 adds major cloud‑native scheduling features—including enhanced gang (coscheduling) with Strict/NonStrict modes, multi‑hierarchy ElasticQuota management, fine‑grained GPU resource protocols, richer diagnostic APIs, and safer descheduling—targeting machine‑learning and big‑data workloads on Kubernetes.

Cloud Native · Coscheduling · ElasticQuota
0 likes · 25 min read
MaGe Linux Operations
Oct 9, 2022 · Big Data

Master Flink on Kubernetes: Step‑by‑Step Deployment Guide

This guide walks you through deploying Apache Flink on Kubernetes, covering runtime modes, building Docker images, creating ConfigMaps and Services, launching session and application clusters, submitting jobs, monitoring the Web UI, and cleaning up resources, all with practical code snippets and commands.

Big Data · Docker · Flink
0 likes · 26 min read
Alibaba Cloud Native
Oct 9, 2022 · Cloud Computing

OpenKruise v1.3 Highlights: Custom Probes, SidecarSet Upgrades, and Faster Scaling

OpenKruise v1.3 introduces the PodProbeMarker CRD for custom probes, adds new features to CloneSet, WorkloadSpread, Advanced DaemonSet, and SidecarSet—including historical version injection and pod annotation policies—while delivering significant performance gains for large‑scale clusters through queue optimizations and reduced CPU/memory usage.

CRD · Custom Probe · Kubernetes
0 likes · 12 min read
DevOps Cloud Academy
Oct 4, 2022 · Operations

Production Considerations for Deploying Linkerd: HA, Helm Charts, Prometheus, and Multi‑Cluster

This article explains how to prepare Linkerd for production use by covering high‑availability deployment, Helm chart installation, Prometheus metric handling, external Prometheus integration, multi‑cluster communication, and additional operational best‑practices such as resource tuning and security considerations.

Kubernetes · Linkerd · Multi‑Cluster
0 likes · 12 min read
Alibaba Cloud Native
Oct 4, 2022 · Cloud Native

How Service Mesh Redefines Cloud‑Native Networking, Security, and Observability

This article explains the fundamentals of service mesh as a cloud‑native infrastructure layer, covering its control‑plane and data‑plane architecture, sidecar and waypoint proxies, L4/L7 decoupling, eBPF acceleration, zero‑trust security, traffic management, observability, and real‑world deployment scenarios.

Cloud Native · Kubernetes · Zero Trust
0 likes · 20 min read
Alibaba Cloud Native
Oct 3, 2022 · Cloud Native

Can AHPA Predict Kubernetes Scaling Before Load Spikes?

This article introduces the Advanced Horizontal Pod Autoscaler (AHPA), explains its three‑stage architecture of data collection, prediction, and scaling, details the RobustScaler forecasting algorithm and CRD‑based deployment, and evaluates its ability to proactively and reactively adjust pod counts with high robustness.

CRD · Cloud Native · Kubernetes
0 likes · 13 min read
Tencent Cloud Developer
Sep 29, 2022 · Cloud Native

Improving Kubernetes Cluster Utilization: Practices and Optimization Strategies

The session detailed how Tencent’s container experts boost Kubernetes cluster utilization by correcting pod resource requests, employing two‑level auto‑scaling, dynamic over‑commit, adaptive scheduling and eviction, and using HPA/EHPA/VPA, achieving up to 38.7% node usage and roughly 60% cost savings in real‑world workloads.

Auto Scaling · Kubernetes · Pod Scheduling
0 likes · 11 min read
NetEase Smart Enterprise Tech+
Sep 29, 2022 · Cloud Native

How to Deploy SaaS Private Cloud with Kubernetes and KubeSphere: A Complete Guide

This article explains the challenges of SaaS private‑cloud deployment, compares source‑code and container deployment, introduces Kubernetes and KubeSphere, outlines required infrastructure, provides Docker and Harbor installation steps, and shows how to use KubeKey to configure and manage a full‑stack private cloud platform.

Cloud Native · Deployment · Docker
0 likes · 17 min read
Open Source Linux
Sep 28, 2022 · Cloud Native

What Really Makes a Kubernetes Pod? Inside Its Isolation, Namespaces & Cgroups

This article explores how Kubernetes Pods differ from simple containers by examining their underlying implementation, shared network and IPC namespaces, cgroup hierarchies, and the role of the pause sandbox, while also demonstrating how similar pod-like behavior can be achieved using Docker and cgroup tools.

Container · Docker · Kubernetes
0 likes · 20 min read