Tagged articles
4047 articles
Page 25 of 41
Ops Development Stories
Ops Development Stories
Feb 23, 2022 · Cloud Native

Why Your Kubernetes Pod Can't Reach the Server: DNS Search Domain Pitfalls and Fixes

An agent service running in a Kubernetes pod appeared healthy but failed to receive heartbeats due to DNS resolution errors caused by an unintended 'HOST' search domain, leading to incorrect IP resolution; the article details the investigation, explains Kubernetes DNS mechanics, and shows how adjusting ndots or using fully qualified names resolves the issue.

DNSKubernetesPod
0 likes · 18 min read
Why Your Kubernetes Pod Can't Reach the Server: DNS Search Domain Pitfalls and Fixes
IT Architects Alliance
IT Architects Alliance
Feb 22, 2022 · Cloud Native

Evolution of Microservice Architecture and Its Essential Technology Stack

This article traces the evolution of software architecture from monolithic to microservice models, explains key concepts such as service mesh, distributed tracing, and container orchestration, and lists the essential tools and frameworks—including Docker, Kubernetes, Ocelot, Consul, and Jenkins—required to build and operate modern cloud‑native microservice systems.

ContainerizationDistributed SystemsKubernetes
0 likes · 17 min read
Evolution of Microservice Architecture and Its Essential Technology Stack
Efficient Ops
Efficient Ops
Feb 22, 2022 · Operations

Tackling Cloud‑Native Ops Challenges: Real‑World Practices from NetEase

NetEase’s cloud‑native operations team shares how they confront new challenges of Kubernetes adoption—ranging from technical stack shifts and knowledge‑base gaps to capacity planning, automated diagnostics, monitoring, alerting, and cost‑saving strategies—offering practical insights for building efficient, stable, and scalable ops systems.

Cost OptimizationKubernetesautomation
0 likes · 22 min read
Tackling Cloud‑Native Ops Challenges: Real‑World Practices from NetEase
MaGe Linux Operations
MaGe Linux Operations
Feb 22, 2022 · Cloud Native

Explore Kubernetes CI/CD Through an Engaging Comic

This article presents a comic series that visually explains Kubernetes fundamentals and demonstrates how to implement continuous integration and continuous delivery pipelines on the platform, offering a concise, illustrated guide for developers and DevOps engineers.

Cloud NativeComicDevOps
0 likes · 4 min read
Explore Kubernetes CI/CD Through an Engaging Comic
Cloud Native Technology Community
Cloud Native Technology Community
Feb 21, 2022 · Industry Insights

What the Sysdig 2022 Cloud‑Native Security Report Reveals About Container Risks

The Sysdig 2022 Cloud‑Native Security and Usage Report shows that a majority of production containers and images contain high‑severity vulnerabilities, many cloud accounts expose S3 buckets, and mis‑configured resource limits lead to significant cost overruns, highlighting urgent security and operational challenges for enterprises adopting cloud‑native technologies.

Cloud NativeContainer SecurityIndustry Insights
0 likes · 8 min read
What the Sysdig 2022 Cloud‑Native Security Report Reveals About Container Risks
Alibaba Cloud Native
Alibaba Cloud Native
Feb 20, 2022 · Cloud Native

What’s Driving the Cloud Native Revolution? Trends, Tech, and Future Directions

This article summarizes a cloud‑native expert’s talk, covering the definition and community origins of cloud native, CNCF’s role, current industry trends such as distributed cloud, AI workloads, FinOps, core technologies like containers, microservices, serverless, and Alibaba Cloud’s product ecosystem and open‑source contributions.

AICloud NativeFinOps
0 likes · 13 min read
What’s Driving the Cloud Native Revolution? Trends, Tech, and Future Directions
MaGe Linux Operations
MaGe Linux Operations
Feb 18, 2022 · Operations

Master Continuous Profiling on Kubernetes with Pyroscope: A Step‑by‑Step Guide

Learn how to use the open‑source Pyroscope continuous profiler to identify performance bottlenecks in Kubernetes‑deployed microservices, compare it with other profilers, and follow detailed installation, configuration, and integration steps for Python, Go, and .NET services, including Helm deployment and resource monitoring.

Continuous ProfilingKubernetesMicroservices
0 likes · 11 min read
Master Continuous Profiling on Kubernetes with Pyroscope: A Step‑by‑Step Guide
iQIYI Technical Product Team
iQIYI Technical Product Team
Feb 18, 2022 · Cloud Native

CPU Share Syncer: Enabling High‑Priority Task CPU Preemption in iQIYI Video Production Kubernetes Clusters

iQIYI’s cpu‑share‑syncer daemon runs on every node, reads a pod’s iqiyi.com/cpu‑share annotation, updates the pod’s cpu.shares after disabling the Kubernetes CPU CFS quota, and lets high‑priority video‑production pods pre‑empt CPU from lower‑priority pods, significantly speeding task execution.

CPU schedulingDaemonSetHigh priority tasks
0 likes · 13 min read
CPU Share Syncer: Enabling High‑Priority Task CPU Preemption in iQIYI Video Production Kubernetes Clusters
Architect
Architect
Feb 18, 2022 · Cloud Native

Large‑Scale etcd Cluster Performance Optimization and Pod Data Splitting in Ant Group’s Sigma

This article describes how Ant Group tackled the performance ceiling of its massive Sigma Kubernetes clusters by horizontally splitting etcd storage for Pods, Leases and Events, redesigning watch handling to avoid component restarts, and using snapshot‑based migration to preserve data integrity while reducing latency.

Cluster PerformanceData MigrationKubernetes
0 likes · 27 min read
Large‑Scale etcd Cluster Performance Optimization and Pod Data Splitting in Ant Group’s Sigma
Meituan Technology Team
Meituan Technology Team
Feb 17, 2022 · Cloud Native

Meituan's Cloud‑Native Cluster Scheduling System: Design, Challenges, and Future Directions

Meituan’s cloud‑native cluster scheduling system, built on a customized Kubernetes engine, unifies multi‑cluster management, improves CPU utilization, reduces costs, and enhances stability by balancing throughput, complexity, and reliability while addressing large‑scale deployment, fault‑tolerance, and dynamic resource allocation challenges.

Cloud NativeCluster SchedulingKubernetes
0 likes · 21 min read
Meituan's Cloud‑Native Cluster Scheduling System: Design, Challenges, and Future Directions
TAL Education Technology
TAL Education Technology
Feb 17, 2022 · Fundamentals

Comprehensive Overview of the Web Access Process: From DNS Resolution to Kubernetes Deployment

This article explains the complete web request lifecycle—including DNS lookup, caching mechanisms, HTTP/HTTPS communication, TCP/UDP transport, gateway routing, backend service models, virtualization, Docker containerization, and Kubernetes orchestration—providing engineers with a solid foundation for diagnosing and optimizing modern web applications.

DNSDockerKubernetes
0 likes · 19 min read
Comprehensive Overview of the Web Access Process: From DNS Resolution to Kubernetes Deployment
IT Architects Alliance
IT Architects Alliance
Feb 16, 2022 · Cloud Native

Understanding Microservices: Concepts, Benefits, Drawbacks, and Their Relationship with Containers and Kubernetes

This article explains what microservices are, how they differ from traditional monolithic applications, provides an Amazon product‑listing example, discusses containers, compares the two, outlines when to adopt microservices, lists their pros and cons, and describes how Kubernetes supports cloud‑native microservice architectures.

ContainersKubernetesMicroservices
0 likes · 12 min read
Understanding Microservices: Concepts, Benefits, Drawbacks, and Their Relationship with Containers and Kubernetes
Qingyun Technology Community
Qingyun Technology Community
Feb 15, 2022 · Cloud Native

Deploy GitLab CE on KubeSphere and Build CI/CD Pipelines – A Complete Guide

This tutorial walks you through installing KubeSphere, deploying GitLab Community Edition via Helm, configuring a minimal GitLab setup, creating a sample NodeJS project, disabling Auto DevOps, writing a Jenkinsfile, and linking GitLab with KubeSphere DevOps to run both scripted and graphical CI/CD pipelines, plus optional SSH access.

GitLabJenkinsKubeSphere
0 likes · 18 min read
Deploy GitLab CE on KubeSphere and Build CI/CD Pipelines – A Complete Guide
Ops Development Stories
Ops Development Stories
Feb 15, 2022 · Cloud Native

Mastering Canary Deployments with ingress-nginx: A Step-by-Step Guide

This article explains how to perform rolling, blue‑green, and canary (gray) releases in Kubernetes, focusing on implementing canary deployments with the ingress-nginx controller using specific annotations, demonstrating weight‑based and header‑based scenarios, providing full YAML manifests, CI/CD pipeline designs, and recommendations for automated rollouts.

CI/CDCloud NativeDevOps
0 likes · 18 min read
Mastering Canary Deployments with ingress-nginx: A Step-by-Step Guide
Architects Research Society
Architects Research Society
Feb 14, 2022 · Cloud Native

Microservices After a Decade: Evolution, Anti‑Patterns, Distributed Transactions, and the Role of Kubernetes

The panel discusses how microservices have changed over the past ten years, covering evolving definitions, common anti‑patterns, the challenges of distributed transactions, Kubernetes' influence on architecture, organizational impacts, and future trends such as application‑centric models and GitOps.

Cloud NativeDevOpsKubernetes
0 likes · 25 min read
Microservices After a Decade: Evolution, Anti‑Patterns, Distributed Transactions, and the Role of Kubernetes
Alibaba Cloud Native
Alibaba Cloud Native
Feb 11, 2022 · Cloud Native

Step‑by‑Step Guide to Building Full‑Link Gray Release with MSE on Kubernetes

This article explains how to enable full‑link gray release for microservices using Alibaba Cloud MSE, covering the underlying technologies, product features, environment preparation, Kubernetes deployment manifests, lane configuration, Ingress routing, traffic verification, and monitoring, all without modifying application code.

Ingress-nginxKubernetesMSE
0 likes · 17 min read
Step‑by‑Step Guide to Building Full‑Link Gray Release with MSE on Kubernetes
High Availability Architecture
High Availability Architecture
Feb 11, 2022 · Cloud Native

Cloud‑Native Migration of Tencent Happy Game Studio Backend Using Istio Service Mesh

The article details how Tencent's Happy Game Studio transformed its large‑scale, low‑utilization backend from a legacy distributed architecture to a cloud‑native, Istio‑enabled service‑mesh platform, achieving significant resource savings, smoother deployments, and improved observability across game, CGI, and storage services.

IstioKubernetesMicroservices
0 likes · 16 min read
Cloud‑Native Migration of Tencent Happy Game Studio Backend Using Istio Service Mesh
Cloud Native Technology Community
Cloud Native Technology Community
Feb 8, 2022 · Cloud Native

Tracing the Path of Network Traffic in Kubernetes

This article provides a comprehensive guide to Kubernetes networking, covering pod network requirements, Linux network namespaces, the role of the pause container, IP allocation, veth pairs, bridge connections, inter‑pod traffic on same and different nodes, CNI plugins, and how services use iptables and conntrack for traffic routing.

CNIKubernetesLinux Namespaces
0 likes · 26 min read
Tracing the Path of Network Traffic in Kubernetes
Alibaba Cloud Native
Alibaba Cloud Native
Feb 8, 2022 · Cloud Native

How Does OCM Placement Dynamically Schedule Multi‑Cluster Workloads?

This article explains how Open Cluster Management's Placement API selects target clusters for workloads in multi‑cluster Kubernetes environments, covering ManagedClusterSet concepts, Placement spec fields, predicate filtering, prioritizer policies, dynamic scoring, and practical YAML examples that demonstrate weight adjustments and best‑practice scenarios.

Cloud NativeDynamic SchedulingKubernetes
0 likes · 22 min read
How Does OCM Placement Dynamically Schedule Multi‑Cluster Workloads?
IT Architects Alliance
IT Architects Alliance
Feb 8, 2022 · Cloud Native

What Are Microservices? Concepts, Benefits, Drawbacks, and Their Relationship with Containers and Kubernetes

This article explains the fundamentals of microservices, illustrates them with an Amazon product‑listing example, compares microservices to containers, outlines when to adopt microservices, discusses their advantages and disadvantages, and describes how Kubernetes supports microservice architectures.

ContainersDevOpsKubernetes
0 likes · 12 min read
What Are Microservices? Concepts, Benefits, Drawbacks, and Their Relationship with Containers and Kubernetes
MaGe Linux Operations
MaGe Linux Operations
Feb 7, 2022 · Cloud Native

Why K8ssandra Is Switching from Helm to Its Own Operator

The article explains how K8ssandra, an Apache Cassandra distribution for Kubernetes, evolved from using Helm charts to developing a dedicated Operator to overcome Helm's limitations, improve multi‑cluster support, and align more closely with Kubernetes best practices.

Cloud NativeGoK8ssandra
0 likes · 13 min read
Why K8ssandra Is Switching from Helm to Its Own Operator
MaGe Linux Operations
MaGe Linux Operations
Jan 30, 2022 · Cloud Native

Upgrade a Kubernetes Cluster from v1.22 to v1.23 the Hard Way

This step‑by‑step tutorial explains how to upgrade a Kubernetes cluster from version 1.22 to 1.23 using the hard‑way approach, covering prerequisites, master and worker node procedures, package handling, and verification commands to ensure a successful upgrade.

Cluster UpgradeDevOpsHard Way
0 likes · 8 min read
Upgrade a Kubernetes Cluster from v1.22 to v1.23 the Hard Way
MaGe Linux Operations
MaGe Linux Operations
Jan 28, 2022 · Cloud Native

Top 7 Kubernetes Management Tools to Simplify Cluster Operations

Discover the most popular Kubernetes management solutions—including K9s, Rancher, Dashboard, Helm, Kubespray, Lens, and WKSctl—detailing their features, deployment options, and how they streamline cluster monitoring, scaling, and security for cloud-native environments and improve operational efficiency.

Cloud NativeCluster ManagementDevOps
0 likes · 9 min read
Top 7 Kubernetes Management Tools to Simplify Cluster Operations
Baidu Tech Salon
Baidu Tech Salon
Jan 27, 2022 · Cloud Native

How China Unicom’s Service Mesh Evolved: From SDKs to Sidecars and Beyond

This article details China Unicom Software Research Institute's multi‑year journey of adopting Kubernetes‑based service mesh, outlining the evolution from SDK‑driven microservices to sidecar‑based architectures, migration strategies with Baidu, performance optimizations, observability enhancements, and future product roadmaps.

Cloud NativeIstioKubernetes
0 likes · 13 min read
How China Unicom’s Service Mesh Evolved: From SDKs to Sidecars and Beyond
MaGe Linux Operations
MaGe Linux Operations
Jan 26, 2022 · Cloud Native

Unlock Kubernetes Essentials: Pods, Services, Deployments, and Beyond

This article introduces Kubernetes—Google's open‑source container orchestration platform—detailing its core concepts such as Pods, Namespaces, Nodes, Services, Volumes, PersistentVolumes, Deployments, StatefulSets, DaemonSets, Ingress, Jobs, HPA, ServiceAccounts, Secrets, ConfigMaps, and ResourceQuotas, providing practical commands and usage notes for each component.

Cloud NativeDevOpsInfrastructure
0 likes · 18 min read
Unlock Kubernetes Essentials: Pods, Services, Deployments, and Beyond
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jan 26, 2022 · Cloud Native

2021 Technology Year-End Review: Serverless, Kubernetes, Frontend, Service Mesh, Rust, Databases, Software 2.0, and Audio/Video Trends

The 2021 year‑end review highlights the rapid rise of Serverless, the accelerated release cadence and security focus of Kubernetes, the deepening of frontend low‑code development, the practical expansion of service mesh, the growing Rust ecosystem, cloud‑native database evolution, the maturation of Software 2.0, and emerging audio‑video innovations driven by the metaverse.

Cloud NativeKubernetesRust
0 likes · 14 min read
2021 Technology Year-End Review: Serverless, Kubernetes, Frontend, Service Mesh, Rust, Databases, Software 2.0, and Audio/Video Trends
Cloud Native Technology Community
Cloud Native Technology Community
Jan 25, 2022 · Cloud Native

Technical Overview of Kube-OVN Deployment for Hybrid VM and Container Environments at ByteDance

This article details ByteDance's technical evaluation and implementation of Kube-OVN as a CNI solution for hybrid virtual‑machine and container workloads, covering selection criteria, the initial network design, identified issues, subsequent optimizations, and future roadmap considerations within a cloud‑native Kubernetes environment.

CNICloud NativeKube-OVN
0 likes · 7 min read
Technical Overview of Kube-OVN Deployment for Hybrid VM and Container Environments at ByteDance
DataFunTalk
DataFunTalk
Jan 25, 2022 · Cloud Native

Model Deployment Challenges and a Seldon‑Based Cloud‑Native Solution

This article analyzes the complexities of deploying machine‑learning models in production, outlines the limitations of the existing ABox architecture, and details a comprehensive cloud‑native redesign using Seldon on Kubernetes—including custom HDFS initializers, GPU management, logging, and resource monitoring—to streamline operations and enable unified CPU/GPU model serving.

Cloud NativeGPUKubernetes
0 likes · 12 min read
Model Deployment Challenges and a Seldon‑Based Cloud‑Native Solution
Java High-Performance Architecture
Java High-Performance Architecture
Jan 25, 2022 · Cloud Native

Why Is Debugging Microservices on Kubernetes So Hard? Proven Strategies to Overcome It

Debugging microservices in a Kubernetes environment is challenging due to the abstraction of pods, network complexities, infrastructure issues, and application-level faults, but by monitoring at the service layer, aggregating data, and applying machine‑learning‑based anomaly detection, teams can effectively identify and resolve problems.

KubernetesMicroservicesmachine learning
0 likes · 6 min read
Why Is Debugging Microservices on Kubernetes So Hard? Proven Strategies to Overcome It
Cloud Native Technology Community
Cloud Native Technology Community
Jan 24, 2022 · Cloud Native

Building a Unified Cross‑Cluster Network Plane with Kube‑OVN for TiDB Deployment

The article explains how Kube‑OVN can create a flexible, efficient, and secure cross‑Kubernetes‑cluster network plane to meet user demands such as high availability, multi‑cloud separation, disaster‑recovery, and hybrid‑cloud compliance, and demonstrates its use with TiDB, MySQL, and WordPress workloads.

Cloud NativeCross-Cluster NetworkingKube-OVN
0 likes · 8 min read
Building a Unified Cross‑Cluster Network Plane with Kube‑OVN for TiDB Deployment
Ops Development Stories
Ops Development Stories
Jan 24, 2022 · Cloud Native

Deploy and Configure vmagent on Kubernetes for Efficient Metrics

This guide explains what vmagent is, its key features, and provides step‑by‑step instructions to install, configure, and verify vmagent on a Kubernetes cluster, including namespace and RBAC setup, custom scrape configs, monitoring endpoints, and troubleshooting tips.

KubernetesMetricsVictoriaMetrics
0 likes · 15 min read
Deploy and Configure vmagent on Kubernetes for Efficient Metrics
Open Source Linux
Open Source Linux
Jan 24, 2022 · Cloud Native

From LXC to Kubernetes: The Evolution and Core Mechanics of Containers

This article traces the history of container technology from early Linux Containers (LXC) through Docker’s rise, the emergence of Kubernetes, and explains core concepts such as cgroups, namespaces, OCI standards, and how containers appear as isolated processes from both host and container perspectives.

ContainersDockerKubernetes
0 likes · 12 min read
From LXC to Kubernetes: The Evolution and Core Mechanics of Containers
MaGe Linux Operations
MaGe Linux Operations
Jan 22, 2022 · Cloud Native

Boost Kubernetes Monitoring: Migrate from Prometheus to Thanos for Scalable Low‑Cost Metrics

This article examines the limitations of a standard Prometheus‑based monitoring stack on Kubernetes, explains how adopting Thanos improves metric retention and reduces infrastructure costs, and provides a detailed multi‑cluster deployment guide with Terraform, TLS configuration, and Grafana visualization.

KubernetesPrometheusTerraform
0 likes · 16 min read
Boost Kubernetes Monitoring: Migrate from Prometheus to Thanos for Scalable Low‑Cost Metrics
Alibaba Cloud Native
Alibaba Cloud Native
Jan 22, 2022 · Cloud Native

Why OpenKruise v1.0 Is a Game-Changer for Cloud‑Native Workloads

OpenKruise v1.0, the CNCF‑hosted Kubernetes extension, expands beyond basic workloads with enhanced Workload types, in‑place upgrades, high‑availability safeguards, and advanced operational tools, drawing on Alibaba’s massive container experience and now adopted by over 35 global adopters.

Cloud NativeIn‑Place UpgradeKubernetes
0 likes · 12 min read
Why OpenKruise v1.0 Is a Game-Changer for Cloud‑Native Workloads
Qingyun Technology Community
Qingyun Technology Community
Jan 21, 2022 · Cloud Native

How to Build a Small‑Scale KubeSphere Kubernetes Cluster: A Step‑by‑Step Guide

This guide walks you through planning, deploying, and configuring a production‑grade KubeSphere‑based Kubernetes cluster for small environments, covering node layout, storage choices, middleware setup, OS and Docker preparation, HAProxy/Keepalived high‑availability, and KubeKey installation with verification steps.

DockerHAProxyKubeSphere
0 likes · 23 min read
How to Build a Small‑Scale KubeSphere Kubernetes Cluster: A Step‑by‑Step Guide
DevOps Cloud Academy
DevOps Cloud Academy
Jan 20, 2022 · Cloud Native

Comparative Overview of Terraform and Helm for Kubernetes Management

This article examines the core functionalities of Terraform and Helm, highlighting how each tool manages Kubernetes clusters, automates infrastructure provisioning, and supports DevOps workflows, helping teams decide which solution best fits their cloud‑native development needs.

Cloud NativeDevOpsKubernetes
0 likes · 8 min read
Comparative Overview of Terraform and Helm for Kubernetes Management
IT Architects Alliance
IT Architects Alliance
Jan 20, 2022 · Cloud Native

How to Build a High‑Availability Microservices System on Kubernetes – A Complete Guide

This guide walks you through designing a simple front‑back separation microservice architecture, implementing it with Java Spring Boot, deploying multiple instances with Eureka, adding Prometheus‑Grafana monitoring, logging, tracing, flow control, and finally installing Kubernetes using K8seasy and verifying high‑availability across the cluster.

Cloud NativeKubernetesMicroservices
0 likes · 19 min read
How to Build a High‑Availability Microservices System on Kubernetes – A Complete Guide
DataFunTalk
DataFunTalk
Jan 19, 2022 · Artificial Intelligence

Alluxio for AI and Machine Learning: Architecture, Optimizations, and Performance Evaluation

This article presents a comprehensive technical overview of Alluxio, covering its role as a distributed data orchestration layer for AI workloads, core features such as caching and unified namespace, performance challenges in large‑scale machine‑learning pipelines, and the extensive optimizations and testing performed at Tencent to achieve high throughput and scalability.

AIAlluxioCephFS
0 likes · 23 min read
Alluxio for AI and Machine Learning: Architecture, Optimizations, and Performance Evaluation
Alibaba Cloud Native
Alibaba Cloud Native
Jan 18, 2022 · Cloud Native

How Alibaba Cloud’s Differential SLO Boosts Kubernetes Resource Utilization

This article explains Alibaba Cloud Container Service for Kubernetes's differential SLO approach, detailing the reclaimed‑resource model, CPU burst and topology‑aware scheduling, kernel group identity, memory watermark tiering, and real‑world case studies that demonstrate significant improvements in cluster efficiency and latency‑sensitive workload performance.

ACKAlibaba CloudCPU Burst
0 likes · 16 min read
How Alibaba Cloud’s Differential SLO Boosts Kubernetes Resource Utilization
Liulishuo Tech Team
Liulishuo Tech Team
Jan 18, 2022 · Cloud Native

Design and Implementation of an ArgoCD‑Based Continuous Delivery System at Liulishuo

This article describes how Liulishuo replaced its custom CD platform with the open‑source ArgoCD solution, detailing the motivations, architecture components, design concepts, deployment workflows, code snippets, advantages, and future plans for automated gray‑release in a cloud‑native Kubernetes environment.

ArgoCDCloud NativeContinuous Delivery
0 likes · 11 min read
Design and Implementation of an ArgoCD‑Based Continuous Delivery System at Liulishuo
Youzan Coder
Youzan Coder
Jan 17, 2022 · Artificial Intelligence

Model Deployment Challenges and a Seldon‑Based Cloud‑Native Solution

The team replaced the cumbersome ABox deployment stack with Seldon‑based cloud‑native serving on Kubernetes, unifying TensorFlow and other framework models, adding GPU sharing, automated CRUD, per‑model ingress, monitoring, and log collection, achieving scalable, fault‑tolerant, zero‑downtime model deployment.

AI servingCloud NativeGPU
0 likes · 11 min read
Model Deployment Challenges and a Seldon‑Based Cloud‑Native Solution
Architect
Architect
Jan 17, 2022 · Cloud Native

Introducing NGINX Service Mesh: Features, Architecture, and Getting Started

This article introduces NGINX Service Mesh (NSM), a lightweight, highly integrated service‑mesh solution for Kubernetes that leverages NGINX Plus as a data plane, explains its security, traffic‑management, visualization, and hybrid‑deployment capabilities, outlines its architecture and components, and provides step‑by‑step commands for installation, verification, and sidecar injection.

KubernetesNGINX Service MeshService Mesh
0 likes · 10 min read
Introducing NGINX Service Mesh: Features, Architecture, and Getting Started
TAL Education Technology
TAL Education Technology
Jan 13, 2022 · Cloud Native

Offline Mixed Deployment with Kubernetes: Architecture, Implementation, and Performance Evaluation for Big Data Workloads

This article describes a cloud‑native offline mixed‑deployment solution that leverages Kubernetes to share resources between big‑data clusters and business services, outlines its implementation steps, presents detailed performance comparisons between Yarn and Kubernetes using TPC‑DS, Spark, and Terasort workloads, and discusses production experience and future plans.

Big DataCloud NativeKubernetes
0 likes · 8 min read
Offline Mixed Deployment with Kubernetes: Architecture, Implementation, and Performance Evaluation for Big Data Workloads
Top Architect
Top Architect
Jan 13, 2022 · Backend Development

Microservice Architecture Roadmap: Core Components and Recommended Tools

This article presents a comprehensive roadmap for adopting microservice architecture, explaining why it is chosen, outlining essential concerns such as Docker, container orchestration, API gateways, load balancing, service discovery, event buses, logging, monitoring, tracing, data persistence, caching, and cloud providers, and recommending popular tools for each component.

DockerKubernetesMicroservices
0 likes · 16 min read
Microservice Architecture Roadmap: Core Components and Recommended Tools
HomeTech
HomeTech
Jan 13, 2022 · Cloud Native

AutoKH: A Mixed‑Workload Resource Management Solution on Kubernetes and Hadoop

AutoKH is a cloud‑native mixed‑workload framework that integrates Kubernetes and Hadoop to dynamically schedule online and offline tasks, improve CPU and memory utilization, enforce priority classes, and ensure service stability through operators, CronHPA, and resource‑control components.

CPU ManagerHadoopKubernetes
0 likes · 19 min read
AutoKH: A Mixed‑Workload Resource Management Solution on Kubernetes and Hadoop
Alibaba Cloud Native
Alibaba Cloud Native
Jan 12, 2022 · Cloud Native

Achieving Low‑Cost Full‑Link Gray Release with Alibaba Cloud MSE: Laidian Tech’s Journey

Laidian Technology leveraged Alibaba Cloud's Microservices Engine (MSE) to implement low‑cost, non‑intrusive full‑link gray release, improving system stability, accelerating feature rollout, and showcasing the benefits of containerization, Kubernetes integration, and comprehensive service governance in a fast‑growing microservice environment.

Cloud NativeKubernetesMSE
0 likes · 15 min read
Achieving Low‑Cost Full‑Link Gray Release with Alibaba Cloud MSE: Laidian Tech’s Journey
Cloud Native Technology Community
Cloud Native Technology Community
Jan 12, 2022 · Cloud Native

Choosing Cloud‑Native Persistent Storage Solutions for Financial Container Platforms

The article examines how banks can select and implement cloud‑native persistent storage on Kubernetes‑based container platforms, covering storage types, CSI integration, DBaaS options, data consistency challenges, and strategies for high‑concurrency fault recovery in the financial sector.

DBaaSData ConsistencyFinancial Services
0 likes · 8 min read
Choosing Cloud‑Native Persistent Storage Solutions for Financial Container Platforms
Ops Development Stories
Ops Development Stories
Jan 12, 2022 · Cloud Native

Deploy Kubernetes Inside Kubernetes: Mastering Kubefarm and PXE Boot

This article explains how to use Helm to deploy a full Kubernetes cluster within another Kubernetes cluster, detailing the Kubefarm project, PXE‑bootable server farms, Kubernetes‑in‑Kubernetes architecture, node provisioning via PXE, certificate management with cert‑manager, and advanced networking and webhook integration.

Cloud NativeInfrastructure as CodeKubefarm
0 likes · 17 min read
Deploy Kubernetes Inside Kubernetes: Mastering Kubefarm and PXE Boot
Java High-Performance Architecture
Java High-Performance Architecture
Jan 12, 2022 · Cloud Native

Mastering Service Mesh with Istio: A Hands‑On Guide to Traffic, Security, and Observability

This tutorial explains the fundamentals of service mesh, explores Istio’s architecture and core components, and provides step‑by‑step instructions for installing Istio on Kubernetes, deploying a sample microservice application, and leveraging traffic management, mutual TLS, observability, and advanced use cases such as routing, circuit breaking, and JWT‑based access control.

IstioKubernetesService Mesh
0 likes · 22 min read
Mastering Service Mesh with Istio: A Hands‑On Guide to Traffic, Security, and Observability
DevOps Cloud Academy
DevOps Cloud Academy
Jan 12, 2022 · Cloud Native

Deploying Apache APISIX as a Kubernetes Ingress Controller with Helm

This article introduces Apache APISIX, a high‑performance API gateway built on OpenResty and etcd, explains its architecture, demonstrates how to install it as a Kubernetes Ingress controller using Helm charts, and shows configuration, routing, and dashboard usage with example manifests and commands.

APISIXCloud NativeIngress
0 likes · 11 min read
Deploying Apache APISIX as a Kubernetes Ingress Controller with Helm
ITPUB
ITPUB
Jan 11, 2022 · Cloud Native

How Ctrip Achieved High Availability for Service Mesh Across Multiple Data Centers

Facing large‑scale production demands, Ctrip’s Cloud Container team designed a multi‑data‑center Service Mesh architecture that isolates failures, employs dual‑active IDC, multi‑cluster deployments, and optimized xDS push mechanisms, ensuring high availability, rapid fault recovery, and efficient rollout across hundreds of services.

Cloud NativeIstioKubernetes
0 likes · 18 min read
How Ctrip Achieved High Availability for Service Mesh Across Multiple Data Centers
Yiche Technology
Yiche Technology
Jan 11, 2022 · Databases

Elasticsearch Overview, Comparison, Maintenance Challenges, Deployment Strategies, and Automation Management Platform

This document provides a comprehensive technical overview of Elasticsearch, comparing it with Solr and ClickHouse, detailing common operational pain points and configuration solutions, describing containerized and ECK deployments, and outlining a company‑wide automation platform for cluster provisioning, monitoring, index and security management, with future directions for lifecycle and backup strategies.

Cluster ManagementKubernetesautomation
0 likes · 31 min read
Elasticsearch Overview, Comparison, Maintenance Challenges, Deployment Strategies, and Automation Management Platform
Open Source Linux
Open Source Linux
Jan 11, 2022 · Cloud Native

Why Bare‑Metal Kubernetes Beats Virtual Machines: 6 Compelling Advantages

This article explains why deploying Kubernetes on bare‑metal servers offers six key benefits—simpler networking, better cost efficiency for demanding workloads, superior performance, stronger security, workload‑specific optimization, and freedom from vendor lock‑in—making it a smarter choice than virtual‑machine‑based solutions.

Bare MetalCloud NativeKubernetes
0 likes · 7 min read
Why Bare‑Metal Kubernetes Beats Virtual Machines: 6 Compelling Advantages
Laravel Tech Community
Laravel Tech Community
Jan 11, 2022 · Cloud Native

Alibaba Sentinel: Open‑Source Cloud‑Native High‑Availability Protection for Distributed Services (Sentinel Go v1.0.4 Release)

The article introduces Alibaba's open‑source Sentinel, a cloud‑native high‑availability protection component for distributed microservices that offers flow control, circuit breaking, concurrency limiting and adaptive overload protection, and highlights the new Sentinel Go v1.0.4 features such as Prometheus metrics, enhanced circuit‑breaker recovery, Apollo data source, and updated Kubernetes CRD support for version 1.22+.

Circuit BreakingCloud NativeFlow Control
0 likes · 2 min read
Alibaba Sentinel: Open‑Source Cloud‑Native High‑Availability Protection for Distributed Services (Sentinel Go v1.0.4 Release)
58 Tech
58 Tech
Jan 10, 2022 · Artificial Intelligence

Resource Utilization Optimization Practices for the 58.com Machine Learning Platform (WPAI)

This article details the 58.com WPAI machine learning platform's architecture and the optimizations applied to training task scheduling, inference service elastic scaling, and offline‑online resource mixing, demonstrating how these techniques significantly improve GPU/CPU utilization and inference performance across both GPU and CPU environments.

AIInference AccelerationKubernetes
0 likes · 27 min read
Resource Utilization Optimization Practices for the 58.com Machine Learning Platform (WPAI)
DevOps Cloud Academy
DevOps Cloud Academy
Jan 7, 2022 · Cloud Native

Understanding Containerization in DevOps: Benefits, Pipelines, and Orchestration

This article explains how containerization integrates with DevOps, detailing the advantages of container‑based delivery pipelines, the differences between traditional and containerized workflows, and the role of orchestration platforms like Kubernetes in creating efficient, portable, and scalable software deployments.

Cloud NativeContainerizationDevOps
0 likes · 10 min read
Understanding Containerization in DevOps: Benefits, Pipelines, and Orchestration
Open Source Linux
Open Source Linux
Jan 6, 2022 · Cloud Native

Choosing the Right CNI Plugin for Kubernetes: A Practical Guide

This article explains what the Container Network Interface (CNI) is, how Kubernetes uses CNI plugins, and provides a detailed comparison of Overlay, Routing, and Underlay plugins based on environment constraints, functional needs, and performance considerations.

CNICloud NativeKubernetes
0 likes · 8 min read
Choosing the Right CNI Plugin for Kubernetes: A Practical Guide
DevOps Cloud Academy
DevOps Cloud Academy
Jan 5, 2022 · Operations

Top DevOps Trends Shaping the Future (2022 and Beyond)

This article outlines the major DevOps trends—including micro‑service architecture, serverless computing, low‑code platforms, Kubernetes evolution, DevSecOps, and AI integration—that are expected to drive faster delivery, higher quality, and greater agility in software development and operations.

AIDevOpsDevSecOps
0 likes · 6 min read
Top DevOps Trends Shaping the Future (2022 and Beyond)
Architect
Architect
Jan 5, 2022 · Cloud Native

Introduction to Service Mesh and Istio: Concepts, Architecture, and Hands‑On Guide

This tutorial explains the fundamentals of service mesh, outlines Istio’s architecture and core components, demonstrates how to install Istio on Kubernetes, and walks through practical examples such as traffic routing, security policies, observability, and common use‑cases, while also comparing alternative solutions.

IstioKubernetesMicroservices
0 likes · 20 min read
Introduction to Service Mesh and Istio: Concepts, Architecture, and Hands‑On Guide
Baidu Geek Talk
Baidu Geek Talk
Jan 5, 2022 · Cloud Native

Baidu Cloud‑Native Mixed Workload (Offline Co‑location) Technology Overview

Baidu’s mixed‑workload approach co‑locates offline batch jobs with latency‑sensitive online services on shared nodes, using a dynamic resource view, priority‑based scheduling, cpuset/NUMA isolation, eBPF policies, and predictive profiling, boosting CPU utilization above 40 % and saving billions of RMB in total cost of ownership.

KubernetesMixed Workloadcloud-native
0 likes · 17 min read
Baidu Cloud‑Native Mixed Workload (Offline Co‑location) Technology Overview
Architects' Tech Alliance
Architects' Tech Alliance
Jan 5, 2022 · Backend Development

Essential Microservice Architecture Roadmap: Tools, Patterns, and Best Practices

This guide outlines why microservice architecture is preferred for large applications, presents a clear learning roadmap, and details each critical concern—such as Docker, orchestration, API gateways, load balancing, service discovery, event buses, logging, monitoring, tracing, persistence, caching, and cloud providers—along with recommended tools.

Backend ArchitectureCloud NativeDocker
0 likes · 14 min read
Essential Microservice Architecture Roadmap: Tools, Patterns, and Best Practices
21CTO
21CTO
Jan 4, 2022 · Cloud Computing

Why Kubernetes Adoption Stalls and Proven Strategies to Simplify It

This article examines why many enterprises struggle to adopt containers and Kubernetes, outlines the four main sources of complexity, and presents a range of open‑source tools, managed services, serverless options, and practical guidance to help teams choose the most suitable solution.

Cloud NativeContainersDevOps
0 likes · 16 min read
Why Kubernetes Adoption Stalls and Proven Strategies to Simplify It
MaGe Linux Operations
MaGe Linux Operations
Jan 3, 2022 · Cloud Native

Why Cloud Native Matters: From Docker to Serverless Explained

This article introduces cloud native concepts, explains the evolution from Docker containers to Kubernetes orchestration and serverless services, and surveys major industry offerings, helping readers grasp the fundamentals and practical implications of modern cloud-native architectures.

CNCFCloud NativeDocker
0 likes · 10 min read
Why Cloud Native Matters: From Docker to Serverless Explained
Architects Research Society
Architects Research Society
Jan 1, 2022 · Cloud Native

Running Kubernetes Across Multiple Failure Zones

This article explains how Kubernetes clusters can be deployed across multiple failure zones and regions, detailing control plane replication, node labeling, pod topology constraints, storage zone awareness, network considerations, and disaster recovery strategies to achieve high availability in cloud‑native environments.

Cloud NativeCluster DesignKubernetes
0 likes · 8 min read
Running Kubernetes Across Multiple Failure Zones
DataFunTalk
DataFunTalk
Jan 1, 2022 · Big Data

JD's Flink Journey: Evolution, Optimizations, and Future Directions

This article details JD's adoption of Flink for real‑time computing, covering its evolution from Storm to Flink on Kubernetes, the platform architecture, major optimization techniques such as preview topology, backpressure handling, dynamic rebalance, checkpoint‑as‑savepoint, and outlines future plans including stream‑batch integration, stability improvements, intelligent operations, and AI integration.

Big DataFlinkJD
0 likes · 10 min read
JD's Flink Journey: Evolution, Optimizations, and Future Directions
Baidu Tech Salon
Baidu Tech Salon
Dec 31, 2021 · Industry Insights

How Baidu Boosted CPU Utilization by Up to 80% with Offline Mixed‑Tenant Scheduling

This article analyzes Baidu's offline mixed‑tenant technology that combines online and offline workloads on the same physical servers, detailing the resource‑usage problems, dynamic resource views, priority schemes, isolation mechanisms, high‑performance scheduling, and future directions for cloud‑native clusters.

Cloud NativeKubernetescpu-utilization
0 likes · 18 min read
How Baidu Boosted CPU Utilization by Up to 80% with Offline Mixed‑Tenant Scheduling
ITPUB
ITPUB
Dec 31, 2021 · Cloud Native

How We Slashed Istio xDS Latency from Minutes to Seconds at Scale

This article details the challenges Ctrip faced with Istio control‑plane performance at massive scale and explains the systematic methodology, concrete optimizations—including O(n²) to O(n) patching, Merkle‑tree based success‑rate metrics, and startup improvements—that reduced xDS push latency from minutes to seconds while enhancing reliability.

Cloud NativeIstioKubernetes
0 likes · 14 min read
How We Slashed Istio xDS Latency from Minutes to Seconds at Scale
Architecture Digest
Architecture Digest
Dec 31, 2021 · Backend Development

Why I Chose Microservice Architecture and a Roadmap of Its Core Components

This article explains why microservice architecture is preferred over monolithic applications, outlines a learning roadmap, and details essential components such as Docker, container orchestration, API gateways, load balancing, service discovery, event buses, logging, monitoring, distributed tracing, data persistence, caching, and cloud providers.

Backend ArchitectureDockerKubernetes
0 likes · 13 min read
Why I Chose Microservice Architecture and a Roadmap of Its Core Components
Alibaba Cloud Native
Alibaba Cloud Native
Dec 30, 2021 · Operations

How to Implement Chaos Engineering for Cloud‑Native Applications: A Step‑by‑Step Guide

This article explains how cloud‑native teams can adopt chaos engineering—defining its concepts, outlining its unique characteristics, and detailing a four‑stage implementation process from manual drills to production‑level raids, with practical steps, environment setups, and real‑world results.

Cloud NativeFault InjectionKubernetes
0 likes · 14 min read
How to Implement Chaos Engineering for Cloud‑Native Applications: A Step‑by‑Step Guide
G7 EasyFlow Tech Circle
G7 EasyFlow Tech Circle
Dec 30, 2021 · Cloud Native

Why Kubernetes OOM Kills Use WSS, Not RSS – Diagnose & Fix Container Memory

After moving IoT services to Kubernetes, containers were OOM‑killed despite RSS staying below limits because Kubernetes bases OOM decisions on the Working Set Size (WSS) metric, which includes file cache, and the article explains its calculation, reproduces the issue, and offers practical mitigation strategies.

Cache ManagementContainer MemoryKernel Parameters
0 likes · 12 min read
Why Kubernetes OOM Kills Use WSS, Not RSS – Diagnose & Fix Container Memory
Ctrip Technology
Ctrip Technology
Dec 30, 2021 · Cloud Computing

Ctrip’s Practice of Using AWS Spot Instances for Cost Reduction and High Availability

This article details Ctrip’s large‑scale use of AWS Spot instances on Kubernetes, explaining the cost benefits, the challenges of spot interruptions, and the architectural and operational strategies—including multi‑AZ deployment, scheduling policies, autoscaling group design, and observability—that enable a 50% reduction in container costs while maintaining system stability and reliability.

AWS SpotCost OptimizationKubernetes
0 likes · 13 min read
Ctrip’s Practice of Using AWS Spot Instances for Cost Reduction and High Availability