Tagged articles
4047 articles
Page 10 of 41
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 30, 2024 · Cloud Computing

Using Alibaba Cloud ACK One Registration Cluster for Elastic Hybrid Cloud Deployment

This guide explains how enterprises can overcome IDC data‑center capacity limits by leveraging Alibaba Cloud ACK One registration clusters to achieve flexible, cost‑effective elastic scaling, detailing architecture, registration steps, node‑pool creation, virtual nodes, multi‑level scheduling, and associated command‑line examples.

ACKCloud NativeKubernetes
0 likes · 10 min read
Using Alibaba Cloud ACK One Registration Cluster for Elastic Hybrid Cloud Deployment
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 30, 2024 · Cloud Native

Best Practices for High Availability and Stability in Alibaba Cloud Container Service for Kubernetes (ACK)

This article presents a comprehensive overview of high‑availability design patterns and best‑practice recommendations for Alibaba Cloud Container Service for Kubernetes (ACK), covering common error scenarios, single‑cluster and multi‑cluster architectures, workload resilience, monitoring, and real‑world case studies.

ACKCloud NativeKubernetes
0 likes · 13 min read
Best Practices for High Availability and Stability in Alibaba Cloud Container Service for Kubernetes (ACK)
Soul Technical Team
Soul Technical Team
Sep 30, 2024 · Cloud Native

Practical Experience with Envoy in Soul: Cloud‑Native Traffic Management and Service Mesh

This article shares Soul's two‑year practice of using the cloud‑native Envoy proxy for high‑performance, high‑throughput, and highly available traffic management across north‑south and east‑west flows, covering architecture, dynamic service discovery, load balancing, health checks, WASM extensions, service‑mesh integration, Redis proxying, and future directions.

Cloud NativeEnvoyKubernetes
0 likes · 14 min read
Practical Experience with Envoy in Soul: Cloud‑Native Traffic Management and Service Mesh
Alibaba Cloud Native
Alibaba Cloud Native
Sep 30, 2024 · Cloud Native

Unlock Elastic Scaling: Connect Your IDC Kubernetes to Alibaba Cloud ACK One Registration Cluster

This guide explains how enterprises can overcome IDC data‑center capacity limits by registering on‑premises Kubernetes clusters to Alibaba Cloud ACK One, detailing architecture, resource‑elastic benefits, step‑by‑step integration, node‑pool creation, GPU script setup, virtual node deployment, and multi‑level scheduling with concrete commands.

ACK OneAlibaba CloudKubernetes
0 likes · 11 min read
Unlock Elastic Scaling: Connect Your IDC Kubernetes to Alibaba Cloud ACK One Registration Cluster
System Architect Go
System Architect Go
Sep 29, 2024 · Cloud Native

Understanding CSI Driver Workflow with an NFS Example

This article explains the architecture, components, and operational flow of a CSI driver using NFS as a concrete example, covering CSI fundamentals, sidecar containers, dynamic volume provisioning, pod creation, and the role of the Linux VFS in exposing remote storage to applications.

CSIKubernetesNFS
0 likes · 6 min read
Understanding CSI Driver Workflow with an NFS Example
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 29, 2024 · Cloud Native

Building a Production‑Grade Observability System for Alibaba Cloud ACK Container Service

The presentation outlines Alibaba Cloud's ACK container service observability framework, covering its architecture, key capabilities such as eBPF‑based tracing, GPU profiling, network diagnostics, storage monitoring, and FinOps integration, and demonstrates how these features support AI workloads, large‑scale production stability, and automated incident response.

AICloud NativeContainer Service
0 likes · 15 min read
Building a Production‑Grade Observability System for Alibaba Cloud ACK Container Service
Linux Ops Smart Journey
Linux Ops Smart Journey
Sep 29, 2024 · Cloud Native

How to Install and Use the kubectl ingress-nginx Plugin Offline with Krew

This guide walks you through installing Krew, the kubectl plugin manager, then shows step‑by‑step offline installation of the ingress‑nginx plugin and demonstrates key commands for inspecting backends, certificates, and generated Nginx configurations, empowering efficient Kubernetes cloud‑native operations.

Cloud NativeIngress-NginxKubernetes
0 likes · 9 min read
How to Install and Use the kubectl ingress-nginx Plugin Offline with Krew
Soul Technical Team
Soul Technical Team
Sep 29, 2024 · Cloud Computing

Soul's Container Cluster Cost Governance: A Case Study on Resource Optimization

Soul's container cluster cost governance case study details their approach to optimizing resource utilization through Kubernetes-based solutions, addressing challenges like resource fragmentation and implementing strategies such as SNAS for elastic scaling and HPA+CronHPA coordination to achieve significant cost reductions.

Cost OptimizationKubernetesResource Management
0 likes · 16 min read
Soul's Container Cluster Cost Governance: A Case Study on Resource Optimization
Infra Learning Club
Infra Learning Club
Sep 29, 2024 · Cloud Native

Current State of Kubernetes DRA and the New Architecture with ResourceClaimParameters and ResourceSlice

The article examines the scheduling performance and tight coupling issues of Kubernetes DRA before version 1.30, explains the original workflow involving PodSchedulingContext and DRA driver, and then details the latest design that introduces ResourceClaimParameters and ResourceSlice to let the scheduler handle complex device constraints internally.

Cloud NativeDRAKubernetes
0 likes · 4 min read
Current State of Kubernetes DRA and the New Architecture with ResourceClaimParameters and ResourceSlice
Infra Learning Club
Infra Learning Club
Sep 28, 2024 · Cloud Native

Effective Ways to Debug Kubernetes Components

Debugging Kubernetes components is challenging due to version mismatches, service dependencies, and IDE limitations; this guide outlines two practical methods—log‑based analysis with redeployed services and direct IDE debugging—while detailing environment setup, local and remote development workflows, and testing procedures.

DockerIDEKubernetes
0 likes · 9 min read
Effective Ways to Debug Kubernetes Components
Architecture and Beyond
Architecture and Beyond
Sep 28, 2024 · Cloud Native

Core Concepts of Docker and Kubernetes for Architects

This article explains the fundamental principles of Docker containerization and Kubernetes orchestration, covering containerization, image layering, core components, declarative APIs, Pods, Services, Namespaces, ConfigMaps, Secrets, Volumes, and their combined advantages and challenges for enterprise‑level system design.

Cloud NativeDockerKubernetes
0 likes · 30 min read
Core Concepts of Docker and Kubernetes for Architects
Infra Learning Club
Infra Learning Club
Sep 27, 2024 · Cloud Native

Inside Kubelet: How Pod Admission Works

This article dissects Kubelet's Pod admission pipeline, explaining how syncLoopIteration gathers pod data, how HandlePodAdditions invokes canAdmitPod, and how six registered admit handlers—Eviction, System Allowlist, Resource Allocation, Predicate, AppArmor, and Shutdown—evaluate each pod with concrete code examples and decision logic.

Admission HandlersGoKubernetes
0 likes · 14 min read
Inside Kubelet: How Pod Admission Works
Linux Ops Smart Journey
Linux Ops Smart Journey
Sep 27, 2024 · Cloud Native

How to Install and Configure cert-manager on Kubernetes with Helm

Learn step‑by‑step how to install Jetstack’s open‑source cert-manager on a Kubernetes cluster using Helm, from prerequisites and chart download to configuring TLS settings and deploying the service, ensuring secure inter‑service communication in cloud‑native environments.

Cloud NativeKubernetesTLS
0 likes · 6 min read
How to Install and Configure cert-manager on Kubernetes with Helm
System Architect Go
System Architect Go
Sep 26, 2024 · Cloud Native

Introduction to OCI: Image, Distribution, and Runtime Specifications in the Code‑to‑Container Workflow

This article explains the Open Container Initiative (OCI) standards—Image-spec, Distribution-spec, and Runtime-spec—detailing how code is built into OCI images, pushed to registries, and executed via Kubernetes components such as kubelet, containerd/CRI‑O, and runc or kata‑containers, ensuring portable, interoperable containers.

ContainerKubernetesOCI
0 likes · 10 min read
Introduction to OCI: Image, Distribution, and Runtime Specifications in the Code‑to‑Container Workflow
DevOps Operations Practice
DevOps Operations Practice
Sep 24, 2024 · Cloud Native

How to Install MiniKube on RHEL 8

This guide provides a step‑by‑step tutorial for installing MiniKube on RHEL 8, covering system updates, required dependencies, Docker and kubectl installation, MiniKube binary setup, cluster startup, verification, and optional add‑ons, ensuring a functional local Kubernetes environment.

DockerInstallationKubernetes
0 likes · 6 min read
How to Install MiniKube on RHEL 8
Huolala Tech
Huolala Tech
Sep 24, 2024 · Cloud Native

How We Revamped a Large‑Scale API Automation Platform with Kubernetes and Tekton

This article details the evolution of a high‑traffic API automation testing platform, covering challenges such as multi‑environment isolation, execution speed, quality assessment, and stability, and explains how the team leveraged Kubernetes, Rancher, Tekton, precise testing, and modern reporting to dramatically improve efficiency and reliability.

API testingKubernetesci/cd
0 likes · 32 min read
How We Revamped a Large‑Scale API Automation Platform with Kubernetes and Tekton
Infra Learning Club
Infra Learning Club
Sep 23, 2024 · Cloud Native

How to Extend the Kubernetes Scheduler: Four Approaches

The article outlines four ways to extend the Kubernetes scheduler—Scheduler Extender, Scheduler Framework, deploying multiple schedulers, and a WebAssembly‑based plugin model—detailing their mechanisms, configuration steps, advantages, and drawbacks.

KubernetesMulti SchedulerScheduler
0 likes · 5 min read
How to Extend the Kubernetes Scheduler: Four Approaches
Selected Java Interview Questions
Selected Java Interview Questions
Sep 23, 2024 · Operations

Comprehensive Guide to Using Apollo Configuration Center with Spring Boot and Kubernetes

This article provides a step‑by‑step tutorial on Apollo’s concepts, features, model, namespace handling, client design, high‑availability considerations, project creation, configuration management, Spring Boot client implementation, testing across environments, clusters and namespaces, and deployment on Kubernetes with Docker.

ApolloConfiguration ManagementKubernetes
0 likes · 25 min read
Comprehensive Guide to Using Apollo Configuration Center with Spring Boot and Kubernetes
Architecture and Beyond
Architecture and Beyond
Sep 21, 2024 · Cloud Native

Understanding Cloud Native Architecture: Definitions, Evolution, Core Technologies, Maturity Models, and Challenges

Cloud native architecture, a modern design paradigm leveraging containers, microservices, CI/CD, and infrastructure-as-code, enables flexible, scalable, and automated applications; this article explores its definition, evolution, core technologies, maturity models, benefits, challenges, and best practices for enterprises transitioning from traditional systems.

Cloud NativeContainersDevOps
0 likes · 37 min read
Understanding Cloud Native Architecture: Definitions, Evolution, Core Technologies, Maturity Models, and Challenges
Linux Ops Smart Journey
Linux Ops Smart Journey
Sep 20, 2024 · Cloud Native

How to Enable CORS in Kubernetes Ingress Nginx: Step‑by‑Step Guide

This tutorial explains why CORS is needed for modern web applications, outlines common scenarios such as front‑end/back‑end separation and third‑party API integration, and provides a complete Ingress Nginx configuration with code examples to enable cross‑origin requests in a Kubernetes cluster.

CORSCloud NativeKubernetes
0 likes · 5 min read
How to Enable CORS in Kubernetes Ingress Nginx: Step‑by‑Step Guide
MaGe Linux Operations
MaGe Linux Operations
Sep 20, 2024 · Cloud Native

Master Kubernetes: From Pods to Deployments and Seamless Scaling

This comprehensive guide walks you through the origins, architecture, core components, and practical workflows of Kubernetes, explaining how Pods and Deployments work together, how the control plane orchestrates resources, and how to create, manage, and troubleshoot containerized applications at scale.

Cloud NativeDeploymentKubernetes
0 likes · 44 min read
Master Kubernetes: From Pods to Deployments and Seamless Scaling
Ops Development Stories
Ops Development Stories
Sep 19, 2024 · Artificial Intelligence

How to Connect Qwen LLMs with Higress AI Gateway: A Hands‑On Guide

This tutorial walks through setting up a local k3d cluster, installing Higress, and using its AI plugins—including AI Proxy, AI JSON formatter, AI Agent, and AI Statistics—to integrate and observe Alibaba Cloud's Qwen large language models across various use cases such as weather and flight queries.

AI gatewayAI pluginsHigress
0 likes · 30 min read
How to Connect Qwen LLMs with Higress AI Gateway: A Hands‑On Guide
System Architect Go
System Architect Go
Sep 18, 2024 · Cloud Native

Understanding DNS and Its Configuration in Kubernetes

This article explains the fundamentals of DNS, its role as an application‑layer protocol, and how Kubernetes generates and manages each pod's /etc/resolv.conf through kubelet, covering nameserver, search, ndots options, dnsPolicy choices, and custom dnsConfig examples.

CoreDNSDNSKubernetes
0 likes · 6 min read
Understanding DNS and Its Configuration in Kubernetes
DevOps Operations Practice
DevOps Operations Practice
Sep 17, 2024 · Operations

Comparison of ELK, EFK, and PLG Log Systems

This article introduces three major log management stacks—ELK, EFK, and PLG (Promtail + Loki + Grafana)—explaining their components, deployment scenarios, and provides a detailed comparison of their storage, indexing, and visualization approaches, especially in Kubernetes environments.

EFKELKGrafana
0 likes · 5 min read
Comparison of ELK, EFK, and PLG Log Systems
Infra Learning Club
Infra Learning Club
Sep 16, 2024 · Cloud Native

Survey of GPU Sharing and Virtualization Solutions for Kubernetes

The article surveys open‑source GPU sharing and virtualization approaches for AI workloads, comparing soft isolation, CUDA‑level isolation, NVIDIA MPS, driver‑level isolation, GPU pooling and deep‑learning memory sharing, and highlights their architectures, isolation guarantees, and performance trade‑offs.

Device PluginGPUKubernetes
0 likes · 5 min read
Survey of GPU Sharing and Virtualization Solutions for Kubernetes
21CTO
21CTO
Sep 14, 2024 · Artificial Intelligence

How Kubernetes Powers AI at the Edge: Overcoming Real-World Challenges

This article explores how Kubernetes enables AI workloads at the edge, detailing the challenges of edge environments, the need for sensors and data processing, hybrid VM‑container solutions, and future AI use cases that drive local, real‑time insights.

Artificial IntelligenceContainersEdge Computing
0 likes · 7 min read
How Kubernetes Powers AI at the Edge: Overcoming Real-World Challenges
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 13, 2024 · Industry Insights

Why Distributed Cloud‑Native Is the Next Enterprise Cloud Choice – Expert Insights

In an interview, Alibaba Cloud’s distributed cloud‑native platform lead explains how distributed cloud‑native addresses elasticity, high‑availability, and multi‑cluster management challenges, outlines the evolution of ACK One, and forecasts its role in AI and edge computing for modern enterprises.

ACK OneAI workloadsEdge Computing
0 likes · 11 min read
Why Distributed Cloud‑Native Is the Next Enterprise Cloud Choice – Expert Insights
Infra Learning Club
Infra Learning Club
Sep 11, 2024 · Cloud Native

A Bottom‑Up Learning Path for Containers

The article outlines a comprehensive, bottom‑up learning path for containers, covering Linux container fundamentals, images, runtimes, managers, orchestrators, and non‑Linux implementations, while clarifying common misconceptions, explaining how runtimes like runc work, and comparing Docker, containerd, and Kubernetes.

Cloud NativeContainersDocker
0 likes · 14 min read
A Bottom‑Up Learning Path for Containers
MaGe Linux Operations
MaGe Linux Operations
Sep 10, 2024 · Cloud Native

How to Connect Kubernetes to Ceph for Dynamic PVC Provisioning

This guide walks through integrating a Ceph cluster with a Kubernetes cluster to use Ceph as backend storage, covering prerequisites, pool and secret creation, StorageClass definition, PVC and pod deployment, and detailed troubleshooting steps for common errors.

CephCloud NativeDynamic PVC
0 likes · 10 min read
How to Connect Kubernetes to Ceph for Dynamic PVC Provisioning
Selected Java Interview Questions
Selected Java Interview Questions
Sep 10, 2024 · Backend Development

Graceful Shutdown in Spring Boot: Mechanisms, Code Samples, and Configuration

This article explains how Spring Boot handles graceful shutdown in Kubernetes, manual Actuator shutdown, SIGTERM handling, Tomcat graceful termination, and Logback log flushing, providing detailed code examples, configuration snippets, and the underlying shutdown hook mechanisms that ensure resources are released cleanly.

GracefulShutdownKubernetesSpringBoot
0 likes · 18 min read
Graceful Shutdown in Spring Boot: Mechanisms, Code Samples, and Configuration
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Sep 9, 2024 · Cloud Native

Applying eBPF for Cloud‑Native Observability and Continuous Profiling

By deploying eBPF agents as DaemonSets that hook kernel network and performance events, the Xiaohongshu observability team extended cloud‑native monitoring from the application to the kernel, delivering real‑time traffic analysis and low‑overhead continuous profiling for C++ services, aggregating data into centralized collectors for dashboards, flame‑graphs, and rapid root‑cause diagnosis.

KubernetesPerformance MonitoringProfiling
0 likes · 37 min read
Applying eBPF for Cloud‑Native Observability and Continuous Profiling
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Sep 9, 2024 · Cloud Native

Optimizing Network and Storage for 5,000‑Node Kubernetes Clusters

This guide outlines practical strategies for designing and optimizing network and storage in Kubernetes clusters of over 5,000 nodes, covering overlay networks, IP pool segmentation, bandwidth allocation, load balancing, security policies, distributed storage options, performance tuning, and reliable backup solutions.

Cloud NativeIP PoolKubernetes
0 likes · 9 min read
Optimizing Network and Storage for 5,000‑Node Kubernetes Clusters
System Architect Go
System Architect Go
Sep 7, 2024 · Cloud Native

How Kubelet, CRI, and CNI Collaborate to Launch a New Pod

When a new Pod is created, Kubelet coordinates with the CRI and CNI components to set up the sandbox, configure networking, pull images, create and start containers, using gRPC calls and command‑line interactions, with details varying across container runtimes such as containerd, cri‑o, and Docker.

CNICRICloud Native
0 likes · 5 min read
How Kubelet, CRI, and CNI Collaborate to Launch a New Pod
Infra Learning Club
Infra Learning Club
Sep 5, 2024 · Cloud Native

Deep Dive into Kubelet’s DeviceManager Source Code

This article explains how Kubernetes uses the device‑plugin framework to extend resources beyond CPU and memory, details the kubelet registration and allocation workflow, and walks through the relevant source code in pkg/kubelet/cm/devicemanager that builds the OCI spec.

CDIDRADevice Plugin
0 likes · 5 min read
Deep Dive into Kubelet’s DeviceManager Source Code
DevOps Operations Practice
DevOps Operations Practice
Sep 4, 2024 · Cloud Native

An Overview of Talos Linux: A Minimalist, Immutable OS for Kubernetes

Talos Linux is a minimalist, immutable operating system built specifically for Kubernetes, offering a reduced attack surface, read‑only filesystem, and tight integration that simplifies deployment, management, and security for production clusters, edge computing, and cloud‑native development environments.

Immutable OSKubernetesTalos Linux
0 likes · 4 min read
An Overview of Talos Linux: A Minimalist, Immutable OS for Kubernetes
Xiaolei Talks DB
Xiaolei Talks DB
Sep 4, 2024 · Databases

Master Online TiDB Migration: Step‑by‑Step Guide for Cross‑Data‑Center Moves

This guide details three online TiDB migration scenarios—including placement‑rule replica placement, TiCDC with BR backup, and hybrid strategies—covering preparation, network and resource requirements, configuration commands, region balancing, PD leader transfer, and post‑migration cleanup for seamless cross‑data‑center database relocation.

BR BackupKubernetesPlacement Rules
0 likes · 19 min read
Master Online TiDB Migration: Step‑by‑Step Guide for Cross‑Data‑Center Moves
Cloud Native Technology Community
Cloud Native Technology Community
Sep 4, 2024 · Cloud Native

Debugging Kubernetes Container Networks with nsenter and Temporary Pods

This article explains how to use the nsenter tool to enter a container's network namespace for packet capture and other host‑level debugging, provides scripts for logging into Kubernetes nodes, shows a DaemonSet deployment for persistent access, and compares nsenter with the newer kubectl debug temporary container approach.

Cloud NativeDaemonSetKubernetes
0 likes · 5 min read
Debugging Kubernetes Container Networks with nsenter and Temporary Pods
macrozheng
macrozheng
Sep 4, 2024 · Cloud Native

Upgrade Mall‑Swarm to Spring Boot 3 & Spring Cloud 2023: A Complete Guide

This article walks through upgrading the Mall‑Swarm microservice e‑commerce project to Spring Boot 3, Spring Cloud 2023, JDK 17, and related dependencies, covering architecture, version changes, permission solution migration, documentation updates, Elasticsearch usage, and deployment on Windows and Linux.

DockerKubernetesMicroservices
0 likes · 13 min read
Upgrade Mall‑Swarm to Spring Boot 3 & Spring Cloud 2023: A Complete Guide
Infra Learning Club
Infra Learning Club
Sep 3, 2024 · Cloud Native

How Kubelet’s VolumeManager Orchestrates Async Volume Attach, Mount, and Unmount

The article dissects Kubelet’s VolumeManager, detailing its asynchronous loops, the VolumeManager interface, how it is started from Kubelet.Run, the handling of Attach/Mount and Unmount operations during pod sync, the internal struct fields, and the plugin initialization process that together manage the full lifecycle of pod volumes.

GoKubernetesPod Lifecycle
0 likes · 10 min read
How Kubelet’s VolumeManager Orchestrates Async Volume Attach, Mount, and Unmount
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 2, 2024 · Cloud Native

How Lilith Games Used Cloud‑Native Architecture to Transform AFK Journey

This article examines Lilith Games' cloud‑native migration of the new title AFK Journey, detailing the motivations, technical challenges of containerizing stateful game servers, the adoption of OpenKruise for in‑place updates, and the measurable improvements in resource utilization, release speed, and operational costs.

Cloud NativeDevOpsGame Development
0 likes · 8 min read
How Lilith Games Used Cloud‑Native Architecture to Transform AFK Journey
Alibaba Cloud Native
Alibaba Cloud Native
Sep 1, 2024 · Cloud Native

What’s New in OpenKruise v1.7? K8s 1.28 Upgrade, SidecarSet, Advanced StatefulSet, and More

OpenKruise v1.7, released in August 2024, upgrades its Kubernetes dependency to 1.28 and introduces features such as native Sidecar container support in SidecarSet, start ordinal configuration for Advanced StatefulSet, image‑preheat credential plugins, external CA injection, and structured logging, while maintaining compatibility with clusters running Kubernetes ≥ 1.18.

Advanced StatefulSetCredential ProviderKubernetes
0 likes · 12 min read
What’s New in OpenKruise v1.7? K8s 1.28 Upgrade, SidecarSet, Advanced StatefulSet, and More
System Architect Go
System Architect Go
Aug 31, 2024 · Cloud Native

Understanding Kubernetes Ingress and the Gateway API: Concepts, Comparison, and Best Practices

This article explains how Kubernetes manages inbound traffic using Ingress and the newer Gateway API, compares their features and limitations, provides YAML examples for Ingress, GatewayClass, Gateway, and Route resources, and discusses the future direction of traffic management in cloud‑native environments.

Gateway APIIngressK8s Networking
0 likes · 7 min read
Understanding Kubernetes Ingress and the Gateway API: Concepts, Comparison, and Best Practices
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Aug 30, 2024 · Cloud Native

Middleware Containerization and Cloud‑Native Transformation at OPPO

OPPO transformed its sprawling, manually‑provisioned middleware clusters into a cloud‑native, containerized platform by building custom Kubernetes controllers, IP‑preserving StatefulSets, resource‑isolated containers, automated monitoring and self‑healing workflows, enabling rapid provisioning, efficient utilization, fault‑tolerant scaling and future serverless and service‑mesh integration.

KubernetesOperatorcloud-native
0 likes · 20 min read
Middleware Containerization and Cloud‑Native Transformation at OPPO
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Aug 30, 2024 · Industry Insights

How GPU Virtualization Powers AI and Cloud Computing: Techniques, Challenges, and Future Directions

This article examines the rapid rise of GPU virtualization as a solution for efficient GPU resource utilization in AI, big data, and high‑performance computing, detailing its concepts, implementation methods across user, kernel, and hardware layers, Kubernetes integration, real‑world use cases, challenges, and emerging research trends.

Device PluginGPU virtualizationKubernetes
0 likes · 25 min read
How GPU Virtualization Powers AI and Cloud Computing: Techniques, Challenges, and Future Directions
Infra Learning Club
Infra Learning Club
Aug 30, 2024 · Cloud Native

Kubelet Source Dive: syncLoopIteration (Part 3) – How probeCh Is Built from Probe Managers

The article explains that the apparent probeCh in kubelet is actually three separate channels—livenessCh, readinessCh, and startupCh—managed by livenessManager, readinessManager, and startupManager, details the ProbeManager implementation that creates probe workers via AddPod, and shows how syncLoopIteration processes probe updates to adjust pod status.

GoKubernetescloud-native
0 likes · 8 min read
Kubelet Source Dive: syncLoopIteration (Part 3) – How probeCh Is Built from Probe Managers
Linux Ops Smart Journey
Linux Ops Smart Journey
Aug 29, 2024 · Operations

How to Diagnose and Fix CoreDNS Timeout Issues in Kubernetes

This article explains why CoreDNS may experience DNS resolution timeouts in a Kubernetes cluster, how to analyze logs and timeout settings, locate upstream DNS problems, and apply practical solutions such as adjusting timeout values, switching upstream DNS servers, and deploying a local DNS service.

Cloud NativeCoreDNSDNS
0 likes · 4 min read
How to Diagnose and Fix CoreDNS Timeout Issues in Kubernetes
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Aug 29, 2024 · Cloud Native

Mastering Kubernetes: Core Concepts, Architecture, and Real‑World Use Cases

This article provides a comprehensive overview of Kubernetes (K8S), covering its origins, key problems it solves, master‑node architecture, core components such as kube‑apiserver, scheduler, controllers, node agents, and practical applications like CI/CD integration, multi‑tenant and micro‑service deployments.

Cloud NativeCluster ManagementKubernetes
0 likes · 9 min read
Mastering Kubernetes: Core Concepts, Architecture, and Real‑World Use Cases
JD Tech Talk
JD Tech Talk
Aug 28, 2024 · Cloud Native

Joylive Agent: Java Bytecode Enhancement for Proxyless Service Mesh and Microservice Governance

The document introduces Joylive Agent, a Java bytecode‑enhancement framework that implements a proxyless service‑mesh architecture, detailing its microkernel design, plugin system, request abstraction, core governance features such as multi‑live routing, lane management, and providing implementation examples and Kubernetes deployment guidance.

Kubernetesbytecode enhancementproxyless
0 likes · 33 min read
Joylive Agent: Java Bytecode Enhancement for Proxyless Service Mesh and Microservice Governance
Cloud Native Technology Community
Cloud Native Technology Community
Aug 28, 2024 · Cloud Native

Kubernetes 1.31 Introduces the Alpha ‘distribute-cpus-across-cores’ Option in CPUManager Static Policy

Kubernetes 1.31 adds an alpha‑stage ‘distribute-cpus-across-cores’ option to the CPUManager static policy, allowing CPUs to be spread across physical cores for better cache locality, reduced contention, and improved performance in multi‑core and performance‑sensitive workloads.

CPUManagerCloud NativeKubernetes
0 likes · 7 min read
Kubernetes 1.31 Introduces the Alpha ‘distribute-cpus-across-cores’ Option in CPUManager Static Policy
Linux Ops Smart Journey
Linux Ops Smart Journey
Aug 27, 2024 · Information Security

Secure Your Container Images: Integrate Trivy Scanning into Harbor

This guide explains how to secure container images by integrating the Trivy vulnerability scanner into the Harbor registry, covering Helm configuration, offline database setup, automated updates via cron, verification steps, and useful references for a robust cloud‑native security workflow.

Container SecurityKubernetesTrivy
0 likes · 7 min read
Secure Your Container Images: Integrate Trivy Scanning into Harbor
Infra Learning Club
Infra Learning Club
Aug 27, 2024 · Cloud Native

Kubelet Source Code Deep Dive: Understanding Its Core Workflows

The article dissects the kubelet architecture, detailing its main syncLoop control cycle, auxiliary loops, and key managers such as podManager, podWorkers, evictionManager, probeManager, and runtime components, while explaining how pod updates, PLEG mechanisms, and various channels coordinate pod lifecycle and resource handling.

Cloud NativeKubernetesRuntime
0 likes · 9 min read
Kubelet Source Code Deep Dive: Understanding Its Core Workflows
MaGe Linux Operations
MaGe Linux Operations
Aug 24, 2024 · Databases

Recovering PostgreSQL After Power Loss: Fixing Invalid Checkpoint Errors

After a power outage caused a PostgreSQL instance running in Kubernetes to fail with connection errors and an invalid primary checkpoint record, this guide explains how to diagnose missing socket files, inspect process IDs, use pg_resetwal to repair the checkpoint, and successfully restart the database.

Checkpoint RepairDatabase RecoveryKubernetes
0 likes · 5 min read
Recovering PostgreSQL After Power Loss: Fixing Invalid Checkpoint Errors
Linux Ops Smart Journey
Linux Ops Smart Journey
Aug 23, 2024 · Cloud Native

How to Seamlessly Sync Images Between Harbor and Docker Registry

This guide walks you through configuring Harbor and Docker Registry replication, covering background considerations, creating replication repositories and rules, manual sync triggers, and reference resources, so you can keep container images up‑to‑date and secure.

Docker RegistryKubernetesimage replication
0 likes · 5 min read
How to Seamlessly Sync Images Between Harbor and Docker Registry
JD Cloud Developers
JD Cloud Developers
Aug 23, 2024 · Cloud Native

Scaling Across Clusters: JD Cloud’s Large‑Scale Application Management Practices

At KubeCon + CloudNativeCon 2024 in Hong Kong, JD Cloud presented its cross‑cluster, large‑scale application management practice, detailing a federated Serverless model that oversees over 10,000 nodes, improves resource utilization, simplifies multi‑cluster scheduling, and offers efficient elastic scaling solutions.

Cloud NativeJD CloudKubernetes
0 likes · 3 min read
Scaling Across Clusters: JD Cloud’s Large‑Scale Application Management Practices
Alibaba Cloud Native
Alibaba Cloud Native
Aug 20, 2024 · Cloud Native

How We Built a Scalable Cloud‑Native DevOps Platform from Scratch

From zero‑to‑hero in 2021, a Shanghai tech firm designed, selected, and implemented a full‑stack Cloud‑Native DevOps solution on Alibaba Cloud, covering project management, GitFlow, CI/CD pipelines, multi‑environment image promotion, infrastructure‑as‑code, and monitoring to support hundreds of micro‑services.

DevOpsGitOpsInfrastructure as Code
0 likes · 16 min read
How We Built a Scalable Cloud‑Native DevOps Platform from Scratch
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Aug 20, 2024 · Cloud Native

Create a Custom Kubernetes Health‑Check Script that Auto‑Restarts Unhealthy Pods

To prevent service disruption caused by overly aggressive or sluggish Kubernetes liveness and readiness probes, this guide shows how to implement a custom health‑check script that periodically tests a service endpoint, exits the container on failure, and lets Kubernetes automatically restart the pod.

Container RestartCustom ScriptDocker
0 likes · 5 min read
Create a Custom Kubernetes Health‑Check Script that Auto‑Restarts Unhealthy Pods
DevOps Operations Practice
DevOps Operations Practice
Aug 15, 2024 · Cloud Native

Five Best Open-Source Kubernetes Storage Solutions

This article reviews five leading open‑source storage solutions for Kubernetes—OpenEBS, Rook, GlusterFS, Ceph, and LongHorn—detailing their architectures, key features, and ideal use‑cases to help readers select the most appropriate storage option for various application requirements.

Kubernetesdistributed storage
0 likes · 6 min read
Five Best Open-Source Kubernetes Storage Solutions
dbaplus Community
dbaplus Community
Aug 13, 2024 · Artificial Intelligence

Why Kubernetes Is the Ideal Platform for AI Inference: 5 Key Benefits

Kubernetes aligns perfectly with AI inference demands by offering built‑in scalability, resource and performance optimization, seamless portability across clouds, and robust fault‑tolerance, making it a cost‑effective, high‑availability foundation for deploying large‑scale machine‑learning models.

AI inferenceKubernetesResource Optimization
0 likes · 10 min read
Why Kubernetes Is the Ideal Platform for AI Inference: 5 Key Benefits
Open Source Linux
Open Source Linux
Aug 13, 2024 · Cloud Native

How to Enable Hot Reload of ConfigMaps and Secrets in Kubernetes with Reloader

This article explains the challenges of updating ConfigMaps and Secrets in cloud‑native environments, compares env and volume injection methods, and demonstrates how the open‑source Reloader tool can automatically watch changes and trigger rolling updates for deployments, statefulsets, and other workloads.

Cloud NativeConfigMapKubernetes
0 likes · 10 min read
How to Enable Hot Reload of ConfigMaps and Secrets in Kubernetes with Reloader
ByteDance Cloud Native
ByteDance Cloud Native
Aug 12, 2024 · Cloud Native

How to Deploy NVIDIA NIM AI Models on Volcengine VKE in Minutes

This guide walks you through deploying large language models with NVIDIA NIM on Volcengine's Kubernetes Engine (VKE), covering environment setup, model optimization, Helm chart deployment, monitoring integration, and the key advantages of using NIM as a cloud‑native AI micro‑service.

AI deploymentGPUKubernetes
0 likes · 12 min read
How to Deploy NVIDIA NIM AI Models on Volcengine VKE in Minutes
Go Programming World
Go Programming World
Aug 12, 2024 · Cloud Native

Setting /dev/shm Size for Kubernetes Pods: A Production Troubleshooting Guide

During a production deployment of large language model training on Kubernetes, a pod failed due to insufficient /dev/shm shared memory; the article details the root cause, explores missing pod spec parameters, and presents a complete solution using an emptyDir volume with medium: Memory and sizeLimit to configure shared memory.

KubernetesPodemptyDir
0 likes · 11 min read
Setting /dev/shm Size for Kubernetes Pods: A Production Troubleshooting Guide
Architect's Guide
Architect's Guide
Aug 11, 2024 · Operations

Comprehensive Guide to Using Apollo Distributed Configuration Center

This article provides an in‑depth tutorial on Apollo, Ctrip's open‑source distributed configuration center, covering its core concepts, architecture, four‑dimensional configuration model, client design, deployment, and step‑by‑step instructions for creating projects, adding configurations, testing dynamic updates, and running the service in Kubernetes with Docker.

ApolloConfiguration CenterDocker
0 likes · 22 min read
Comprehensive Guide to Using Apollo Distributed Configuration Center
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Aug 8, 2024 · Cloud Native

Understanding Kubernetes (K8S) Architecture and Core Components

This article provides a comprehensive overview of Kubernetes, detailing its cloud‑native architecture, the roles of Master and Node components, key services such as API Server, etcd, Controller Manager, Scheduler, as well as node‑level elements like Kubelet, container runtimes, and Kube‑Proxy, while also offering practical code examples and resource links.

Cloud NativeKubernetesMaster Node
0 likes · 9 min read
Understanding Kubernetes (K8S) Architecture and Core Components
macrozheng
macrozheng
Aug 8, 2024 · Backend Development

Master Modern Microservices with the Mall‑Swarm 2024 Video Course

This article introduces the Mall‑Swarm open‑source microservice e‑commerce system, outlines its 2024 video tutorial covering Spring Cloud, Kubernetes deployment, and the full tech stack, and explains how learners can access demos, study materials, and join the learning community.

KubernetesMicroservicesSpring Cloud
0 likes · 9 min read
Master Modern Microservices with the Mall‑Swarm 2024 Video Course