Tagged articles
4047 articles
Page 6 of 41
Efficient Ops
Efficient Ops
Jun 17, 2025 · Operations

Boost Kubernetes Efficiency with K9s: A Terminal UI Guide

K9s delivers a terminal‑based UI that streamlines Kubernetes cluster management by providing real‑time monitoring, shortcut‑driven operations, context switching, and RBAC visualization, with cross‑platform installation options and practical tips for overview, resource analysis, pod handling, and log inspection, boosting efficiency for both novices and experts.

CLICluster MonitoringDevOps
0 likes · 4 min read
Boost Kubernetes Efficiency with K9s: A Terminal UI Guide
DevOps Operations Practice
DevOps Operations Practice
Jun 16, 2025 · Cloud Native

Mastering Kubernetes: 6 Essential Tools for Cluster Management

This article introduces six indispensable tools—kubectl, Helm, Prometheus + Grafana, Istio, Velero, and K9s—that simplify Kubernetes cluster management by covering resource handling, monitoring, networking, security, backup, and interactive UI, helping readers efficiently operate production‑grade clusters.

Cloud NativeCluster ManagementDevOps
0 likes · 7 min read
Mastering Kubernetes: 6 Essential Tools for Cluster Management
Linux Ops Smart Journey
Linux Ops Smart Journey
Jun 16, 2025 · Cloud Native

Mastering PrometheusRule: Streamline Kubernetes Alerting & Recording

This article explains how PrometheusRule, a Kubernetes custom resource, simplifies the management of alerting and recording rules by centralizing configurations, reducing restarts, avoiding conflicts, and enabling version‑controlled, modular monitoring for cloud‑native environments.

Cloud NativeKubernetesPrometheus
0 likes · 6 min read
Mastering PrometheusRule: Streamline Kubernetes Alerting & Recording
IT Services Circle
IT Services Circle
Jun 15, 2025 · Backend Development

How to Diagnose and Fix JVM GC Pauses in High‑Concurrency Microservices

This article walks through a real‑world production case, detailing how to systematically detect, analyze, and resolve severe JVM garbage‑collection pauses in a high‑concurrency Spring Boot microservice, covering resource analysis, JVM flag tuning, G1GC migration, JMX listeners, and GC‑log investigation.

Garbage CollectionJVMKubernetes
0 likes · 16 min read
How to Diagnose and Fix JVM GC Pauses in High‑Concurrency Microservices
Su San Talks Tech
Su San Talks Tech
Jun 14, 2025 · Operations

Master Apollo Config Center: From Core Concepts to SpringBoot Integration

This comprehensive guide walks you through Apollo's background, core model, four‑dimensional configuration management, client design, local caching, and deployment steps—including Maven setup, SpringBoot client implementation, Kubernetes Dockerization, and practical tests across environments, clusters, and namespaces.

ApolloConfiguration ManagementDevOps
0 likes · 26 min read
Master Apollo Config Center: From Core Concepts to SpringBoot Integration
MaGe Linux Operations
MaGe Linux Operations
Jun 13, 2025 · Cloud Native

Master Helm: From Installation to Advanced Chart Management in Kubernetes

This guide provides a comprehensive introduction to Helm, covering its purpose as a Kubernetes package manager, detailed installation steps, repository management, chart creation and structure, deployment commands, debugging techniques, built‑in objects, and practical examples such as deploying MySQL and Nginx with Helm.

Cloud NativeDeploymentDevOps
0 likes · 18 min read
Master Helm: From Installation to Advanced Chart Management in Kubernetes
MaGe Linux Operations
MaGe Linux Operations
Jun 13, 2025 · Cloud Native

Mastering Nginx Troubleshooting in Cloud‑Native Environments: A Step‑by‑Step Guide

Learn how to systematically diagnose and resolve Nginx failures in cloud‑native deployments by understanding core concepts, applying a step‑by‑step algorithm, analyzing logs, configurations, and system metrics, and using practical Kubernetes examples, code snippets, and performance models to ensure reliable service operation.

Cloud NativeDevOpsKubernetes
0 likes · 31 min read
Mastering Nginx Troubleshooting in Cloud‑Native Environments: A Step‑by‑Step Guide
MaGe Linux Operations
MaGe Linux Operations
Jun 12, 2025 · Cloud Native

Deep Dive into Kubernetes Networking: Layers, Models, and Plugins

This article provides a comprehensive deep‑dive into Kubernetes networking, explaining the four networking layers (CNI, Pod, Service, Ingress), detailing the underlying models, comparing popular plugins such as Kube‑router, Flannel, Calico, Weave Net and Cilium, and offering concrete YAML examples for deployments, services and pods.

CNIIngressKubernetes
0 likes · 16 min read
Deep Dive into Kubernetes Networking: Layers, Models, and Plugins
Ops Development Stories
Ops Development Stories
Jun 12, 2025 · Cloud Native

One-Click GPU-Enabled Kind Cluster Setup for Running Large AI Models

This tutorial walks you through using a one‑click script to create a GPU‑enabled Kind Kubernetes cluster, evenly distribute GPU resources across nodes with nvkind, install necessary drivers and toolkits, deploy a vLLM‑served large language model, and verify its operation, all on a local or cloud environment.

AI Model DeploymentDockerGPU
0 likes · 23 min read
One-Click GPU-Enabled Kind Cluster Setup for Running Large AI Models
Efficient Ops
Efficient Ops
Jun 11, 2025 · Operations

Master cURL: Essential Commands for DevOps, Monitoring, and Automation

This guide presents essential cURL commands for service health checks, API testing, file transfer, debugging, Kubernetes interactions, monitoring, load balancing, and webhook triggering, demonstrating how the versatile tool can streamline automation, CI/CD pipelines, and daily DevOps tasks.

API testingDevOpsKubernetes
0 likes · 5 min read
Master cURL: Essential Commands for DevOps, Monitoring, and Automation
Liangxu Linux
Liangxu Linux
Jun 11, 2025 · Cloud Native

What Sets K3s Apart from Full‑Featured Kubernetes? A Hands‑On Guide

This article explains the lightweight nature of K3s, compares it with standard Kubernetes, and provides step‑by‑step commands for installation, cluster access, node addition, service deployment, and ingress configuration, highlighting practical differences and use‑case considerations.

ClusterDockerIngress
0 likes · 10 min read
What Sets K3s Apart from Full‑Featured Kubernetes? A Hands‑On Guide
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Jun 10, 2025 · Cloud Native

Build a Custom Kubernetes Controller from Scratch: Init, Code, Docker, Helm

This step‑by‑step guide walks you through initializing a Kubernetes controller project with kubebuilder, writing the Reconcile logic, debugging and performance monitoring, building and pushing Docker images, and deploying the controller using Helm charts, while also covering metrics collection, RBAC configuration, and best practices for cloud‑native workloads.

ControllerDockerGo
0 likes · 19 min read
Build a Custom Kubernetes Controller from Scratch: Init, Code, Docker, Helm
MaGe Linux Operations
MaGe Linux Operations
Jun 9, 2025 · Operations

Essential Kubernetes Troubleshooting Checklist for Ops Engineers

This guide provides Kubernetes operators with a comprehensive, step‑by‑step troubleshooting manual covering pod, node, and cluster‑level issues, common pod states, exit‑code analysis, and practical commands such as kubectl describe, logs, top, and drain, enabling rapid diagnosis and resolution of K8s problems.

KubernetesNodeOperations
0 likes · 10 min read
Essential Kubernetes Troubleshooting Checklist for Ops Engineers
JakartaEE China Community
JakartaEE China Community
Jun 9, 2025 · Cloud Native

How to Choose the Right Cloud‑Native Microservice Framework (MicroProfile vs Spring)

This article explains why cloud‑native microservices are beneficial, defines their key characteristics, compares the MicroProfile and Spring frameworks, and provides detailed code examples for REST APIs, configuration, fault tolerance, security, health checks, metrics, and distributed tracing to help developers select the most suitable technology stack.

Cloud NativeKubernetesMicroProfile
0 likes · 26 min read
How to Choose the Right Cloud‑Native Microservice Framework (MicroProfile vs Spring)
MaGe Linux Operations
MaGe Linux Operations
Jun 8, 2025 · Cloud Native

Master Kubernetes RBAC: A Complete Guide to Roles, Bindings, and Permissions

This comprehensive article explains Kubernetes security by detailing authentication types, authentication methods (HTTP Basic, Token, and HTTPS), the RBAC authorization model, and the definitions and practical examples of Role, ClusterRole, RoleBinding, and ClusterRoleBinding, helping readers implement fine‑grained access control in their clusters.

AuthenticationAuthorizationCloud Native
0 likes · 23 min read
Master Kubernetes RBAC: A Complete Guide to Roles, Bindings, and Permissions
IT Xianyu
IT Xianyu
Jun 6, 2025 · Cloud Native

Master Kubernetes on AlmaLinux: Step‑by‑Step Setup with Containerd, kubeadm, and More

This guide walks you through preparing three AlmaLinux servers, disabling firewalls and SELinux, installing Containerd as the CRI, adding Kubernetes repositories, installing kubeadm, kubelet and kubectl, configuring the runtime, and verifying each component so you can confidently bootstrap a production‑ready Kubernetes cluster.

AlmaLinuxKubernetescontainerd
0 likes · 21 min read
Master Kubernetes on AlmaLinux: Step‑by‑Step Setup with Containerd, kubeadm, and More
21CTO
21CTO
Jun 5, 2025 · Backend Development

Why Apple Replaced Java with Swift for a High‑Performance Password Service

Apple’s engineering team reports that replacing a high‑traffic password‑monitoring service’s Java backend with Swift reduced memory usage by 90%, cut hardware utilization in half, and boosted throughput by 40%, while also shrinking code size dramatically.

KubernetesMemory Management
0 likes · 5 min read
Why Apple Replaced Java with Swift for a High‑Performance Password Service
Linux Ops Smart Journey
Linux Ops Smart Journey
Jun 5, 2025 · Cloud Native

Secure Longhorn UI on Kubernetes: Add Basic Auth and Enable HTTPS

This guide walks you through securing the Longhorn UI in a Kubernetes environment by configuring basic authentication and enabling HTTPS via Ingress and cert‑manager, including prerequisite setup, secret creation, Helm upgrades, and verification steps to ensure production‑grade protection.

Basic AuthHTTPSIngress
0 likes · 6 min read
Secure Longhorn UI on Kubernetes: Add Basic Auth and Enable HTTPS
FunTester
FunTester
Jun 5, 2025 · Cloud Native

Automating Thread Dump Generation and Retrieval in Kubernetes for Efficient Fault Diagnosis

The article explains how automating thread dump creation and download in Kubernetes using tools like Fabric8, Prometheus, and CI/CD pipelines dramatically improves fault‑diagnosis speed, data centralization, real‑time capture, and integration with testing frameworks, transforming manual, error‑prone processes into streamlined, intelligent operations.

KubernetesThread Dumpautomation
0 likes · 6 min read
Automating Thread Dump Generation and Retrieval in Kubernetes for Efficient Fault Diagnosis
Linux Ops Smart Journey
Linux Ops Smart Journey
Jun 4, 2025 · Cloud Native

Deploy Longhorn on Kubernetes with Helm: Step‑by‑Step Guide

This article provides a comprehensive, hands‑on tutorial for deploying the open‑source Longhorn distributed block storage system on a Kubernetes cluster using Helm, covering prerequisites, Helm chart preparation, installation, validation, and PVC mounting to ensure reliable stateful workloads.

KubernetesLonghornPersistent Volumes
0 likes · 11 min read
Deploy Longhorn on Kubernetes with Helm: Step‑by‑Step Guide
MaGe Linux Operations
MaGe Linux Operations
Jun 4, 2025 · Cloud Native

Master Helm Repository Management: Add, Update, Search, and Secure Charts

This guide explains Helm repository fundamentals, outlines official, community, and private repository types, lists popular chart sources, and provides step‑by‑step commands for adding, updating, listing, removing, searching, versioning, pulling charts, and managing private repositories with Helm.

Chart RepositoryCloud NativeDevOps
0 likes · 8 min read
Master Helm Repository Management: Add, Update, Search, and Secure Charts
DataFunTalk
DataFunTalk
Jun 4, 2025 · Artificial Intelligence

Coupang’s Distributed Cache Architecture Accelerates AI/ML Model Training

Coupang’s AI platform replaces costly data‑copy steps with a distributed cache that automatically pulls data from a central lake, boosts GPU utilization across regions, cuts storage and operational expenses, and speeds up model training by up to 40% while simplifying deployment via Kubernetes.

AIData LakeGPU
0 likes · 9 min read
Coupang’s Distributed Cache Architecture Accelerates AI/ML Model Training
Open Source Linux
Open Source Linux
Jun 4, 2025 · Operations

How kubectl‑ai Transforms Kubernetes Management with Natural‑Language AI

kubectl‑ai is an AI‑powered Kubernetes CLI plugin that lets users issue natural‑language commands to query, create, and troubleshoot resources, supports multiple large language models, offers an interactive mode, and provides installation guidance, usage scenarios, technical highlights, and a balanced view of its benefits and limitations.

AIKuberneteskubectl
0 likes · 14 min read
How kubectl‑ai Transforms Kubernetes Management with Natural‑Language AI
dbaplus Community
dbaplus Community
Jun 3, 2025 · Operations

Mastering Kubernetes High Availability: Control Plane, Nodes, Networking, Storage, and More

This comprehensive guide walks you through designing a highly available Kubernetes cluster, covering multi‑master control‑plane deployment, worker‑node resilience, advanced networking with Cilium, durable storage with Rook/Ceph, monitoring with Thanos, security policies, disaster‑recovery strategies, cost control, and automated rollouts, all illustrated with concrete configuration snippets and real‑world performance results.

Cluster DesignDevOpsKubernetes
0 likes · 13 min read
Mastering Kubernetes High Availability: Control Plane, Nodes, Networking, Storage, and More
Efficient Ops
Efficient Ops
Jun 3, 2025 · Operations

What Anthropic’s SRE Team Learned: 23 Practical Ops Tips for Scalable AI Infrastructure

This article shares Anthropic’s SRE engineer insights on 23 actionable practices—from schema migration and Karpenter node management to OpenTelemetry adoption, Helm chart storage, and Terraform versus CloudFormation—offering concrete recommendations for building reliable, cost‑effective AI and cloud‑native platforms.

Cloud NativeDevOpsInfrastructure
0 likes · 12 min read
What Anthropic’s SRE Team Learned: 23 Practical Ops Tips for Scalable AI Infrastructure
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 3, 2025 · Artificial Intelligence

Deploying and Managing Ray on Alibaba Cloud ACK with KubeRay: Architecture, Code Samples, and Scheduling Strategies

This article explains how to build a flexible machine‑learning infrastructure on Alibaba Cloud ACK using Ray and KubeRay, covering Ray's core components, AI libraries, deployment options on VMs and Kubernetes, code examples for data processing, model serving, and advanced scheduling and quota management techniques.

AIAlibaba CloudKubeRay
0 likes · 17 min read
Deploying and Managing Ray on Alibaba Cloud ACK with KubeRay: Architecture, Code Samples, and Scheduling Strategies
Cloud Native Technology Community
Cloud Native Technology Community
Jun 3, 2025 · Cloud Native

Custom Stop Signals for Containers in Kubernetes v1.33 (Alpha Feature)

Starting with Kubernetes v1.33, the Alpha feature ContainerStopSignals introduces a Pod‑level lifecycle.stopSignal field that lets platform teams specify which Unix signal (e.g., SIGINT, SIGUSR1) should be sent to containers on termination, overcoming the previous reliance on image‑defined STOPSIGNAL and improving graceful shutdown control across Linux and Windows workloads.

Alpha FeatureContainerStopSignalsGraceful Shutdown
0 likes · 6 min read
Custom Stop Signals for Containers in Kubernetes v1.33 (Alpha Feature)
Top Architecture Tech Stack
Top Architecture Tech Stack
Jun 3, 2025 · Backend Development

Token Transmission and Internal Service Call Strategies in Spring Boot Microservices

The article discusses why token transmission is discouraged in microservice authentication, proposes passing explicit parameters, compares Feign, Dubbo, and Spring Boot Web internal call approaches, and explains how to integrate these patterns with a unified gateway, regular authentication modules, and Kubernetes Ingress for scalable backend architectures.

DubboKubernetesMicroservices
0 likes · 8 min read
Token Transmission and Internal Service Call Strategies in Spring Boot Microservices
Linux Ops Smart Journey
Linux Ops Smart Journey
May 30, 2025 · Cloud Native

Capture Live Pod Traffic with kubectl sniff: A Complete Guide

This guide introduces the kubectl sniff plugin for Kubernetes, explains its non‑privileged and privileged modes, walks through installation via krew or offline, and provides step‑by‑step commands to capture pod traffic with tcpdump and view it in Wireshark, enhancing network debugging efficiency.

KubernetesPacket CaptureWireshark
0 likes · 11 min read
Capture Live Pod Traffic with kubectl sniff: A Complete Guide
Huolala Tech
Huolala Tech
May 29, 2025 · Artificial Intelligence

How LWS Enables Scalable Multi‑Node Large Model Deployment on Kubernetes

The article explains how the Dolphin AI platform tackles large‑model deployment challenges by replacing standard Kubernetes Deployments with LeaderWorkerSet, detailing its architecture, features, installation steps, example configurations, testing, scaling, rolling updates, fault recovery, and future roadmap for AI workloads.

AI PlatformDistributed inferenceKubernetes
0 likes · 12 min read
How LWS Enables Scalable Multi‑Node Large Model Deployment on Kubernetes
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
May 28, 2025 · Operations

How to Fix etcd “NOSPACE” Errors in Kubernetes Clusters

When a Kubernetes cluster’s etcd reaches its default 2 GB quota, it triggers a “NOSPACE” alarm that blocks all write operations, causing critical services to fail; this guide explains the root cause, how to diagnose the issue with etcdctl, and step‑by‑step remediation including compaction, defragmentation, and quota expansion.

KubernetesNOSPACEcompaction
0 likes · 7 min read
How to Fix etcd “NOSPACE” Errors in Kubernetes Clusters
FunTester
FunTester
May 28, 2025 · Cloud Native

Extending Automated Thread Dumps: Log Collection, Resource Monitoring, Chaos Engineering, Performance Analysis, and Environment Cleanup

The article explores how automated thread dumps can be expanded into multiple testing scenarios—including log collection, resource monitoring, fault injection, performance result analysis, and environment cleanup—by leveraging Kubernetes APIs, Prometheus, Chaos Mesh, and scripting tools to improve efficiency, observability, and system resilience.

KubernetesPerformance TestingResource Monitoring
0 likes · 9 min read
Extending Automated Thread Dumps: Log Collection, Resource Monitoring, Chaos Engineering, Performance Analysis, and Environment Cleanup
Alibaba Cloud Native
Alibaba Cloud Native
May 24, 2025 · Cloud Native

Getting Started with Higress: A Cloud‑Native AI‑Powered API Gateway

Higress is a cloud‑native API gateway built on Istio and Envoy that adds AI capabilities, supports Go/Rust/JS Wasm plugins, and offers a web console; the article explains its architecture, core components, deployment via Helm, plugin development, and how developers can contribute to the open‑source project.

Kubernetesapi-gatewaycloud-native
0 likes · 16 min read
Getting Started with Higress: A Cloud‑Native AI‑Powered API Gateway
Architect's Tech Stack
Architect's Tech Stack
May 23, 2025 · Databases

RedisInsight Guide: Installation, Configuration, and Basic Usage

This article provides a comprehensive tutorial on RedisInsight, covering its features, step‑by‑step physical and Kubernetes installation, environment variable setup, service startup, and basic usage together with Redis server configuration, illustrated with code snippets and screenshots.

Database GUIInstallationKubernetes
0 likes · 6 min read
RedisInsight Guide: Installation, Configuration, and Basic Usage
FunTester
FunTester
May 23, 2025 · Operations

Mastering JVM Thread Dumps: From Diagnosis to Kubernetes Automation

This article explains what JVM thread dumps are, why they are crucial for fault testing, outlines common scenarios such as deadlocks and resource leaks, and provides step‑by‑step methods—including jstack, kill‑3, VisualVM, programmatic APIs, and Fabric8‑driven Kubernetes automation—to capture and analyze them effectively.

Fabric8JVMKubernetes
0 likes · 15 min read
Mastering JVM Thread Dumps: From Diagnosis to Kubernetes Automation
Cloud Native Technology Community
Cloud Native Technology Community
May 22, 2025 · Information Security

How to Prevent Common Kubernetes Security Mistakes and Harden Your Cluster

This article analyzes typical Kubernetes security pitfalls—from weak authentication and overly permissive network policies to missing real‑time monitoring, exposed services, outdated versions, and default component settings—and provides concrete, layered mitigation steps and tool recommendations.

Cloud NativeKubernetesNetwork Policy
0 likes · 13 min read
How to Prevent Common Kubernetes Security Mistakes and Harden Your Cluster
Efficient Ops
Efficient Ops
May 21, 2025 · Operations

Why We Dropped Kubernetes: Cutting Costs by 62% and Boosting DevOps Happiness

Six months after abandoning Kubernetes, our DevOps team reduced infrastructure spend by 62%, cut deployment time by 89%, eliminated weekend on‑call duties, and improved overall happiness, demonstrating that simplifying the tech stack can deliver substantial operational and business benefits.

Cost reductionDevOpsInfrastructure
0 likes · 9 min read
Why We Dropped Kubernetes: Cutting Costs by 62% and Boosting DevOps Happiness
MaGe Linux Operations
MaGe Linux Operations
May 16, 2025 · Artificial Intelligence

Deploying Massive AI Models with Docker: A Complete From‑Zero‑to‑Production Guide

Learn how to efficiently package, build, and run large AI models in Docker containers—from preparing the model and API code, creating Dockerfiles, building and testing images, to scaling in production with Kubernetes and GPU support—complete with step‑by‑step commands and best‑practice tips.

AI Model DeploymentDockerFastAPI
0 likes · 10 min read
Deploying Massive AI Models with Docker: A Complete From‑Zero‑to‑Production Guide
MaGe Linux Operations
MaGe Linux Operations
May 15, 2025 · Cloud Native

Kubernetes Security Deep Dive: Auth, AuthZ, and Admission Control

This article explains Kubernetes' comprehensive security architecture, detailing the three critical gates—authentication, authorization, and admission control—along with token, basic, and certificate methods, RBAC policies, service accounts, kubeconfig setup, and practical examples for managing user permissions within clusters.

AdmissionControlAuthenticationAuthorization
0 likes · 20 min read
Kubernetes Security Deep Dive: Auth, AuthZ, and Admission Control
Cloud Native Technology Community
Cloud Native Technology Community
May 15, 2025 · Cloud Native

What’s New in Kubernetes 1.33? Deep Dive into Sidecar, GPU/TPU Allocation, and Namespace Enhancements

Kubernetes 1.33 introduces native sidecar support, enhanced namespace isolation, and dedicated GPU/TPU resource allocation, delivering 64 feature upgrades—including stable sidecar containers, dynamic resource APIs, and user‑namespace security—while providing practical adoption guidance for production environments.

Cloud NativeKubernetesSidecar
0 likes · 5 min read
What’s New in Kubernetes 1.33? Deep Dive into Sidecar, GPU/TPU Allocation, and Namespace Enhancements
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
May 15, 2025 · Cloud Native

How 360’s AI Platform Boosted GPU Utilization with Volcano Scheduler

360’s AI platform migrated its GPU clusters to a cloud‑native architecture and adopted the Volcano scheduler, achieving over 45% GPU utilization, less than 7% fragmentation, and more than 1000000 scheduled Pods, while leveraging flexible plugins, hierarchical queues, and resource pooling to optimize AI and big‑data workloads.

AI PlatformGPU schedulingKubernetes
0 likes · 13 min read
How 360’s AI Platform Boosted GPU Utilization with Volcano Scheduler
Go Programming World
Go Programming World
May 15, 2025 · Cloud Native

Deploying MCP Gateway on Kubernetes: A Step‑by‑Step Guide

This article explains how to quickly turn existing HTTP APIs into MCP services using the Go‑based mcp‑gateway, covering the installation of a local Kind Kubernetes cluster, the required YAML manifests, Docker image handling, OpenAPI import, and verification through the web console and chat interface.

Cloud NativeGoKubernetes
0 likes · 14 min read
Deploying MCP Gateway on Kubernetes: A Step‑by‑Step Guide
Alibaba Cloud Developer
Alibaba Cloud Developer
May 14, 2025 · Cloud Native

How LoongCollector Redefines Log Collection: Performance, Flexibility, and Zero‑Downtime Upgrades

LoongCollector, the 2025 evolution of iLogtail, delivers a comprehensive overhaul of log collection with high‑performance C++/Go pipelines, dynamic plugin combos, hot‑reload isolation, adaptive network throttling, enhanced tag handling, seamless migration, and robust monitoring for cloud‑native environments, achieving up to 80% higher throughput while cutting CPU usage by 35% and memory by 10%.

KubernetesPipelinelog collection
0 likes · 15 min read
How LoongCollector Redefines Log Collection: Performance, Flexibility, and Zero‑Downtime Upgrades
MaGe Linux Operations
MaGe Linux Operations
May 13, 2025 · Operations

Master Argo CD: A Hands‑On Guide to GitOps Continuous Delivery

This comprehensive guide introduces Argo CD, explains its architecture and core concepts, walks through installation, cluster registration, application creation from a Git repository, and demonstrates how to sync and monitor applications using both the UI and CLI, empowering developers to adopt GitOps for Kubernetes deployments.

Argo CDCLIContinuous Delivery
0 likes · 17 min read
Master Argo CD: A Hands‑On Guide to GitOps Continuous Delivery
Efficient Ops
Efficient Ops
May 12, 2025 · Cloud Native

Master Kubernetes Management with Kuboard: Visual UI Guide & Installation

Kuboard is a web‑based visual tool for managing Kubernetes clusters, offering multi‑auth, multi‑cluster support, micro‑service layering, and storage integration; the guide explains Docker installation, adding clusters via KubeConfig, workload inspection, and how the UI simplifies complex command‑line operations.

Cloud NativeCluster ManagementDocker
0 likes · 5 min read
Master Kubernetes Management with Kuboard: Visual UI Guide & Installation
Linux Ops Smart Journey
Linux Ops Smart Journey
May 12, 2025 · Cloud Native

Boost CI/CD Efficiency: Using Kubernetes Pods as Jenkins Agents

This article explains how to replace traditional Jenkins agents with Kubernetes containers, covering prerequisites, benefits such as dynamic scaling and environment consistency, step‑by‑step configuration, pipeline code, and verification to create a more efficient and reliable CI/CD workflow.

DevOpsJenkinsKubernetes
0 likes · 5 min read
Boost CI/CD Efficiency: Using Kubernetes Pods as Jenkins Agents
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
May 12, 2025 · Cloud Native

Transform a Single‑Cluster CD Pipeline into a Multi‑Cluster System with ACK One

This guide explains how to leverage Alibaba Cloud's ACK One multi‑cluster application distribution together with the Cloud Effect DevOps platform to convert an existing single‑cluster continuous delivery pipeline into a resilient, multi‑region, multi‑cluster CD solution without modifying original YAML resources.

ACK OneCloud EffectContinuous Delivery
0 likes · 9 min read
Transform a Single‑Cluster CD Pipeline into a Multi‑Cluster System with ACK One
Linux Ops Smart Journey
Linux Ops Smart Journey
May 6, 2025 · Operations

Enable Calico API Service in Kubernetes: Step‑by‑Step Guide

Learn how to correctly install, configure, and activate Calico's API server on a Kubernetes cluster—including prerequisites, downloading manifests, adjusting namespaces, pulling and tagging images, generating certificates, applying resources, and verifying the service—to manage Calico custom resources via kubectl.

APICNICalico
0 likes · 8 min read
Enable Calico API Service in Kubernetes: Step‑by‑Step Guide
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
May 1, 2025 · Artificial Intelligence

Fine-grained Profiling of Online AI Workloads on Kubernetes Using ACK AI Profiling

This article demonstrates how to use ACK AI Profiling, built on eBPF and dynamic process injection, to perform non-intrusive, low‑overhead profiling of Kubernetes‑deployed large‑language‑model inference services, identify GPU memory growth causes, and apply optimization recommendations to prevent OOM issues.

AI profilingGPU MemoryKubernetes
0 likes · 10 min read
Fine-grained Profiling of Online AI Workloads on Kubernetes Using ACK AI Profiling
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 28, 2025 · Cloud Native

Improving OSS Small‑File Access Performance with StrmVol Storage Volumes in Kubernetes

StrmVol storage volumes replace the FUSE‑based OSS mount with a virtual block device and kernel‑mode file system, dramatically reducing latency for massive small‑file reads in Kubernetes workloads such as AI training datasets, and the article demonstrates setup, configuration, and performance testing using Argo Workflows.

Argo WorkflowsCSIKubernetes
0 likes · 13 min read
Improving OSS Small‑File Access Performance with StrmVol Storage Volumes in Kubernetes
Linux Ops Smart Journey
Linux Ops Smart Journey
Apr 25, 2025 · Cloud Native

How to Seamlessly Migrate Calico from etcd to Kubernetes Datastore

Learn step‑by‑step how to transition Calico’s data store from etcd to Kubernetes, covering prerequisite checks, locking the datastore, exporting and importing data, reconfiguring calicoctl, applying the new manifests, and unlocking the store, while highlighting benefits and tips for a smooth migration.

CalicoKubernetesNetwork Policy
0 likes · 11 min read
How to Seamlessly Migrate Calico from etcd to Kubernetes Datastore
Cloud Native Technology Community
Cloud Native Technology Community
Apr 24, 2025 · Cloud Native

Simplifying Kubernetes Application Deployment with Korifi and Optimizing Cloud Costs using OpenCost

Korifi provides a Cloud Foundry‑style abstraction layer that simplifies Kubernetes application deployment and management, while OpenCost offers comprehensive, multi‑cloud cost visibility and optimization; together they enable developers to deploy apps easily, monitor pod‑level expenses, and make data‑driven resource allocation decisions.

Cost OptimizationDevOpsKorifi
0 likes · 7 min read
Simplifying Kubernetes Application Deployment with Korifi and Optimizing Cloud Costs using OpenCost
Raymond Ops
Raymond Ops
Apr 23, 2025 · Cloud Native

Secure Kubernetes Secrets with kubeseal: Installation & Usage Guide

This article explains how to install kubeseal and its controller, create and encrypt Kubernetes Secret manifests, apply sealed secrets, and provides useful tips on API routes, custom certificates, and RBAC to securely manage secrets in a cloud‑native environment.

KubeSealKubernetesSealed Secrets
0 likes · 5 min read
Secure Kubernetes Secrets with kubeseal: Installation & Usage Guide
Go Programming World
Go Programming World
Apr 22, 2025 · Artificial Intelligence

Design and Implementation of an Enterprise‑Grade LLMOPS Platform (EasyAI)

This article presents a comprehensive overview of building an enterprise‑level LLMOPS platform—including concept definitions, the relationship between LLMOPS, MLOps and intelligent agent platforms, four development tiers, architecture layers, core technical concerns, deployment options, and the benefits of cloud‑native AI development.

AI PlatformCloud NativeDevOps
0 likes · 15 min read
Design and Implementation of an Enterprise‑Grade LLMOPS Platform (EasyAI)
IT Xianyu
IT Xianyu
Apr 21, 2025 · Cloud Native

Step-by-Step Guide to Setting Up a Kubernetes 1.19 Cluster on CentOS 7.9

This guide walks through preparing two CentOS 7.9 servers, installing Docker and Kubernetes 1.19 components, initializing a master node, joining a worker node, and validating the cluster with a sample Nginx deployment, including common troubleshooting tips.

CalicoCentOSCloud Native
0 likes · 10 min read
Step-by-Step Guide to Setting Up a Kubernetes 1.19 Cluster on CentOS 7.9
Selected Java Interview Questions
Selected Java Interview Questions
Apr 18, 2025 · Cloud Native

One‑Click Deployment of Spring Cloud Microservices Using Jenkins, Docker, and Kubernetes

This guide walks through a complete one‑click Jenkins pipeline that pulls Spring Cloud source code from Git, builds it with Maven, packages the JAR into a Docker image, pushes the image to a registry, and finally deploys or updates the service on Kubernetes, covering SSH‑key setup, pipeline scripting, and Kubernetes resource definitions.

DockerJenkinsKubernetes
0 likes · 16 min read
One‑Click Deployment of Spring Cloud Microservices Using Jenkins, Docker, and Kubernetes
IT Services Circle
IT Services Circle
Apr 17, 2025 · Backend Development

McDonald's Java Backend Interview Experience and Technical Q&A

This article shares a candidate's interview experience at McDonald's China Technology R&D Center, detailing salary expectations, work schedule, and an extensive technical Q&A covering IoC/DI, MyBatis, JVM memory and garbage‑collection algorithms, MySQL storage engines, concurrency, locking, RPC, HTTPS, Docker/Kubernetes commands, and Git workflows, while also including some promotional material.

BackendDockerKubernetes
0 likes · 22 min read
McDonald's Java Backend Interview Experience and Technical Q&A
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Apr 17, 2025 · Cloud Native

Kubernetes Architecture and Core Principles Explained

This article provides a comprehensive overview of Kubernetes, covering its cloud‑native architecture, core components such as API Server, Scheduler, Controller Manager, etcd, kubelet and kube‑proxy, and explains the workflow that enables automated deployment, scaling and management of containerized applications.

Cloud NativeDevOpsKubernetes
0 likes · 6 min read
Kubernetes Architecture and Core Principles Explained
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 17, 2025 · Cloud Native

OpenKruise 1.8 Release Highlights: In‑Place VPA, StatefulSet Volume Expansion, AI WorkloadSpread, Serverless Probe, SidecarSet Gray‑Release, and Helm Pre‑Delete Hook

OpenKruise 1.8, the latest CNCF‑incubated cloud‑native automation suite, introduces in‑place vertical pod autoscaling, native StatefulSet volume expansion, AI‑aware WorkloadSpread, serverless probe support, sidecar gray‑release capabilities, and a Helm pre‑delete safety hook, all backed by detailed YAML examples and future roadmap.

Cloud NativeInPlaceVPAKubernetes
0 likes · 13 min read
OpenKruise 1.8 Release Highlights: In‑Place VPA, StatefulSet Volume Expansion, AI WorkloadSpread, Serverless Probe, SidecarSet Gray‑Release, and Helm Pre‑Delete Hook