Tagged articles
3116 articles
dbaplus Community
Oct 13, 2025 · Cloud Native

10 Common Kubernetes Deployment Errors and How to Fix Them

Most Kubernetes deployment failures stem from misconfigurations, image problems, or resource constraints. This guide covers the ten most frequent errors, with troubleshooting commands for each, a generic debugging framework, and proactive practices to prevent future failures.

Cloud Native · Containers · Deployment
14 min read
Mike Chen's Internet Architecture
Oct 13, 2025 · Cloud Native

Mastering Kubernetes Load Balancing: Internal & External Strategies Explained

This article provides a comprehensive overview of Kubernetes load balancing, detailing internal Service‑based balancing and external cloud‑provider solutions, illustrating configuration examples for ClusterIP, NodePort, and LoadBalancer types, and comparing common traffic‑distribution algorithms such as round‑robin, least connections, session affinity, and weighted routing.

Cloud Native · ClusterIP · Kubernetes
4 min read
Ray's Galactic Tech
Oct 12, 2025 · Cloud Native

Master Production‑Grade Kubernetes YAML: 10+ Security & Performance Checks

This guide presents a comprehensive, production‑ready Kubernetes YAML checklist covering over ten essential security, stability, observability, and scalability validation points, plus eight advanced best‑practice recommendations, enabling teams to create robust, maintainable, and automated configuration pipelines.

Cloud Native · Kubernetes · Security
12 min read
Alibaba Cloud Native
Oct 12, 2025 · Cloud Native

Boost Code Review Accuracy with Single‑Commit AI Review Mode

The article explains how the single‑commit review mode in Alibaba Cloud Codeup uses AI to evaluate each commit individually, addressing the shortcomings of default bulk diff reviews, detailing configuration steps, recommended scenarios, observed benefits, and its performance trade‑offs.

AI code review · Cloud Native · Git
6 min read
Open Source Tech Hub
Oct 11, 2025 · Cloud Native

What’s New in Docker‑PHP v4.0.0‑beta1? Deep Dive into FrankenPHP, Laravel Scripts, and QoL Boosts

Docker‑PHP v4.0.0‑beta1 introduces hardened FrankenPHP images, upgraded Laravel automation, a slew of fine‑grained environment variables, and numerous quality‑of‑life and bug‑fix improvements that make containerized PHP deployments smoother, more secure, and ready for production‑grade Kubernetes workloads.

Cloud Native · Docker · FrankenPHP
8 min read
Ops Community
Oct 8, 2025 · Cloud Native

How I Cut My Kubernetes Cloud Bill by 60% in 3 Months – Proven Strategies

Facing a 35‑million‑yuan monthly Kubernetes bill, the author analyzed hidden cost components, implemented five optimization campaigns—including resource request tuning, autoscaling, spot instances, storage tiering, and network consolidation—and reduced monthly expenses by 60% while boosting performance, delivering a detailed, reproducible methodology.

Cloud Native · Cost Optimization · FinOps
33 min read
IT Architects Alliance
Oct 7, 2025 · Cloud Native

How to Transform Legacy IT Systems into Cloud‑Native Architecture: A Step‑by‑Step Guide

This article examines the limitations of traditional IT infrastructures and presents a progressive, cloud‑native refactoring roadmap—including containerization, micro‑service migration, Kubernetes orchestration, and DevOps practices—supported by concrete YAML examples and best‑practice recommendations for risk mitigation, cost optimization, and continuous evolution.

Cloud Native · DevOps · Microservices
10 min read
Architect's Guide
Oct 7, 2025 · Backend Development

Mastering Backend Architecture: From Microservices to Service Mesh and Message Queues

This article presents a comprehensive roadmap for backend architects, covering microservice fundamentals, design principles, gateway patterns, communication protocols, service registration, configuration management, observability pillars, service mesh options, and a detailed comparison of modern message‑queue technologies.

Backend · Cloud Native · Message Queue
29 min read
IT Architects Alliance
Oct 6, 2025 · Cloud Native

Mastering Cloud‑Native Observability: From Metrics to Tracing

The article explains why enterprises struggle with cloud‑native observability, outlines the exponential complexity and dynamic nature of modern microservice environments, and presents a comprehensive three‑pillar approach—metrics, logging, tracing—along with practical Prometheus, OpenTelemetry, and sidecar configurations, storage choices, sampling, alerting, cost‑control, team upskilling, and future trends such as AIOps and eBPF.

Cloud Native · Observability · OpenTelemetry
12 min read
DevOps Coach
Oct 5, 2025 · Cloud Native

How Medium Scales Microservices with Kubernetes: Architecture, Tools, and Tuning

Medium explains why it chose Kubernetes for microservice management, describes its multi‑cluster deployment across four availability zones, details configuration tooling with Terraform, and shares scaling optimizations using a cluster over‑provisioner and pod preemption to achieve smoother node utilization.

Cloud Native · Cluster Overprovisioner · Kubernetes
7 min read
MaGe Linux Operations
Oct 4, 2025 · Operations

How I Doubled My Salary by Switching from Traditional Ops to SRE in 18 Months

The author recounts an 18‑month, step‑by‑step move from a fire‑fighting traditional operations role to a high‑paying SRE/DevOps career, covering motivations, skill gaps, learning plans, project implementations, interview preparation, and real‑world outcomes, and offers a practical roadmap for engineers seeking similar growth.

Cloud Native · Operations · SRE
44 min read
Open Source Linux
Oct 4, 2025 · Operations

Beyond 35: How Ops Professionals Can Thrive in Cloud Native & DevOps

Operations engineers over 35 can still advance by mastering cloud-native technologies, DevOps automation, security operations, or ITIL service management, or by moving into leadership roles. The article offers practical advice on skill development, market trends, and real-world examples of diverse career paths.

Career Development · Cloud Native · ITIL
9 min read
IT Architects Alliance
Oct 2, 2025 · Cloud Native

Mastering Cloud‑Native Architecture: 6 Core Principles Every Engineer Should Know

This article outlines six fundamental cloud‑native architecture principles—immutable infrastructure, service mesh, observability, declarative APIs, resilient design, and shift‑left security—explaining their purpose, key practices, code examples, and how they interrelate to build scalable, reliable, and secure distributed systems.

Cloud Native · Declarative API · Observability
11 min read
Raymond Ops
Sep 29, 2025 · Cloud Native

Master GitLab: From History to Full Installation & Configuration Guide

This article provides a comprehensive overview of GitLab—including its origins, key features, and evolution—followed by detailed, step‑by‑step instructions for installing, configuring, and accessing GitLab on Rocky Linux, covering repository management, CI/CD pipelines, and Kubernetes integration.

Cloud Native · Configuration · DevOps
9 min read
Alibaba Cloud Observability
Sep 29, 2025 · Cloud Native

What Makes HarmonyOS NEXT a Pure Cloud‑Native OS? Inside the Architecture and SDK

This article introduces HarmonyOS NEXT's pure, fast, and minimal design, its development base with DevEco Studio and ArkTS, the compilation artifacts (HAR, HSP, HAP), the system‑level open capability map, and the ARMS RUM SDK's architecture, session management, and three unobtrusive data‑collection schemes for performance monitoring.

Cloud Native · HarmonyOS · Performance Monitoring
11 min read
Alibaba Cloud Observability
Sep 29, 2025 · Artificial Intelligence

Building a Cloud‑Native Observability Stack for LLM Apps with Alibaba SLS

This article details the engineering practice of constructing a complete data infrastructure for large‑language‑model (LLM) applications using Alibaba Cloud SLS, covering the observability challenges of the Dify platform, the redesign of the architecture, and the resulting improvements in monitoring, diagnosis, and quality optimization.

Cloud Native · Dify · LLM
23 min read
Alibaba Cloud Observability
Sep 29, 2025 · Cloud Native

How Alibaba Cloud SLS Soft Delete Enables Instant, Low‑Cost Data Cleanup

This article explains Alibaba Cloud's Log Service (SLS) soft‑delete feature, describing its mark‑and‑filter mechanism, implementation steps, and real‑world scenarios where it replaces costly hard‑delete or ETL solutions with near‑instant, low‑impact data removal for compliance, emergencies, and test‑data contamination.

Alibaba Cloud · Cloud Native · Data Governance
9 min read
DataFunTalk
Sep 29, 2025 · Big Data

How ByteHouse Cuts Data Warehouse Costs While Boosting Performance

This article examines the exploding data volumes that pressure modern enterprises, outlines the explicit (hardware, performance) and implicit (operations, migration) cost challenges of OLAP data warehouses, and presents ByteHouse’s cloud‑native architecture and features as a solution for cost reduction and efficiency gains.

ByteHouse · Cloud Native · Cost Optimization
6 min read
Alibaba Cloud Developer
Sep 28, 2025 · Cloud Native

How to Build Python Microservices with Zero‑Intrusion Service Discovery Using Nacos

This article introduces a Python‑centric microservice solution built on Nacos that brings non‑intrusive service registration, low‑threshold discovery, and flexible configuration to Python applications, offering code examples, CLI tools, import hooks, and middleware to achieve Java‑like ease of use in the cloud‑native era.

Backend Development · Cloud Native · Microservices
11 min read
Ray's Galactic Tech
Sep 27, 2025 · Backend Development

Spring Boot 4.0 Preview: Key New Features, Roadmap, and Upgrade Guide

Spring Boot 4.0, currently in preview with a GA slated for November 2025, introduces modular architecture, API versioning, BeanRegistrar, JSpecify null‑safety, GraalVM native image support, virtual threads, and up to 800% performance gains, while outlining a detailed release timeline and migration considerations for developers.

Backend Development · Cloud Native · Java
6 min read
Ops Community
Sep 26, 2025 · Cloud Native

Cut Your Kubernetes Cloud Bill by 50%: Proven Cost‑Optimization Tricks

This article reveals why Kubernetes can become a costly “money‑eater” and provides a step‑by‑step, data‑driven methodology—including resource profiling, Spot instance mixing, HPA/VPA pairing, smart scheduling, and FinOps practices—that can halve your cloud expenses within weeks.

Cloud Native · Cost Optimization · FinOps
14 min read
Programmer DD
Sep 26, 2025 · Backend Development

Spring Boot 4 & Spring Framework 7: Key Features, Code Samples, and Upgrade Guide

This article reviews the major enhancements in Spring Boot 4 and Spring Framework 7, covering baseline requirements, performance boosts, native image support, modularization, new annotations, API versioning, declarative HTTP clients, resilient method support, multiple TaskDecorators, and null‑safety improvements, with practical code examples to help developers plan their migration.

API Versioning · Cloud Native · Microservices
19 min read
DevOps Coach
Sep 24, 2025 · Cloud Native

Why Ingress Falls Short and How the New Kubernetes Gateway API Solves It

This article examines the shortcomings of traditional Kubernetes Ingress, introduces the more flexible Gateway API, explains its core components and role‑based architecture, and provides a step‑by‑step YAML example that migrates an Ingress configuration to Gateway API resources.

Cloud Native · Gateway API · Ingress
9 min read
dbaplus Community
Sep 23, 2025 · Cloud Native

Mastering Kubernetes in 2025: 6 Proven Strategies for Secure, Scalable Ops

This guide outlines six essential Kubernetes management practices for 2025—Infrastructure as Code, centralized monitoring and logging, SSL-offloaded ingress, RBAC with OIDC, GitOps deployments, and secret management—offering concrete steps, tool recommendations, and security tips to build reliable, automated cloud‑native clusters.

Cloud Native · DevOps · GitOps
7 min read
Alibaba Cloud Native
Sep 23, 2025 · Artificial Intelligence

Why Independent Runtime Agents Are the Future of Scalable AI Systems

The article explains how a configuration‑driven, cloud‑native architecture with independent runtime agents solves performance isolation, availability, scalability, security, and technology heterogeneity problems of low‑code platforms, and introduces a unified Agent Spec, Agent Studio, execution engine, A2A protocol, and dynamic governance to enable enterprise‑grade AI deployments.

Cloud Native · Dynamic Scaling · Microservices
29 min read
Volcano Engine Developer Services
Sep 23, 2025 · Information Security

How to Secure Model Context Protocol (MCP) in AI Ecosystems: Threats and Solutions

This article examines the security challenges of the Model Context Protocol (MCP) in AI applications, analyzes attack surface expansion across creation, runtime, and update phases, and presents a comprehensive AI‑enhanced scanning architecture with mitigation strategies to protect the entire AI ecosystem.

AI security · Cloud Native · MCP
23 min read
DevOps Coach
Sep 20, 2025 · Cloud Native

Why a Tiny Memory‑Intensive Process Caused 100× Latency Spikes After Pinterest’s Search Migration to Kubernetes

During Pinterest’s migration of its high‑traffic Manas search platform to the PinCompute Kubernetes environment, engineers observed an extremely rare latency outlier—one in a million requests took 100 times longer—prompting a deep investigation that traced the root cause to cAdvisor’s memory‑intensive smaps scans interfering with leaf node processing.

Cloud Native · Kubernetes · Memory Management
14 min read
IT Architects Alliance
Sep 20, 2025 · Operations

Mastering Microservice Governance: Tracing, Config, and Monitoring Strategies

This article explores the three core challenges of microservice governance—distributed tracing, centralized configuration management, and comprehensive monitoring—offering practical solutions, tool comparisons, and best‑practice guidelines to help architects build reliable, observable, and maintainable systems.

Cloud Native · Configuration Management · Distributed Tracing
12 min read
IT Architects Alliance
Sep 18, 2025 · Cloud Native

Why Containers, Kubernetes, and Service Mesh Are the Modern Cloud‑Native Trinity

An in‑depth look at how containers, Kubernetes, and Service Mesh together form the core of modern cloud‑native architectures, covering their evolution, practical adoption stages, trade‑offs in complexity, decision‑making matrices, best‑practice implementation tips, and emerging trends such as edge computing and WebAssembly.

Cloud Native · Containers · Kubernetes
10 min read
Ops Community
Sep 17, 2025 · Operations

Mastering System Fault Tolerance: From Theory to Production‑Ready High‑Availability

This comprehensive guide explores the philosophy, core patterns, and practical techniques for designing fault‑tolerant, highly available systems, covering circuit breakers, retries, rate limiting, monitoring, cloud‑native deployment, and real‑world case studies to help engineers build resilient production architectures.

Cloud Native · circuit breaker · fault tolerance
24 min read
Raymond Ops
Sep 16, 2025 · Cloud Native

How to Build a Secure High‑Availability Etcd Cluster on Linux

This guide walks through installing etcd, configuring a three‑node high‑availability cluster with TLS certificates, setting up host files, disabling SELinux and firewalld, creating a Certificate Authority using cfssl, generating node certificates, distributing them, and finally deploying and verifying the cluster on Linux systems.

Certificate · Cloud Native · Raft
19 min read
Ops Community
Sep 15, 2025 · Cloud Native

Master Kubernetes Log Collection: From Basics to Advanced EFK & Loki Solutions

This comprehensive guide explains why log management is critical for large Kubernetes clusters, outlines common pain points, presents full‑stack architectures, details EFK and Loki implementations with code samples, and offers performance, security, cost‑optimization, and future‑trend recommendations.

Cloud Native · EFK · Kubernetes
16 min read
IT Architects Alliance
Sep 13, 2025 · Cloud Computing

Why Multi‑Cloud Strategies Are Replacing Single‑Vendor Cloud Choices

Enterprises are shifting from single‑vendor cloud adoption to multi‑cloud and hybrid architectures, driven by concerns over vendor lock‑in, compliance, cost, performance, and resilience, with Kubernetes providing a unified layer to orchestrate workloads across AWS, Azure, and Alibaba Cloud.

Cloud Native · Cost Optimization · Kubernetes
10 min read
ITPUB
Sep 12, 2025 · Cloud Native

Why Deploying MySQL in Production Containers Is a Bad Idea

Deploying MySQL in production containers introduces volatile storage, unstable networking, performance variability, and replication challenges that clash with the database's need for persistent, high‑availability, and consistent operation, making native VM or bare‑metal deployments far more reliable.

Cloud Native · Containers · Database Deployment
7 min read
dbaplus Community
Sep 11, 2025 · Cloud Native

Building a Scalable Kubernetes Monitoring Architecture and Alert Management

This guide presents a comprehensive, layered Kubernetes monitoring architecture—including control plane, node, resource, and extension layers—detailing high‑availability Prometheus deployment, alert grouping strategies, custom CRD metrics, visualization dashboards, and practical best‑practice recommendations for reliable observability in cloud‑native environments.

Alerting · Cloud Native · Kubernetes
11 min read
MaGe Linux Operations
Sep 11, 2025 · Operations

Mastering Kubernetes Pod Lifecycle: Real‑World Troubleshooting Techniques

This comprehensive guide dissects every stage of the Kubernetes Pod lifecycle, explains underlying mechanisms, and equips operators with practical debugging commands, scripts, and best‑practice configurations to swiftly resolve common production issues such as pending pods, crash loops, slow startups, and network failures.

Cloud Native · Kubernetes · Pod Lifecycle
21 min read
Alibaba Cloud Native
Sep 10, 2025 · Information Security

How Alibaba Cloud SLS’s New mask Function Simplifies Large‑Scale Log Desensitization

In the AI era, massive interaction data drives rapid smart‑app growth, but personal privacy risks demand robust data‑masking; Alibaba Cloud Log Service (SLS) introduces a versatile mask function that replaces complex regex pipelines with concise configurations, boosting performance, reducing maintenance, and meeting strict compliance such as GDPR and China’s Personal Information Protection Law.

Cloud Native · Log Processing · SLS
12 min read
StarRocks
Sep 9, 2025 · Big Data

From Hadoop to StarRocks: Revamping a Government Procurement Data Platform

Facing massive data volumes, complex component dependencies, high TCO, and real‑time processing limits, the Zhengcaiyun (政采云) platform replaced its Hadoop stack with StarRocks’ minimalist, decoupled architecture, achieving lower costs, elastic scaling, faster queries, easier operations, and robust fault tolerance across diverse government procurement workloads.

Cloud Native · Cost Optimization · Data Warehouse
16 min read
Linux Ops Smart Journey
Sep 8, 2025 · Cloud Native

How Envoy’s Circuit Breakers and Outlier Detection Stop Service Avalanches

This article explains how Envoy’s circuit‑breaker and outlier‑detection features protect micro‑service architectures from avalanche failures by limiting concurrent connections, ejecting unhealthy instances, and provides configuration examples, testing methods, and best‑practice tips for building resilient cloud‑native systems.

Circuit Breaking · Cloud Native · Envoy
11 min read
macrozheng
Sep 8, 2025 · Backend Development

How to Refresh Spring Boot Configurations at Runtime Without Restart

This article explains why dynamic configuration refresh is needed in Spring Boot, details the @RefreshScope mechanism, provides step‑by‑step implementation code, and shares production best practices and troubleshooting tips for zero‑downtime config updates.

Backend Development · Cloud Native · Dynamic Configuration
10 min read
Ops Community
Sep 7, 2025 · Cloud Native

Master Helm: From Beginner to Pro – A Complete Guide to Standardized Kubernetes Deployments

This comprehensive guide walks you through the challenges of managing Kubernetes applications with Helm, explains core concepts like Charts, Releases, and Repositories, and provides detailed best‑practice patterns for chart design, multi‑environment values, CI/CD pipelines, security, monitoring, and disaster‑recovery to achieve standardized, reliable cloud‑native deployments.

Cloud Native · DevOps · Infrastructure as Code
29 min read
Ops Development Stories
Sep 4, 2025 · Cloud Native

Why Kubernetes HPA Ignores High CPU Usage and How Tolerance Affects Scaling

This article explains the internal architecture and source‑code flow of Kubernetes Horizontal Pod Autoscaler, detailing how components like HorizontalController and ReplicaCalculator compute desired replicas, why a default 10% tolerance can prevent scaling even when CPU exceeds the target, and how behavior policies and scaling limits influence HPA decisions.
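
The tolerance effect mentioned in this summary can be illustrated with a minimal sketch. It assumes the replica formula given in the Kubernetes documentation, desiredReplicas = ceil(currentReplicas × currentMetricValue / desiredMetricValue), and the default tolerance of 0.1. The function name `desired_replicas` is hypothetical, and the sketch ignores behavior policies, stabilization windows, min/max replica limits, and unready pods, so it is not the controller's actual code path.

```python
import math


def desired_replicas(current_replicas: int, current_metric: float,
                     target_metric: float, tolerance: float = 0.1) -> int:
    """Hypothetical helper: documented HPA formula plus the tolerance check."""
    ratio = current_metric / target_metric
    # If usage is within the tolerance band around the target,
    # the replica count is left unchanged.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    return math.ceil(current_replicas * ratio)


# CPU at 54% against a 50% target: ratio 1.08 is inside the 10% band.
print(desired_replicas(4, 54, 50))
# CPU at 60% against a 50% target: ratio 1.2 is outside the band.
print(desired_replicas(4, 60, 50))
```

In the first call usage exceeds the target, yet the replica count stays at 4 because the ratio falls inside the band; the second call scales to 5.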

Cloud Native · HPA · Horizontal Pod Autoscaler
16 min read
Code Ape Tech Column
Sep 4, 2025 · Backend Development

10 Essential Microservice Best Practices to Boost Efficiency and Security

This article outlines ten practical microservice best practices—including single responsibility, cross‑functional teams, proper tooling, asynchronous communication, DevSecOps, independent data stores, isolated deployment, orchestration, and monitoring—to help developers build scalable, maintainable, and secure backend systems.

Backend Architecture · Cloud Native · Microservices
13 min read
AI Large Model Application Practice
Sep 4, 2025 · Artificial Intelligence

Can Message Queues Power the Next Generation of AI Agents? A Deep Dive into Pulsar

This article examines how traditional high‑performance message queues and event‑driven architectures can be revitalized for AI agents, tracing the evolution of messaging middleware, highlighting key integration points, and showcasing Apache Pulsar's cloud‑native features that enable reliable, scalable, and intelligent multi‑agent systems.

AI Agent · Apache Pulsar · Cloud Native
16 min read
Architecture Digest
Sep 2, 2025 · Cloud Native

Is Docker Still the Best Choice? Discover the Future of Containerization

This article examines Docker's diminishing universality, its historical impact, current limitations, and emerging alternatives such as lightweight runtimes, micro‑Kubernetes, and serverless platforms, guiding developers toward more efficient, secure, and adaptable container strategies for 2025 and beyond.

Cloud Native · DevOps · Docker
8 min read
Alibaba Cloud Native
Sep 1, 2025 · Cloud Native

Why Browser Automation Needs a Sandbox: Building a Secure Cloud‑Native AI Agent Environment

This article traces the evolution of browser automation from Selenium to Playwright, highlights emerging AI‑agent security risks such as prompt injection and credential theft, and provides a detailed cloud‑native sandbox architecture with deployment steps, usage examples for Playwright, Puppeteer, REST APIs, and code snippets for secure AI‑driven web interactions.

Browser Automation · Cloud Native · Function Compute
21 min read
Linux Ops Smart Journey
Sep 1, 2025 · Cloud Native

Master Envoy Proxy in 5 Minutes: Build, Configure, and Test a Service Mesh

This tutorial introduces Envoy Proxy, explains its core building blocks—listeners, routes, clusters, and filters—demonstrates a quick Docker-based setup, and walks through a complete configuration that routes traffic based on URI paths, providing a practical five‑minute overview of service‑mesh fundamentals.

Cloud Native · Docker · Envoy
8 min read
ITPUB
Aug 31, 2025 · Cloud Native

Is Docker Still the King? Exploring the Next Generation of Container Tools

The article examines Docker's historic impact, its growing limitations in modern, large‑scale and diverse development environments, and surveys emerging lightweight runtimes, local development alternatives like ServBay, evolving orchestration options, and future trends shaping a more modular, secure, and AI‑driven container ecosystem.

Cloud Native · Docker · Kubernetes
9 min read
Alibaba Cloud Native
Aug 29, 2025 · Artificial Intelligence

Build AI Apps with Natural Language Using VibeCoding: A Hands‑On Guide

This article demonstrates how to create diverse AI applications—such as a children’s picture‑book generator, a guessing‑game app, and an enterprise website with knowledge‑base Q&A—entirely through natural‑language prompts in VibeCoding, covering the setup, step‑by‑step construction, code‑free publishing, integration of various AI services, deployment configurations, and practical tips for reliable development.

AI Development · AI integration · Cloud Native
17 min read
Alibaba Cloud Infrastructure
Aug 28, 2025 · Cloud Native

How to Upgrade Ztunnel in ASM Ambient Mode Without Traffic Disruption

This article explains the Ztunnel upgrade process in Alibaba Service Mesh Ambient mode, details the rolling and graceful‑shutdown mechanisms, presents performance test results with and without graceful shutdown, and offers best‑practice recommendations to minimize traffic interruption during upgrades.

Cloud Native · Graceful Shutdown · Performance Testing
10 min read
Alibaba Cloud Native
Aug 27, 2025 · Artificial Intelligence

How LoongSuite Enables Full‑Stack Observability for LLM Applications

The article explains the rapid evolution of the AI application ecosystem, outlines the challenges of end‑to‑end observability for large‑language‑model services, and details how the open‑source LoongSuite suite—through non‑intrusive instrumentation for Python and Go agents and tight integration with the Dify platform—provides comprehensive, cloud‑native monitoring, tracing, and metric collection across the entire AI stack.

AI · Cloud Native · Dify
19 min read
Code Wrench
Aug 25, 2025 · Cloud Native

Master Docker for Go Microservices: Real‑World Project Walkthrough

This article walks through a complete Go microservice project, demonstrating Dockerfile optimization, advanced docker‑compose techniques, essential commands, and best‑practice deployment strategies to help you master containerizing microservices.

Cloud Native · Docker · Docker Compose
10 min read
Alibaba Cloud Native
Aug 19, 2025 · Artificial Intelligence

Boost Dify AI App Performance with Higress AI Gateway: A Full-Scale High‑Availability Guide

This guide explains why Dify’s system components and model services become performance bottlenecks at scale, and how integrating the Higress AI gateway can provide protocol standardization, observability, security, and stability features to achieve full‑stack high availability for AI applications.

AI gateway · Cloud Native · Dify
0 likes · 16 min read
Alibaba Cloud Observability
Aug 18, 2025 · Information Security

How to Use MCP Server for Real‑Time AccessKey Security Audits in Cloud Native Environments

This guide walks through a real‑time security investigation of suspicious AccessKey activity using Alibaba Cloud's MCP server, detailing environment setup, JSON configuration, natural‑language queries, and four practical audit scenarios—including tracking AK usage, identifying high‑risk operations, monitoring root account activity, and summarizing recent cloud service events.

AccessKey · Alibaba Cloud · Cloud Native
0 likes · 8 min read
Alibaba Cloud Infrastructure
Aug 11, 2025 · Cloud Native

Simplify Kubernetes Egress with ASM Ambient Mode and Waypoint

Learn how ASM's Ambient mode and the Waypoint component streamline L7 egress traffic management in Kubernetes by replacing complex Sidecar configurations with simple ServiceEntry and Waypoint labels, reducing configuration overhead while preserving powerful security and observability features.

Cloud Native · Egress Traffic · Kubernetes
0 likes · 7 min read
Alibaba Cloud Native
Aug 8, 2025 · Cloud Native

How Cloud‑Native Architecture Powers Global Game Publishing at Lingxi Interactive

Lingxi Interactive transformed its overseas game publishing by adopting a cloud‑native infrastructure built on Alibaba Cloud ACK and creating a unified platform with the KUN ops layer that automates scaling, monitoring, and FinOps. The result was better stability and efficiency at lower cost, with support for diverse game genres worldwide.

Automation · Cloud Native · FinOps
0 likes · 12 min read
Alibaba Cloud Big Data AI Platform
Aug 7, 2025 · Operations

How Alibaba Scales Flink to Millions of Cores: Real‑Time Ops Secrets

This article details Alibaba's decade‑long evolution of its real‑time computing platform, the massive operational challenges of managing Flink clusters at million‑core scale, and the comprehensive strategies—including SLA metrics, self‑healing services, cloud‑native redesign, and job‑level advisory tools—used to ensure stability, cost efficiency, and performance during peak events like Double‑11.

Apache Flink · Cloud Native · Job Advisory
0 likes · 19 min read
Alibaba Cloud Big Data AI Platform
Aug 7, 2025 · Cloud Native

Mastering Helm: Simplify Kubernetes Deployments with Powerful Package Management

Helm, the Kubernetes package manager, streamlines the creation, packaging, configuration, and deployment of complex applications by bundling multiple YAML resources into reusable charts. The article covers lifecycle management, the command‑line workflow, ecosystem plugins, secret handling, and debugging tools, and weighs Helm's advantages and limitations within SREWorks.

Cloud Native · SREWorks · helm
0 likes · 12 min read
Didi Tech
Aug 7, 2025 · Cloud Native

How HUATUO Revolutionizes Cloud‑Native Observability with Zero‑Impact BPF Tracing

HUATUO, Didi's open‑source cloud‑native observability project, leverages BPF‑based low‑overhead kernel tracing, unified metric and event frameworks, automatic flame‑graph generation, and seamless integration with Prometheus, Grafana and Elasticsearch to provide panoramic, zero‑intrusive monitoring and continuous performance profiling for complex production environments.

BPF · Cloud Native · Distributed Systems
0 likes · 11 min read
Alibaba Cloud Native
Aug 7, 2025 · Cloud Native

How Dubbo’s Cloud‑Native Upgrade Enables One‑Click Cluster Deployment and a New Visual Console

Apache Dubbo’s latest cloud‑native release introduces one‑click microservice cluster deployment, a brand‑new visual console, Helm chart support, and the Dubbo Operator, providing developers and operators with streamlined deployment, comprehensive lifecycle management, and automated Kubernetes‑native operations.

Cloud Native · Dubbo · Microservices
0 likes · 8 min read
Practical DevOps Architecture
Aug 7, 2025 · Cloud Native

How to Rebuild Harbor and Sync Images to a New Registry

This guide explains why the original Harbor server needed to be taken offline, walks through setting up a new Harbor instance, and details the step‑by‑step configuration of source and replication repositories plus enabling the replication rule to synchronize images.

Cloud Native · Container Registry · image replication
0 likes · 2 min read
Practical DevOps Architecture
Aug 7, 2025 · Cloud Native

Step-by-Step Guide to Install Docker, Docker‑Compose, and Harbor on CentOS

This tutorial walks you through removing old Docker packages, configuring yum repositories (official and Alibaba Cloud mirrors), installing a specific Docker Engine version, setting up image acceleration, verifying Docker, installing Docker‑Compose, downloading and configuring Harbor, and finally validating the Harbor registry on a CentOS system.

CentOS · Cloud Native · Container Registry
0 likes · 5 min read
Alibaba Cloud Big Data AI Platform
Aug 6, 2025 · Cloud Native

Demystifying Kubernetes Pods: From YAML to Full Lifecycle

This article walks through the complete structure of a Kubernetes Pod—covering its Resource, Object, Spec, and Status sections, the corresponding REST API paths, metadata fields, lifecycle phases, scheduling strategies, and status reporting—providing a clear, step‑by‑step guide for both beginners and experienced users.

Cloud Native · Pod · Resource Orchestration
0 likes · 15 min read
Alibaba Cloud Big Data AI Platform
Aug 6, 2025 · Operations

How Alibaba Cloud’s Serverless Elasticsearch Powers Data‑Driven Operations

Alibaba Cloud’s Serverless Elasticsearch service, combined with the SREWorks data‑driven operations platform, offers a cloud‑native, real‑time search and analytics engine that integrates metric and log collection, cost management, and health monitoring to enhance scalability, performance, and operational efficiency for enterprise applications.

Cloud Native · DataOps · Elasticsearch
0 likes · 11 min read
Alibaba Cloud Big Data AI Platform
Aug 6, 2025 · Cloud Native

What’s New in SREWorks v1.1? Streamlined Deployment, Plug‑in Storage, and Low‑Code Templates

SREWorks v1.1 introduces a lightweight base edition, plug‑in support for ElasticSearch, MySQL and MinIO, a low‑code page template center, default OpenEBS‑based StorageClass, numerous bug fixes, and a straightforward upgrade path, all aimed at simplifying cloud‑native operations platforms.

Cloud Native · SREWorks · StorageClass
0 likes · 8 min read
Alibaba Cloud Big Data AI Platform
Aug 5, 2025 · Operations

How Alibaba’s Open‑Source SREWorks Transforms Cloud‑Native Data Operations

Alibaba's SREWorks platform, now open‑source, combines cloud‑native architecture, DataOps and AIOps to address the growing complexity of big‑data and AI operations, offering a layered SaaS/PaaS/IaaS solution that streamlines delivery, monitoring, management, control, operation, and service for modern enterprises.

Cloud Native · DataOps · Operations
0 likes · 10 min read
DevOps Engineer
Aug 3, 2025 · Cloud Native

Why KubeSphere’s Sudden Open‑Source Shutdown Shook the Cloud‑Native Community

After QingCloud (Qingyun Technology) abruptly halted downloads and support for the open‑source KubeSphere distribution, users faced broken pipelines, missing images, and a trust collapse, prompting fierce community backlash and raising questions about open‑source business models, licensing changes, and the sustainability of cloud‑native projects.

Business Model · Cloud Native · KubeSphere
0 likes · 14 min read
TAL Education Technology
Jul 31, 2025 · Databases

How Milvus Powers Billion-Scale Vector Search for AI at TAL Education

This article explains how TAL Education leverages the open‑source Milvus vector database—covering its architecture, features, cloud‑native deployment, monitoring, and real‑world AI applications such as intelligent grading and multimodal search—to handle billions of vectors with millisecond‑level similarity retrieval.

AI · Cloud Native · Education Technology
0 likes · 14 min read
Alibaba Cloud Native
Jul 26, 2025 · Cloud Native

How OpenKruiseGame Solves the Last Mile of Cloud‑Native Game Connection Governance

This article explains how cloud‑native game services can achieve precise, state‑aware connection management and graceful shutdown by combining OpenKruiseGame with a cloud‑native API gateway, detailing the challenges of Layer 7 networking, the custom lifecycle hooks, deployment steps, and the benefits of zero‑downtime configuration changes.

Cloud Native · Connection Management · OpenKruiseGame
0 likes · 11 min read
DevOps Operations Practice
Jul 22, 2025 · Operations

Top 7 DevOps Best Practices to Accelerate Delivery and Boost Reliability

These seven essential DevOps best practices—from cultural transformation and full automation to continuous integration, observability, security, cloud-native microservices, and performance optimization—guide teams in accelerating software delivery, enhancing quality, ensuring reliability, and reducing costs through collaborative, automated, and measurable processes.

Automation · Cloud Native · DevOps
0 likes · 4 min read
Alibaba Cloud Observability
Jul 21, 2025 · Artificial Intelligence

How AI Agent Architecture Is Transforming Software Engineering Practices

The article explores the evolution of AI Agent technical architecture, its four core capability dimensions, the pivotal role of observability and security, and introduces LoongSuite as a high‑performance, low‑cost observability suite that supports multi‑language agents and future cloud‑native AI developments.

AI Agent · Cloud Native · LoongSuite
0 likes · 18 min read
Liangxu Linux
Jul 20, 2025 · Cloud Native

Master Helm Repository Management: Add, Update, Search, Pull, and Push Charts

This guide explains Helm repository concepts, lists common public and private repo types, provides URLs for official sources, and details step‑by‑step commands for adding, updating, listing, removing, searching, version‑checking, pulling charts, and managing private repositories with index creation and chart pushing.

Chart Repository · Cloud Native · Kubernetes
0 likes · 7 min read
Youzan Coder
Jul 18, 2025 · Cloud Native

How Mixed Workloads Boost Kubernetes CPU Utilization by Over 40%

This article explains how Youzan transformed its Kubernetes clusters from static over‑commit scheduling to load‑balanced mixed workloads using Koordinator and the Anolis (Longxi) kernel, achieving higher CPU utilization, lower costs, and better resource management for both online and offline services.

Big Data · Cloud Native · Koordinator
0 likes · 10 min read