Tagged articles
3116 articles
Page 1 of 32
ITPUB
ITPUB
May 11, 2026 · Databases

What Human Evolution Teaches About IT Architecture Trade‑offs (Ahead of the 2026 SACC)

The article draws a detailed analogy between millions of years of human evolution—standing up, shedding hair, expanding the brain, and recruiting ancient bacteria—and modern IT architecture, showing how each design choice brings hidden costs, why perfect systems are impossible, and how embracing trade‑offs, extensions, and continuous iteration can lead to resilient, self‑healing databases.

Cloud Nativearchitecturedatabases
0 likes · 22 min read
What Human Evolution Teaches About IT Architecture Trade‑offs (Ahead of the 2026 SACC)
DataFunTalk
DataFunTalk
May 8, 2026 · Big Data

How MaxCompute Evolves into a Data+AI Platform: Architecture, Core Capabilities, and Real-World Cases

The article explains how Alibaba Cloud's MaxCompute has been transformed into a cloud‑native Data+AI platform, detailing its layered architecture, multimodal storage, model management, hybrid compute scheduling, SQL AI functions, the MaxFrame Python framework, and several enterprise case studies that demonstrate performance gains and flexible resource orchestration.

AI integrationBig DataCloud Native
0 likes · 11 min read
How MaxCompute Evolves into a Data+AI Platform: Architecture, Core Capabilities, and Real-World Cases
DataFunSummit
DataFunSummit
May 3, 2026 · Databases

ScopeDB: Real-Time Data Analytics Solution for the Cloud‑Native Era

ScopeDB introduces a cloud‑native, real‑time analytics database that combines structured core columns with a flexible JSON column, adaptive indexing, a custom query language (ScopeQL), and true compute‑storage separation, delivering sub‑second query latency, high throughput, and up to 70% cost reduction compared with traditional big‑data stacks.

Cloud NativeReal-time analyticsScopeDB
0 likes · 14 min read
ScopeDB: Real-Time Data Analytics Solution for the Cloud‑Native Era
DataFunSummit
DataFunSummit
May 2, 2026 · Cloud Native

GooseFS + Lance: Accelerating Vector Storage for the AI Era

The article explains how GooseFS integrates with the Lance vector format to overcome the IO bottlenecks of object storage, detailing native acceleration mechanisms such as namespace catalog services, event‑driven warm caching, automatic compaction, native transactions, and page‑level caching that together deliver up to three‑fold performance gains for AI workloads.

AICache AccelerationCloud Native
0 likes · 12 min read
GooseFS + Lance: Accelerating Vector Storage for the AI Era
Architecture & Thinking
Architecture & Thinking
Apr 30, 2026 · Cloud Native

How RocketMQ 5.0’s New Proxy Layer Enables Compute‑Storage Separation and Cloud‑Native Scaling

RocketMQ 5.0 replaces the monolithic Broker with a stateless Proxy layer that decouples compute from storage, solves scalability, multi‑protocol and cloud‑native adaptation challenges, and is demonstrated through detailed architecture comparisons, Java code samples, and two real‑world IoT and finance case studies showing significant performance and cost benefits.

Cloud NativeCompute-Storage SeparationMessage Queue
0 likes · 20 min read
How RocketMQ 5.0’s New Proxy Layer Enables Compute‑Storage Separation and Cloud‑Native Scaling
Alibaba Cloud Native
Alibaba Cloud Native
Apr 29, 2026 · Artificial Intelligence

Evolving Agent Development: Simplifying Multi‑Source Real‑Time Context from an Environment‑Engineering Perspective

The article analyzes why AI coding agents thrive in software engineering while agents in other industries lag, identifies context‑supply as the core bottleneck, and proposes a five‑dimensional framework—information completeness, sensory management, knowledge reconciliation, change governance, and accessibility—illustrated with EventHouse’s polling, event subscription, and mount‑query approaches, unified catalog, knowledge wiki, and CI/CD‑style release to make enterprise agents simple, reliable, and production‑ready.

AI agentsCI/CD for AICloud Native
0 likes · 15 min read
Evolving Agent Development: Simplifying Multi‑Source Real‑Time Context from an Environment‑Engineering Perspective
Alibaba Cloud Observability
Alibaba Cloud Observability
Apr 27, 2026 · Operations

Scaling Humanoid Robot Operations: Insights from the Human‑Robot Half‑Marathon

The half‑marathon race of over 300 humanoid robots highlighted three core operational bottlenecks—environmental uncertainty, hidden hardware‑software coupling risks, and outdated maintenance models—prompting a cloud‑native observability solution that combines metrics, tracing, and log governance to enable predictive, tiered fault handling for large‑scale deployments.

Cloud NativeEdge ComputingHumanoid Robots
0 likes · 15 min read
Scaling Humanoid Robot Operations: Insights from the Human‑Robot Half‑Marathon
ITPUB
ITPUB
Apr 27, 2026 · Cloud Native

Why Skipping Backups Makes Kubernetes Operations Impossible

The article explains that running production Kubernetes clusters without regular backup and recovery plans exposes businesses to severe risks such as cluster failures, data loss, and prolonged downtime, and it details practical etcd physical and Velero logical backup strategies to mitigate these threats.

BackupCloud NativeKubernetes
0 likes · 9 min read
Why Skipping Backups Makes Kubernetes Operations Impossible
DevOps Coach
DevOps Coach
Apr 26, 2026 · Cloud Native

Accelerating Kubernetes Automation: Mastering GitOps Best Practices

This guide explains GitOps fundamentals—declarative, versioned, automated deployments—and shows how tools like Argo CD, Flux, Helm, Kustomize, Tekton, and Sealed Secrets can speed up Kubernetes delivery, improve reliability, enhance security, and foster better collaboration across DevOps teams.

Argo CDCloud NativeGitOps
0 likes · 16 min read
Accelerating Kubernetes Automation: Mastering GitOps Best Practices
Ray's Galactic Tech
Ray's Galactic Tech
Apr 23, 2026 · Artificial Intelligence

From Black‑Box to Explainable: Cloud‑Native AI Demand Engineering for Life‑Insurance

This guide explains why life‑insurance AI must move beyond black‑box recommendations, outlines eight production‑grade requirements, and presents a cloud‑native architecture that combines GraphRAG, rule engines, AI orchestration, observability, security, and Kubernetes to deliver explainable, auditable underwriting decisions.

Backend DevelopmentCloud NativeOperations
0 likes · 37 min read
From Black‑Box to Explainable: Cloud‑Native AI Demand Engineering for Life‑Insurance
vivo Internet Technology
vivo Internet Technology
Apr 22, 2026 · Cloud Native

Deep Dive into Vivo Activity System Globalization: Boost Reuse, Cut Maintenance

The article details Vivo's activity system globalization, covering multilingual support, multi‑timezone handling, multi‑tenant architecture, multi‑datacenter deployment, and a unified global code line, showing how these measures increase component reuse, lower maintenance costs, and improve global service stability.

Cloud Nativearchitectureglobalization
0 likes · 17 min read
Deep Dive into Vivo Activity System Globalization: Boost Reuse, Cut Maintenance
Airbnb Technology Team
Airbnb Technology Team
Apr 22, 2026 · Backend Development

How Airbnb Rebuilt Its Next‑Generation Key‑Value Store

Airbnb completely rewrote its internal key‑value store Mussel, moving from a legacy V1 system to a cloud‑native, NewSQL‑backed V2 that reduces operational complexity, improves scalability, adds flexible consistency, and supports massive batch imports, all while achieving zero‑downtime migration of over 1 PB of data.

AirbnbCloud NativeStorage Engine
0 likes · 16 min read
How Airbnb Rebuilt Its Next‑Generation Key‑Value Store
Alibaba Cloud Native
Alibaba Cloud Native
Apr 22, 2026 · Artificial Intelligence

Can OpenClaw and Hermes Work Together? 500+ Community Insights Reveal the Truth

Analyzing over 500 community comments from Reddit, Hacker News, and X, this report compares OpenClaw and Hermes Agent, outlines their architectural differences, lists each platform's strengths and weaknesses, quantifies user adoption and cost, and proposes best‑practice patterns for combining the two in cloud‑native AI workflows.

AI agentsCloud NativeCommunity analysis
0 likes · 20 min read
Can OpenClaw and Hermes Work Together? 500+ Community Insights Reveal the Truth
Alibaba Cloud Native
Alibaba Cloud Native
Apr 21, 2026 · Cloud Native

Why Alibaba Cloud’s AgentRun Is Redefining Managed AI Agents for Enterprises

AgentRun offers a cloud‑native, serverless platform that abstracts the full lifecycle of AI agents—definition, runtime, session, and event stream—while providing enterprise‑grade features such as model‑agnostic services, data‑in‑region networking, unified credential management, multi‑tenant isolation, full‑stack observability, and elastic scaling.

AI agentsCloud NativeEnterprise AI
0 likes · 16 min read
Why Alibaba Cloud’s AgentRun Is Redefining Managed AI Agents for Enterprises
Alibaba Cloud Native
Alibaba Cloud Native
Apr 20, 2026 · Operations

How Cloud‑Native Observability Powers Scalable Humanoid Robot Fleets

The article analyzes the unprecedented challenges of operating hundreds of humanoid robots in outdoor, network‑unstable, and heterogeneous environments, and demonstrates how Alibaba Cloud's unified observability stack—combining metric monitoring, distributed tracing, and log governance—delivers a standardized, reusable, and edge‑aware operations framework for large‑scale embodied AI deployments.

AIAlibaba CloudCloud Native
0 likes · 13 min read
How Cloud‑Native Observability Powers Scalable Humanoid Robot Fleets
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 20, 2026 · Information Security

How Immutable Rootfs and Remove Shell Reinvent Container Node Security

When Kubernetes becomes the backbone of enterprise applications and AI workloads, securing the container node OS is critical; ContainerOS tackles this by introducing an immutable root filesystem and eliminating the shell, thereby creating a read‑only, attack‑surface‑free environment that simplifies operations, guarantees integrity, and reduces costs.

Cloud NativeContainerOSImmutable Rootfs
0 likes · 7 min read
How Immutable Rootfs and Remove Shell Reinvent Container Node Security
Ray's Galactic Tech
Ray's Galactic Tech
Apr 19, 2026 · Cloud Native

Building a Production‑Ready Cloud‑Native Kubernetes Platform: From Zero to SRE Success

This article presents a step‑by‑step guide to designing and implementing a production‑grade Kubernetes platform with GitOps, observability, capacity governance, fault‑injection, and SRE practices, showing how to achieve unified delivery, reliability, and low‑cost operation for high‑concurrency business services.

Cloud NativeGitOpsInfrastructure
0 likes · 37 min read
Building a Production‑Ready Cloud‑Native Kubernetes Platform: From Zero to SRE Success
MaGe Linux Operations
MaGe Linux Operations
Apr 19, 2026 · Cloud Native

Unlock the Full Deployment‑to‑Service Workflow in Kubernetes

This comprehensive guide walks operators through the entire Kubernetes workflow from creating a Deployment to exposing a Service, explaining core resources, control loops, scheduling, networking, rolling updates, troubleshooting steps, best‑practice configurations, performance tuning, and security hardening.

Cloud NativeDeploymentKubernetes
0 likes · 29 min read
Unlock the Full Deployment‑to‑Service Workflow in Kubernetes
Advanced AI Application Practice
Advanced AI Application Practice
Apr 16, 2026 · Artificial Intelligence

Can AI Deliver Scalable, High‑Quality Test Assets for Enterprises?

The article analyzes enterprise testing challenges and presents the AIO intelligent testing platform, which combines cloud‑native architecture, MLLM‑RAG dual engines, and a knowledge‑graph to automate test case generation, improve coverage, and cut maintenance costs, backed by concrete benchmarks and multi‑modal inputs.

AI testingCloud NativeKnowledge Graph
0 likes · 18 min read
Can AI Deliver Scalable, High‑Quality Test Assets for Enterprises?
Woodpecker Software Testing
Woodpecker Software Testing
Apr 15, 2026 · Operations

Automating Performance Test Cases: A Practical Guide to Overcome Bottlenecks

With microservices and cloud‑native workloads, manual performance test case creation consumes most testing time; this article details a four‑step method—traffic profiling, boundary stress injection, data factory integration, and smart script orchestration—to automatically generate realistic JMeter scripts, avoid common pitfalls, and embed performance contracts into CI/CD.

Cloud NativeJMeterMicroservices
0 likes · 9 min read
Automating Performance Test Cases: A Practical Guide to Overcome Bottlenecks
Architect Chen
Architect Chen
Apr 15, 2026 · Cloud Native

Choosing the Right Microservice Deployment: Multi-Instance, Containers, Serverless & Kubernetes

This article compares four microservice deployment strategies—single-host multi-instance, containerized with Kubernetes, serverless functions, and full orchestration—detailing their architectures, benefits, drawbacks, and suitable scenarios, helping engineers select the most appropriate approach for scalability, reliability, and operational complexity.

Cloud NativeMicroservicesServerless
0 likes · 4 min read
Choosing the Right Microservice Deployment: Multi-Instance, Containers, Serverless & Kubernetes
IT Services Circle
IT Services Circle
Apr 11, 2026 · Databases

Why Sharding Isn’t Dead: Modern Alternatives and When to Use Them

The article revisits the rise and fall of database sharding, explains why it became problematic, and evaluates newer cloud‑native, distributed‑SQL, and serverless databases as modern replacements, offering a practical four‑step guide to help engineers choose the right solution for their workload and team.

Cloud NativeDistributed SQLOperations
0 likes · 23 min read
Why Sharding Isn’t Dead: Modern Alternatives and When to Use Them
Architect Chen
Architect Chen
Apr 9, 2026 · Cloud Native

Understanding Docker: Architecture, Namespaces, cgroups, and UnionFS

This article explains Docker's client‑server architecture, core isolation mechanisms like Linux namespaces and cgroups, the layered UnionFS image model, and how libcontainer/runc creates containers, providing a comprehensive technical overview of Docker as a cloud‑native platform.

Cloud NativeDockerDocker Architecture
0 likes · 5 min read
Understanding Docker: Architecture, Namespaces, cgroups, and UnionFS
Alibaba Cloud Native
Alibaba Cloud Native
Apr 8, 2026 · Operations

How HiClaw Transforms SRE with Multi‑Agent Collaboration in Cloud‑Native Environments

The article details how the HiClaw distributed multi‑agent platform is built and organized for SRE teams, explains the roles of human users and digital bots, describes permission design, showcases fault‑diagnosis and release scenarios, and evaluates the efficiency and innovation gains of this cloud‑native automation approach.

AI OpsAutomationCloud Native
0 likes · 14 min read
How HiClaw Transforms SRE with Multi‑Agent Collaboration in Cloud‑Native Environments
Alibaba Cloud Observability
Alibaba Cloud Observability
Apr 6, 2026 · Cloud Native

How Alibaba Cloud Built Real‑Time OpenAPI Monitoring with Flink + SLS

This article details the design and implementation of a cloud‑native, real‑time monitoring system for Alibaba Cloud OpenAPI, covering background challenges, a Flink‑SLS architecture, multi‑region data processing, checkpoint and state‑backend tuning, source‑side predicate pushdown, visualization with Grafana, and production results.

Big DataCloud NativeFlink
0 likes · 21 min read
How Alibaba Cloud Built Real‑Time OpenAPI Monitoring with Flink + SLS
DevOps in Software Development
DevOps in Software Development
Apr 6, 2026 · Industry Insights

How Lockheed Martin Turns Its Software Factory into Continuous Combat Capability

This article analyses Lockheed Martin’s publicly disclosed software‑factory approach, showing how it prioritises sustained delivery of mission‑critical capabilities through tightly integrated DevSecOps, standardized environments, cloud‑native and GitOps practices, and an evolving focus on system interoperability.

Cloud NativeContinuous DeliveryDevSecOps
0 likes · 6 min read
How Lockheed Martin Turns Its Software Factory into Continuous Combat Capability
Alibaba Cloud Native
Alibaba Cloud Native
Apr 5, 2026 · Operations

How OpenClaw CMS Plugin v0.1.2 Turns Agent Tracing into Precise, Cost‑Effective Observability

The OpenClaw CMS observability plugin v0.1.2 solves the hidden‑trace problem by fully restoring multi‑round LLM execution, stabilizing concurrent chains, and introducing granular agent metrics, enabling developers, testers, and operators to debug faster, assess costs accurately, and improve cross‑team collaboration.

AgentCloud NativeObservability
0 likes · 8 min read
How OpenClaw CMS Plugin v0.1.2 Turns Agent Tracing into Precise, Cost‑Effective Observability
Alibaba Cloud Native
Alibaba Cloud Native
Apr 3, 2026 · Information Security

How a Supply‑Chain Poisoning of LiteLLM Exposed Critical AI API Secrets – and What to Do

A March 2026 supply‑chain attack injected malicious code into LiteLLM versions 1.82.7/1.82.8, silently stealing API keys, SSH credentials, cloud tokens and more, while a cloud‑native AI gateway from Alibaba offers a secure, zero‑exposure alternative and detailed remediation steps.

AI securityAPI Key LeakageAlibaba Cloud AI Gateway
0 likes · 14 min read
How a Supply‑Chain Poisoning of LiteLLM Exposed Critical AI API Secrets – and What to Do
Alibaba Cloud Native
Alibaba Cloud Native
Apr 2, 2026 · Industry Insights

How EventHouse Redefines AI‑Native Event Data Platforms for the Agent Era

EventHouse, Alibaba Cloud’s AI‑native data platform, unifies event ingestion, storage, governance and intelligent analysis through a layered architecture that supports real‑time SQL, zero‑ETL federation and Luma Agent‑driven conversational analytics, positioning it as a next‑generation AI data foundation for enterprises seeking to turn event streams into actionable insights.

AI Data PlatformAgentic AICloud Native
0 likes · 16 min read
How EventHouse Redefines AI‑Native Event Data Platforms for the Agent Era
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Apr 2, 2026 · Cloud Native

How Kthena Enables Production‑Grade LLM Inference on Kubernetes

This article analyzes the cloud‑native challenges of deploying large‑model inference on Kubernetes and presents Kthena’s architecture—ModelServing, Router, Autoscaler, and ModelBooster—along with Volcano integration, vLLM‑Ascend setup, and a real‑world Qwen3‑235B deployment case, highlighting performance gains and future directions.

Cloud NativeKthenaKubernetes
0 likes · 13 min read
How Kthena Enables Production‑Grade LLM Inference on Kubernetes
java1234
java1234
Apr 2, 2026 · Cloud Native

How a Simple Analogy Clarified Docker and Kubernetes Core Concepts

An image is a static snapshot of an OS, runtime and code; a container runs that snapshot, while Dockerfile and docker‑compose define how to build and orchestrate images. Pods group containers for shared resources, and Kubernetes schedules, scales, heals, networks and stores them, enabling true “run anywhere” deployment.

Cloud NativeContainersDocker
0 likes · 6 min read
How a Simple Analogy Clarified Docker and Kubernetes Core Concepts
Java Architect Essentials
Java Architect Essentials
Apr 2, 2026 · Backend Development

Why Micronaut Beats Spring Boot: Faster Startup, Lower Memory, Cloud‑Native Edge

This article analyzes Micronaut's design philosophy, performance advantages in startup time and memory usage, and its built‑in cloud‑native features—such as distributed configuration, service discovery, client load balancing, tracing, and serverless support—while providing step‑by‑step installation and code examples for Java developers.

Cloud NativeJavaMicronaut
0 likes · 11 min read
Why Micronaut Beats Spring Boot: Faster Startup, Lower Memory, Cloud‑Native Edge
Big Data Tech Team
Big Data Tech Team
Apr 1, 2026 · Big Data

Why Your 2026 Big Data Resume Is Being Ignored and How to Fix It

In the 2026 spring hiring season, many big‑data job seekers see their resumes disappear because they still focus on offline batch processing, while employers now demand real‑time streaming, AI‑driven data pipelines, and cloud‑native deployment skills such as Flink, vector databases, and Kubernetes.

AI integrationBig DataCloud Native
0 likes · 7 min read
Why Your 2026 Big Data Resume Is Being Ignored and How to Fix It
DevOps Coach
DevOps Coach
Mar 28, 2026 · Cloud Native

Why the Twelve-Factor App is Essential for Modern Cloud‑Native Development

The article explains how the Twelve‑Factor App methodology, created by Heroku’s Adam Wiggins, provides a set of core principles that prevent common production failures and form the foundation for modern tools like Docker, Kubernetes, and CI/CD pipelines, enabling reliable, scalable, and maintainable software.

Cloud NativeDevOpsDocker
0 likes · 22 min read
Why the Twelve-Factor App is Essential for Modern Cloud‑Native Development
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Mar 27, 2026 · Cloud Native

How AutoMQ Transforms Kafka into a Cloud‑Native, Elastic Messaging Service

This article examines the limitations of traditional Kafka in large‑scale deployments and presents AutoMQ’s cloud‑native redesign—detailing its stateless architecture, storage separation, automatic scaling, read/write isolation, performance benchmarks, and real‑world migration case studies that demonstrate reduced latency, higher throughput, and lower resource costs.

AutoMQCloud NativeKafka
0 likes · 13 min read
How AutoMQ Transforms Kafka into a Cloud‑Native, Elastic Messaging Service
DevOps Coach
DevOps Coach
Mar 26, 2026 · Cloud Native

How kubara Enables Rapid, Production‑Ready Kubernetes Platforms in 30 Minutes

This article explains how the open‑source kubara framework provides a GitOps‑driven, hub‑and‑spoke Kubernetes platform that can be bootstrapped in about 30 minutes, detailing its architecture, default security, control‑plane components, data‑plane onboarding, and step‑by‑step commands for a production‑grade setup.

Argo CDCloud NativeGitOps
0 likes · 20 min read
How kubara Enables Rapid, Production‑Ready Kubernetes Platforms in 30 Minutes
DevOps Coach
DevOps Coach
Mar 24, 2026 · Operations

Avoid the Top 10 Kubernetes Monitoring Mistakes Every SRE Team Makes

This article examines the ten most common Kubernetes monitoring errors that SRE teams encounter, explains why each mistake harms reliability, and provides concrete, actionable solutions—including the Golden Signals framework, pod‑restart analysis, alert‑fatigue reduction, application‑level observability, etcd health checks, network metrics, control‑plane monitoring, log‑metric correlation, resource request tracking, and end‑to‑end observability—to help teams build robust, scalable monitoring systems.

Cloud NativeKubernetesObservability
0 likes · 11 min read
Avoid the Top 10 Kubernetes Monitoring Mistakes Every SRE Team Makes
Code Wrench
Code Wrench
Mar 20, 2026 · Cloud Native

Inside Traefik v3: How Its Configuration Watcher, Router, and Concurrency Model Work

This article provides a senior Go engineer’s deep dive into Traefik’s source code, explaining the configuration hot‑reload engine, routing dispatch mechanism, and graceful concurrency model, and shows how to tune the proxy, build custom plugins, and apply the concepts to production‑grade Go services.

Cloud NativeConfiguration ReloadGo
0 likes · 13 min read
Inside Traefik v3: How Its Configuration Watcher, Router, and Concurrency Model Work
Architect Chen
Architect Chen
Mar 19, 2026 · Cloud Native

How Does Kubernetes Really Work? A Deep Dive into K8s Architecture

This article provides a comprehensive, step‑by‑step explanation of Kubernetes (K8s) architecture and operation, covering the control plane components, node components, data flow, and the detailed workflow from a kubectl command to a running pod, illustrated with diagrams and ASCII schematics.

Cloud NativeDevOpsKubernetes
0 likes · 5 min read
How Does Kubernetes Really Work? A Deep Dive into K8s Architecture
IT Architects Alliance
IT Architects Alliance
Mar 18, 2026 · Cloud Native

Why Serverless Projects Fail in Production and How to Avoid the Pitfalls

The article analyzes common misconceptions and hidden costs of serverless adoption, outlines four critical steps from PoC to production, and presents five enterprise‑grade best practices—including scenario selection, framework usage, observability, security, and cost governance—to ensure reliable, cost‑effective serverless deployments.

Cloud NativeCost OptimizationObservability
0 likes · 9 min read
Why Serverless Projects Fail in Production and How to Avoid the Pitfalls
Shuge Unlimited
Shuge Unlimited
Mar 17, 2026 · Operations

Exploring OpenClaw for K8s AIOps: Four Practical Scenarios from Concept to Deployment

This article analyzes how OpenClaw’s Skills, Subagent, and Cron capabilities can be leveraged to build Kubernetes AIOps solutions, presenting four detailed scenarios—fault diagnosis, resource optimization, security audit, and continuous health checks—while evaluating technical feasibility, security, reliability, cost, and a phased rollout plan.

Cloud NativeKubernetesOpenClaw
0 likes · 19 min read
Exploring OpenClaw for K8s AIOps: Four Practical Scenarios from Concept to Deployment
Alibaba Cloud Observability
Alibaba Cloud Observability
Mar 16, 2026 · Information Security

Secure OpenClaw AI Agents: One‑Click Log Integration & Real‑Time Auditing with Alibaba SLS

This article explains how to connect OpenClaw, a leading AI agent platform, to Alibaba Cloud Log Service (SLS) using the SLS Access Center, providing one‑click log ingestion, built‑in audit and observability dashboards, and detailed guidance for security auditing, cost monitoring, and troubleshooting across multiple data sources.

AI AgentAlibaba CloudCloud Native
0 likes · 29 min read
Secure OpenClaw AI Agents: One‑Click Log Integration & Real‑Time Auditing with Alibaba SLS
Selected Java Interview Questions
Selected Java Interview Questions
Mar 15, 2026 · Cloud Native

What Exactly Are Docker Images, Containers, and Kubernetes Pods? A Simple Guide

An easy-to-understand walkthrough explains Docker images as static system snapshots, containers as runnable instances, Dockerfile and docker‑compose recipes, and how Kubernetes Pods orchestrate containers, highlighting why these tools enable “run anywhere” deployment and scalable management across clusters.

Cloud NativeContainersDevOps
0 likes · 6 min read
What Exactly Are Docker Images, Containers, and Kubernetes Pods? A Simple Guide
Alibaba Cloud Native
Alibaba Cloud Native
Mar 15, 2026 · Artificial Intelligence

How LoongSuite Python Probe Brings Full‑Stack Observability to GenAI Applications

This article explains the three core challenges of AI‑agent observability—data back‑flow, inconsistent semantics, and missing end‑to‑end traces—and shows how the LoongSuite Python probe, built on OpenTelemetry, provides automatic instrumentation, unified GenAI semantics, multi‑dimensional coverage, and flexible OTLP export to simplify monitoring, debugging, and optimizing AI applications.

AI ObservabilityCloud NativeGenAI
0 likes · 15 min read
How LoongSuite Python Probe Brings Full‑Stack Observability to GenAI Applications
Cloud Native Technology Community
Cloud Native Technology Community
Mar 13, 2026 · Cloud Native

How Kubernetes Evolved into a Unified AI Platform for Massive Data and Autonomous Agents

From its 2015 debut as a stateless microservice orchestrator, Kubernetes now powers large‑scale data pipelines, distributed training, high‑throughput inference, and autonomous agents, unifying these workloads on a single platform while addressing resource coordination, multi‑cluster scheduling, and GPU economics.

AICloud NativeGPU scheduling
0 likes · 10 min read
How Kubernetes Evolved into a Unified AI Platform for Massive Data and Autonomous Agents
Raymond Ops
Raymond Ops
Mar 12, 2026 · Operations

How to Supercharge Prometheus: Proven Techniques to Slash Memory and Query Latency

This article shares real‑world experiences and step‑by‑step practices for optimizing Prometheus performance, covering metric pruning, scrape interval tuning, storage engine tweaks, query acceleration, federation architecture, and future observability trends to keep monitoring systems reliable at scale.

Cloud NativeObservabilityOperations
0 likes · 11 min read
How to Supercharge Prometheus: Proven Techniques to Slash Memory and Query Latency
Didi Tech
Didi Tech
Mar 11, 2026 · Cloud Native

How Huatuo Now Monitors MetaX GPUs for Cloud‑Native AI Workloads

Huatuo, the open‑source deep‑observability platform backed by Didi, now supports real‑time monitoring of MetaX GPUs, offering detailed hardware metrics via Docker or Kubernetes deployments and exposing them through a /metrics endpoint for cloud‑native AI and operations use cases.

AI InfrastructureCloud NativeGPU monitoring
0 likes · 4 min read
How Huatuo Now Monitors MetaX GPUs for Cloud‑Native AI Workloads
MeowKitty Programming
MeowKitty Programming
Mar 10, 2026 · Industry Insights

Why Java Remains the Undying Choice for Enterprise Development

The article analyzes Java's enduring dominance in enterprise software by examining its cross‑platform runtime, extensive ecosystem, continuous language and runtime innovations, performance and security breakthroughs, and the resulting cost, stability, and talent advantages that keep 90% of Fortune 500 companies invested.

Cloud NativeEnterprise DevelopmentJVM
0 likes · 8 min read
Why Java Remains the Undying Choice for Enterprise Development
DevOps Coach
DevOps Coach
Mar 6, 2026 · Operations

SRE vs Platform Engineering vs DevOps: Key Differences, Roles, and Toolchains

An in‑depth comparison of Site Reliability Engineering (SRE), Platform Engineering, and DevOps explains their origins, core responsibilities, distinct tools, and how they complement each other in modern cloud‑native organizations, helping teams choose the right practices for reliable, scalable software delivery.

Cloud NativeDevOpsSRE
0 likes · 9 min read
SRE vs Platform Engineering vs DevOps: Key Differences, Roles, and Toolchains
Linux Ops Smart Journey
Linux Ops Smart Journey
Mar 4, 2026 · Cloud Native

Secure Envoy Gateway with Basic Auth and Kubernetes Secrets

This guide walks through enabling Basic Authentication in Envoy Gateway by creating an .htpasswd file, storing it as a Kubernetes Secret, applying a SecurityPolicy, and verifying access with curl, while highlighting important security considerations such as using HTTPS.

Basic AuthCloud NativeEnvoy Gateway
0 likes · 5 min read
Secure Envoy Gateway with Basic Auth and Kubernetes Secrets
DevOps Coach
DevOps Coach
Mar 3, 2026 · Cloud Native

Discover Argo Workflows 4.0: 24 New Features, Performance Gains & UI Upgrades

Argo Workflows 4.0 has been released, bringing 24 new features, 122 bug fixes, and contributions from 73 developers, including artifact‑driver plugins, full CRD validation, deprecated singular sync primitives, name‑filtering for archived workflows, real‑time parallelism updates, OIDC custom CA support, UI improvements, and enhanced CLI commands, all aimed at simplifying large‑scale pipeline orchestration across clusters.

Argo WorkflowsCloud NativeKubernetes
0 likes · 9 min read
Discover Argo Workflows 4.0: 24 New Features, Performance Gains & UI Upgrades
Linux Ops Smart Journey
Linux Ops Smart Journey
Mar 3, 2026 · Cloud Native

Prevent Service Avalanches: Configuring Circuit Breaker & Connection Limits in Envoy Gateway

This tutorial explains how to use Envoy Gateway on Kubernetes to implement circuit breaker and connection‑limit policies, walks through the necessary YAML configurations, demonstrates verification with the hey load‑testing tool, and shows how these mechanisms improve system resilience in microservice architectures.

Cloud NativeConnection LimitEnvoy
0 likes · 12 min read
Prevent Service Avalanches: Configuring Circuit Breaker & Connection Limits in Envoy Gateway
Shuge Unlimited
Shuge Unlimited
Feb 27, 2026 · Databases

Why Is Milvus, the 43K‑Star Vector Database, So Powerful?

This article analyzes Milvus—its open‑source origins, three deployment modes, four‑layer architecture, eight‑plus indexing algorithms, real‑world case studies, and a detailed comparison with competitors—highlighting its strengths, weaknesses, common pitfalls, and when it’s the right choice for large‑scale AI workloads.

AI workloadsCloud NativeDeployment
0 likes · 15 min read
Why Is Milvus, the 43K‑Star Vector Database, So Powerful?
Alibaba Cloud Native
Alibaba Cloud Native
Feb 26, 2026 · Cloud Native

Seamlessly Connect Dify Apps to Multiple AI Models with the Higress AI Gateway

This guide explains the challenges of managing multiple AI models in Dify, introduces the Higress AI Gateway plugin that provides unified model access, governance, and fine‑grained authentication, and walks through two practical demos—an Agent assistant and an image‑generation workflow—showing step‑by‑step configuration and usage.

Cloud NativeDifyHigress
0 likes · 10 min read
Seamlessly Connect Dify Apps to Multiple AI Models with the Higress AI Gateway
Alibaba Cloud Native
Alibaba Cloud Native
Feb 25, 2026 · Cloud Native

How a 100M‑MAU Parenting App Scaled with RocketMQ 5.x Serverless for Real‑Time Messaging

The article details how the "Qinbao" parenting app, serving over ten million daily active users and massive photo/video uploads, tackled high‑concurrency writes, instant cross‑device notifications, and distributed transaction consistency by migrating to Alibaba Cloud RocketMQ 5.x Serverless, achieving millisecond latency, elastic scaling, and near‑perfect reliability.

Cloud NativeMessagingMobile Backend
0 likes · 9 min read
How a 100M‑MAU Parenting App Scaled with RocketMQ 5.x Serverless for Real‑Time Messaging
IT Services Circle
IT Services Circle
Feb 25, 2026 · Cloud Native

Nacos vs Apollo: Which Config Center Is Faster and Easier for Microservices?

This article compares Nacos and Apollo, the two most popular configuration centers in the microservice ecosystem, covering their design philosophies, quick‑start procedures, core features, data models, architecture, performance, operational costs, community support, and provides guidance on when to choose each solution.

ApolloCloud NativeConfiguration Management
0 likes · 21 min read
Nacos vs Apollo: Which Config Center Is Faster and Easier for Microservices?
Raymond Ops
Raymond Ops
Feb 24, 2026 · Cloud Native

Master Enterprise Monitoring: Build a Prometheus + Grafana Observability Platform

This guide details how to design and implement an enterprise‑grade cloud‑native observability platform using Prometheus for metrics collection and Grafana for visualization, covering architecture, high‑availability deployment, alerting, dashboard automation, case studies, best‑practice recommendations, and future trends.

Cloud NativeGrafanaObservability
0 likes · 24 min read
Master Enterprise Monitoring: Build a Prometheus + Grafana Observability Platform
AI Waka
AI Waka
Feb 22, 2026 · Industry Insights

Why Multi‑Agent AI Fails at Scale and How 12‑Factor Cloud‑Native Principles Save It

The article explains why naïve multi‑agent AI architectures collapse under load due to internal east‑west dependencies, and shows how applying 12‑Factor App and cloud‑native patterns—isolated workers, externalized state, short‑lived sessions, and strict orchestration—enable scalable, fault‑tolerant agentic systems.

12-factorCloud NativeDistributed Systems
0 likes · 17 min read
Why Multi‑Agent AI Fails at Scale and How 12‑Factor Cloud‑Native Principles Save It
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Feb 22, 2026 · Cloud Native

How to Stabilize Java Services on Kubernetes: A 3‑Year Success Story

This article walks through a real‑world Java service on Kubernetes, detailing the initial confidence, recurring OOM and rollout issues, and a multi‑round remediation that introduced container‑aware JVM settings, refined resource requests, OOM dumps, probes, and metrics, ultimately achieving three years of stable operation with lower resource usage.

Cloud NativeJVMJava
0 likes · 10 min read
How to Stabilize Java Services on Kubernetes: A 3‑Year Success Story
Raymond Ops
Raymond Ops
Feb 12, 2026 · Cloud Native

Master Kubernetes: Core Concepts, Architecture, and Advanced Networking Explained

This comprehensive guide demystifies Kubernetes by covering its core principles, component architecture, service discovery mechanisms, pod resource sharing, CNI plugins, multi‑layer load balancing, and IP addressing models, providing engineers with the knowledge needed to design and operate robust cloud‑native clusters.

CNICloud NativeIP addressing
0 likes · 14 min read
Master Kubernetes: Core Concepts, Architecture, and Advanced Networking Explained
ByteDance Data Platform
ByteDance Data Platform
Feb 11, 2026 · Databases

How ByteHouse Redefines Real‑Time Multimodal Analytics with a Cloud‑Native Data Warehouse

ByteHouse, ByteDance's cloud‑native data warehouse, evolves from a traditional warehouse to a next‑generation AI‑ready platform that handles 800+ PB of data, supports 25,000 nodes, and delivers real‑time, multimodal analytics through a decoupled storage‑compute architecture, AI‑driven query optimization, and native vector search integration.

AI OptimizationCloud NativeReal-time analytics
0 likes · 9 min read
How ByteHouse Redefines Real‑Time Multimodal Analytics with a Cloud‑Native Data Warehouse
Code Wrench
Code Wrench
Feb 9, 2026 · Cloud Native

Is Go the AI‑Recommended Gold Mine for Backend and Cloud‑Native Success?

The article examines AI platforms' rankings of high‑pay programming languages, highlights Go's strengths in performance, concurrency and cloud‑native ecosystems, and offers practical advice and future outlook for developers seeking lucrative and sustainable career paths with Go.

Cloud NativeGocareer
0 likes · 7 min read
Is Go the AI‑Recommended Gold Mine for Backend and Cloud‑Native Success?
JakartaEE China Community
JakartaEE China Community
Feb 9, 2026 · Backend Development

Eclipse Foundation Unveils Jakarta EE 11 with Simplified Data Access and Cloud‑Native Enhancements

The Eclipse Foundation announced Jakarta EE 11, highlighting streamlined data access via the new Jakarta Data spec, support for Java 21 including virtual threads, modernized testing tools, deprecation of Managed Beans, and early certification of several enterprise servers, marking a major step toward cloud‑native Java development.

Cloud NativeEnterprise JavaJakarta Data
0 likes · 6 min read
Eclipse Foundation Unveils Jakarta EE 11 with Simplified Data Access and Cloud‑Native Enhancements
Alibaba Cloud Native
Alibaba Cloud Native
Feb 6, 2026 · Cloud Native

Ingress NGINX Retirement: Impact, Risks, and Migration Strategies

Kubernetes SIG Network and Security committees announced the retirement of Ingress NGINX, detailing the end‑of‑life timeline, lack of future releases or security patches, and urging users to assess their clusters and migrate to Gateway API or alternative ingress controllers within two months.

Cloud NativeGateway APIKubernetes
0 likes · 5 min read
Ingress NGINX Retirement: Impact, Risks, and Migration Strategies
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Feb 1, 2026 · Cloud Native

Master Kubernetes Liveness Probes: When, Why, and How to Use Them

This article provides a comprehensive guide to Kubernetes Liveness Probes, explaining their purpose, the three probe types (HTTP GET, TCP Socket, Exec), how they differ from Readiness and Startup probes, practical YAML examples, verification steps, common pitfalls, troubleshooting tips, and best‑practice recommendations for improving pod stability and self‑healing.

Cloud NativeKubernetesLiveness Probe
0 likes · 10 min read
Master Kubernetes Liveness Probes: When, Why, and How to Use Them
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 29, 2026 · Cloud Native

How Alibaba Cloud’s MaxCompute Powers Multi‑Modal AI Data Processing for MOSI Intelligence

In the era of rapid AI advancement, MOSI Intelligence faced IDC storage, compute, and network bottlenecks for large‑scale audio‑video pipelines, prompting a partnership with Alibaba Cloud to build a cloud‑native, one‑stop multi‑modal data processing platform using MaxCompute and the custom MaxFrame engine, dramatically improving performance and operational efficiency.

AI Data PlatformCloud NativeMaxCompute
0 likes · 8 min read
How Alibaba Cloud’s MaxCompute Powers Multi‑Modal AI Data Processing for MOSI Intelligence
JakartaEE China Community
JakartaEE China Community
Jan 28, 2026 · Cloud Native

Key Findings from the 2024 Java Cloud‑Native Survey

The Jakarta EE Working Group surveyed over 170 developers between July and August 2024, revealing that Jakarta EE 8/Java EE 8 remain dominant, Java SE 17 is the most used version, Spring Boot and Tomcat lead runtime adoption, and MicroProfile adoption is split between newer and legacy versions.

Cloud NativeEnterprise JavaJava SE
0 likes · 6 min read
Key Findings from the 2024 Java Cloud‑Native Survey
Alibaba Cloud Native
Alibaba Cloud Native
Jan 27, 2026 · Cloud Native

How Serverless Sandboxes Power Secure AI Agents

This article explains how lightweight security sandboxes in serverless function compute provide strong process isolation, elastic resource scaling, and pay‑per‑use economics, enabling AI agents to execute untrusted code safely while supporting session lifecycle management, protocol extensions, and cost‑effective billing models.

AI AgentCloud NativeFunction Compute
0 likes · 12 min read
How Serverless Sandboxes Power Secure AI Agents
Volcano Engine Developer Services
Volcano Engine Developer Services
Jan 26, 2026 · Databases

How Volcano Engine veDB Scales to Tens of Thousands of Pods with Cloud‑Native Architecture

This article explains how Volcano Engine's veDB leverages compute‑storage separation, Kubernetes operators, and declarative operations to achieve extreme deployment density, seamless scaling, and high‑availability for millions of database instances, while addressing the challenges of traditional VM‑based deployments.

Cloud NativeOperatordatabase scaling
0 likes · 14 min read
How Volcano Engine veDB Scales to Tens of Thousands of Pods with Cloud‑Native Architecture
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jan 26, 2026 · Cloud Native

How Kimi Scaled AI Agents with Alibaba Cloud’s Elastic Sandbox Architecture

Kimi built a high‑performance, low‑cost AI Agent infrastructure by combining Alibaba Cloud ACK node pools and the ACS Agent Sandbox, addressing challenges of instant sandbox response, state continuity, massive concurrency, cost efficiency, security isolation, and search‑memory integration for production‑grade agents.

AI AgentCloud NativeCost Optimization
0 likes · 18 min read
How Kimi Scaled AI Agents with Alibaba Cloud’s Elastic Sandbox Architecture
DevOps Coach
DevOps Coach
Jan 22, 2026 · Cloud Native

Why YAML Won’t Scale in Kubernetes and What’s Coming Next

The article examines how YAML, once central to Kubernetes, has become a scalability bottleneck due to human error, lack of intent modeling, and configuration debt, and outlines a shift toward intent‑driven, autonomous platforms powered by code‑native execution and continuous SLO enforcement.

Cloud NativeInfrastructure AutomationKubernetes
0 likes · 7 min read
Why YAML Won’t Scale in Kubernetes and What’s Coming Next
Alibaba Cloud Native
Alibaba Cloud Native
Jan 22, 2026 · Cloud Native

Building a Cloud‑Native AI Glass Traffic Enforcement Prototype with AgentRun and Serverless Functions

This article details a cloud‑native architecture that combines Meta Ray‑Ban AI glasses, a custom iOS app, and Alibaba Cloud Function Compute (FC) with AgentRun to perform OCR‑based traffic rule enforcement, showcasing a three‑layer "client‑brain‑tools" design, prompt‑driven logic, and cost‑effective serverless deployment.

AIAgent ArchitectureAlibaba Cloud
0 likes · 14 min read
Building a Cloud‑Native AI Glass Traffic Enforcement Prototype with AgentRun and Serverless Functions
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jan 22, 2026 · Cloud Native

Mastering Kubernetes: Complete Architecture, Principles, and Components Explained

This article provides a comprehensive technical overview of Kubernetes, covering its core problems, master‑worker architecture, essential components such as API server, etcd, scheduler, controller manager, kubelet, kube-proxy, container runtimes, and a step‑by‑step deployment workflow, illustrated with diagrams.

Cloud NativeContainersKubernetes
0 likes · 5 min read
Mastering Kubernetes: Complete Architecture, Principles, and Components Explained
DevOps Coach
DevOps Coach
Jan 20, 2026 · Cloud Native

How to Scale Kubernetes to Hundreds of Clusters: A Practical Enterprise Guide

This article walks you through the complete journey from a single Kubernetes cluster to a production‑grade, multi‑cluster platform, covering managed services, capacity planning, GitOps pipelines, networking, observability, cost optimisation, upgrade strategies, and the people and processes needed for sustainable large‑scale operations.

Cloud NativeCost ManagementInfrastructure
0 likes · 27 min read
How to Scale Kubernetes to Hundreds of Clusters: A Practical Enterprise Guide
DataFunSummit
DataFunSummit
Jan 18, 2026 · Big Data

How Ray Reinvents AI Data Pipelines for Massive Multimodal Inference

This article examines the shortcomings of traditional big‑data engines for AI workloads, presents a Ray‑based heterogeneous fusion architecture that unifies CPU/GPU scheduling, Python ecosystems, and streaming‑batch processing, and details fault‑tolerance, checkpointing, compute‑storage separation, resource‑utilization, scalability, and observability improvements that enable thousands of nodes and dramatically higher GPU efficiency.

Big DataCloud NativeRay
0 likes · 31 min read
How Ray Reinvents AI Data Pipelines for Massive Multimodal Inference
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jan 17, 2026 · Cloud Native

Deploying Microservices on Kubernetes: A Step‑by‑Step Guide

Learn how to package each microservice into containers and host them on a Kubernetes cluster, covering architecture diagrams, Ingress traffic routing, service discovery, ConfigMap and Secret management, persistent storage, deployment manifests, autoscaling, and CI/CD automation, while avoiding promotional fluff.

Cloud NativeConfigMapDeployment
0 likes · 4 min read
Deploying Microservices on Kubernetes: A Step‑by‑Step Guide
Java Architect Handbook
Java Architect Handbook
Jan 14, 2026 · Operations

How to Build a Scalable Prometheus Monitoring System for Big Data on Kubernetes

This guide explains how to design, configure, and implement a Prometheus‑based monitoring solution for big‑data components running in Kubernetes, covering metric exposure methods, scrape configurations, alerting architecture, dynamic rule management, exporter deployment, and practical examples with full YAML snippets.

AlertingBig Data MonitoringCloud Native
0 likes · 19 min read
How to Build a Scalable Prometheus Monitoring System for Big Data on Kubernetes