Author

Alibaba Cloud Native

We publish cloud-native tech news, curate in-depth content, host regular events and live streams, and share Alibaba product and user case studies. Join us to explore and share the cloud-native insights you need.

1.2k

Articles

Likes

1.9k

Views

Comments

Latest from Alibaba Cloud Native

100 recent articles max

Alibaba Cloud Native

Oct 17, 2025 · Artificial Intelligence

How We Boosted Embedding Service Throughput 16× with Cloud‑Native Optimizations

This article details the cost and speed challenges of embedding vectors in large‑scale log scenarios, analyzes inference framework choices, describes GPU utilization, priority queuing, and pipeline redesigns, and reports a 16‑fold throughput increase and dramatically lower per‑request costs.

EmbeddingGPU optimizationTriton

0 likes · 8 min read

How We Boosted Embedding Service Throughput 16× with Cloud‑Native Optimizations

Alibaba Cloud Native

Oct 16, 2025 · Artificial Intelligence

How Spring AI Alibaba Admin Powers Data‑Centric AI Agent Development and Ops

This article outlines the industry shift toward large‑scale AI Agent deployment, identifies key engineering challenges such as prompt management, quality assessment, and observability, and presents Spring AI Alibaba Admin—a cloud‑native platform that offers prompt, dataset, evaluator, and tracing capabilities, complete with setup instructions and future roadmap.

AI AgentJavaNacos

0 likes · 15 min read

How Spring AI Alibaba Admin Powers Data‑Centric AI Agent Development and Ops

Alibaba Cloud Native

Oct 15, 2025 · Cloud Native

What’s New in Higress 2.0? 30 Updates Including RAG MCP Server and Performance Fixes

The Higress 2.0 release introduces 30 changes—13 new features such as a RAG MCP server and ECDS‑based configuration refactor, 7 bug fixes, 5 refactorings, documentation updates and a test improvement—providing developers with enhanced knowledge‑management capabilities, more stable routing, and clearer documentation for cloud‑native service‑mesh environments.

Bug FixMCPRAG

0 likes · 20 min read

What’s New in Higress 2.0? 30 Updates Including RAG MCP Server and Performance Fixes

Alibaba Cloud Native

Oct 14, 2025 · Mobile Development

How Alibaba Cloud RUM SDK Captures iOS App Performance and Crashes

The article explains the architecture, data collection methods, and crash monitoring techniques of Alibaba Cloud's RUM SDK for iOS, detailing session tracing, performance metrics, Method Swizzling, system event handling, and KSCrash integration to improve issue diagnosis.

Crash ReportingMethod SwizzlingMobile Development

0 likes · 9 min read

How Alibaba Cloud RUM SDK Captures iOS App Performance and Crashes

Alibaba Cloud Native

Oct 12, 2025 · Cloud Native

Boost Code Review Accuracy with Single‑Commit AI Review Mode

The article explains how the single‑commit review mode in Alibaba Cloud Codeup uses AI to evaluate each commit individually, addressing the shortcomings of default bulk diff reviews, detailing configuration steps, recommended scenarios, observed benefits, and its performance trade‑offs.

AI code reviewCloud NativeContinuous Integration

0 likes · 6 min read

Boost Code Review Accuracy with Single‑Commit AI Review Mode

Alibaba Cloud Native

Oct 11, 2025 · Artificial Intelligence

How AI Gateway Redefines AI Application Infrastructure with Serverless Flexibility

The article provides a comprehensive overview of the AI Gateway product, detailing its evolution, core capabilities across model, tool, and agent access, security features, the open‑source HiMarket platform, and the new Serverless edition that dramatically lowers entry costs for AI workloads.

AI InfrastructureOpen Platformserverless

0 likes · 16 min read

How AI Gateway Redefines AI Application Infrastructure with Serverless Flexibility

Alibaba Cloud Native

Oct 10, 2025 · Artificial Intelligence

How AI Gateways Are Evolving: From Simple Routing to Intelligent Multi‑Model Orchestration

Since 2024, AI gateways have shifted from static rule‑based routers to flexible platforms that support multi‑model traffic scheduling, smart routing, agent and MCP service management, and AI governance, driven by new tools like Tinker, OpenAI's Apps SDK, and emerging video generation technologies.

AI toolsAgent DevelopmentMulti-Model Routing

0 likes · 12 min read

How AI Gateways Are Evolving: From Simple Routing to Intelligent Multi‑Model Orchestration

Alibaba Cloud Native

Sep 30, 2025 · Cloud Native

Deploy a Scalable MCP Server with Function Compute and MSE Nacos

This guide explains how to address high deployment costs, slow iteration, and poor manageability of MCP Server by using Alibaba Cloud Function Compute for serverless execution and MSE Nacos Enterprise for automatic registration, dynamic configuration, and unified service governance.

Function ComputeMCPMSE Nacos

0 likes · 13 min read

Deploy a Scalable MCP Server with Function Compute and MSE Nacos

Alibaba Cloud Native

Sep 23, 2025 · Artificial Intelligence

Why Independent Runtime Agents Are the Future of Scalable AI Systems

The article explains how a configuration‑driven, cloud‑native architecture with independent runtime agents solves performance isolation, availability, scalability, security, and technology heterogeneity problems of low‑code platforms, and introduces a unified Agent Spec, Agent Studio, execution engine, A2A protocol, and dynamic governance to enable enterprise‑grade AI deployments.

Cloud NativeDynamic Scalingconfiguration-driven

0 likes · 29 min read

Why Independent Runtime Agents Are the Future of Scalable AI Systems

Alibaba Cloud Native

Sep 22, 2025 · Cloud Native

How Alibaba Cloud AI Gateway Ensures High Availability for LLM Services

This guide explains how Alibaba Cloud AI Gateway provides traffic management, passive health checks, first‑packet timeout, and fallback mechanisms to keep large language model services highly available during traffic spikes and overload scenarios.

First Packet TimeoutLLMPassive Health Check

0 likes · 8 min read

How Alibaba Cloud AI Gateway Ensures High Availability for LLM Services