Alibaba Cloud Infrastructure
Author

For uninterrupted computing services

353 articles · 0 likes · 936 views · 0 comments
Recent Articles

Alibaba Cloud Infrastructure
Jan 26, 2026 · Cloud Native

How Kimi Scaled AI Agents with Alibaba Cloud’s Elastic Sandbox Architecture

Kimi built a high‑performance, low‑cost AI Agent infrastructure by combining Alibaba Cloud ACK node pools and the ACS Agent Sandbox, addressing challenges of instant sandbox response, state continuity, massive concurrency, cost efficiency, security isolation, and search‑memory integration for production‑grade agents.

AI Agent · Kubernetes · cloud-native
0 likes · 18 min read
Alibaba Cloud Infrastructure
Jan 21, 2026 · Artificial Intelligence

Boost LLM Performance: Deploy Qwen3‑235B with PD‑Separation, MoE, SGLang & RBG

This article details how to deploy the 235‑billion‑parameter Qwen3‑235B model using PD‑separation and MoE techniques, explains the associated challenges, and demonstrates a production‑grade solution built on the high‑performance SGLang inference engine and the RoleBasedGroup (RBG) orchestration framework, complete with benchmark results and best‑practice YAML examples.

AI · Inference · Kubernetes
0 likes · 21 min read
Alibaba Cloud Infrastructure
Jan 15, 2026 · Cloud Native

Deploy Alibaba Cloud Service Mesh (ASM): Gateways, Traffic Management & Zero‑Trust

This guide explains how to set up Alibaba Cloud Service Mesh (ASM) on an ACK Kubernetes cluster, covering prerequisites, two methods of cluster registration, creation of north‑south and east‑west gateways, traffic routing with HTTPRoute, security policies using PeerAuthentication and AuthorizationPolicy, and observability configuration via Telemetry.

ASM · Alibaba Cloud · Gateway API
0 likes · 9 min read
Alibaba Cloud Infrastructure
Dec 27, 2025 · Cloud Native

How to Safely Deploy AI Inference Models Across Multi‑Cluster Environments with ACK One Fleet

This article explains why AI inference services require multi‑cluster gray‑release, outlines the risks of traditional updates, and details how ACK One Fleet combined with Kruise Rollout provides a controlled, observable, and rollback‑capable solution for deploying large AI models across hybrid cloud clusters.

ACK One · AI · Kruise Rollout
0 likes · 10 min read
Alibaba Cloud Infrastructure
Dec 22, 2025 · Artificial Intelligence

Boost LLM Inference with KV‑Cache‑Aware Routing on Alibaba Cloud ACK GIE

This article explains why KV‑Cache hit rate is critical for large‑model inference, describes vLLM's automatic prefix caching, outlines the distributed cache challenges, and provides a step‑by‑step guide to deploying Alibaba Cloud ACK Gateway with Inference Extension's precise‑mode prefix‑cache‑aware routing, backed by benchmark results.

Alibaba Cloud · Inference · KV cache
0 likes · 18 min read