Tagged articles
11 articles
Page 1 of 1
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Oct 20, 2025 · Artificial Intelligence

How ACK Inference Gateway Tripled Large‑Model Performance for an Insurance Giant

This article details how Guotai Insurance tackled the high latency and cost of large‑model inference by deploying Alibaba Cloud's ACK Inference Gateway, which uses load‑aware, prefix‑aware routing, intelligent queuing, and comprehensive observability to boost efficiency threefold while reducing expenses.

ACK GatewayAI inferenceCloud Native
0 likes · 18 min read
How ACK Inference Gateway Tripled Large‑Model Performance for an Insurance Giant
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 11, 2025 · Cloud Native

Implementing Per‑User Rate Limiting with Alibaba Cloud Service Mesh (ASM) Traffic Scheduling Suite

This article explains how to use Alibaba Cloud Service Mesh (ASM) traffic‑scheduling suite to implement rich traffic‑control scenarios such as per‑user rate limiting, request queuing and priority scheduling in a Kubernetes environment, providing step‑by‑step deployment, configuration and verification instructions.

ASMKubernetesMicroservices
0 likes · 14 min read
Implementing Per‑User Rate Limiting with Alibaba Cloud Service Mesh (ASM) Traffic Scheduling Suite
Volcano Engine Developer Services
Volcano Engine Developer Services
Aug 8, 2024 · Cloud Native

How HTTPDNS Edge Migration Boosted Performance and Cut Costs by 35%

This article details the end‑to‑end migration of ByteDance's HTTPDNS service from a central cloud to edge nodes, covering technical challenges in service placement and traffic scheduling, the edge‑native solutions implemented with visualization models and GTM, and the resulting performance, cost and reliability gains.

HTTPDNSPerformance OptimizationTraffic Scheduling
0 likes · 13 min read
How HTTPDNS Edge Migration Boosted Performance and Cut Costs by 35%
Architect
Architect
Oct 19, 2023 · Industry Insights

How Vivo Built a Highly Available Push System: Multi‑Region Architecture, Real‑Time Traffic Scheduling, and Disaster‑Recovery Strategies

This article analyzes the design of Vivo's push notification platform, detailing its high‑concurrency requirements, three‑region long‑connection deployment, traffic‑scheduling bypass layer, and layered storage disaster‑recovery solutions, while explaining the trade‑offs and performance metrics behind each architectural decision.

Cloud NativeKafkaSystem Architecture
0 likes · 14 min read
How Vivo Built a Highly Available Push System: Multi‑Region Architecture, Real‑Time Traffic Scheduling, and Disaster‑Recovery Strategies
DeWu Technology
DeWu Technology
Mar 15, 2023 · Operations

Blue-Green Deployment: Process, Traffic Scheduling, and Component Support

The article explains blue‑green deployment as a release strategy that improves large‑scale microservice rollouts by extracting traffic from a blue cluster, incrementally shifting it to a green environment, using global and local traffic scheduling, central metadata, compatible components, and careful considerations such as idempotent consumption and version compatibility.

Blue‑Green deploymentContinuous DeliveryOperations
0 likes · 12 min read
Blue-Green Deployment: Process, Traffic Scheduling, and Component Support
iQIYI Technical Product Team
iQIYI Technical Product Team
Sep 17, 2021 · Cloud Computing

iQIYI Full‑Network Automatic Traffic Scheduling System: Architecture, Implementation, and Performance Evaluation

iQIYI’s SDN‑based full‑network automatic traffic‑scheduling system dynamically balances inter‑ and intra‑province traffic using BGP and policy routing, integrates monitoring, flow collection, DFS backup‑path calculation, and real‑time Kafka/Flink processing, cutting fault‑handling time to minutes and boosting link availability to 99.9999 % while preparing for programmable‑switch and SR‑based extensions.

BGPSDNTraffic Scheduling
0 likes · 11 min read
iQIYI Full‑Network Automatic Traffic Scheduling System: Architecture, Implementation, and Performance Evaluation
Efficient Ops
Efficient Ops
Oct 29, 2019 · Operations

How Xiami’s SRE Team Revamped Monitoring to Cut Alert Noise by 90%

Xiami’s SRE team overhauled its monitoring system by categorizing alerts, introducing fault, generic, and basic monitoring, optimizing alert paths with stream processing, and leveraging Alibaba’s traffic scheduling platform, dramatically reducing daily noise from thousands of alerts to a manageable few hundred critical notifications.

AlibabaSRETraffic Scheduling
0 likes · 9 min read
How Xiami’s SRE Team Revamped Monitoring to Cut Alert Noise by 90%
Architects' Tech Alliance
Architects' Tech Alliance
Mar 25, 2019 · Cloud Computing

Tencent Cloud’s Intelligent Traffic Scheduling and High‑Redundancy Architecture Mitigate Shanghai Fiber‑Cut Outage

On March 23, a construction accident severed a fiber optic cable in Shanghai, causing widespread internet disruptions, but Tencent Cloud’s intelligent traffic scheduling system and four‑fiber‑three‑router high‑redundancy architecture automatically rerouted traffic, restoring services within two minutes and demonstrating robust cloud network resilience.

BGPNetwork ResilienceTraffic Scheduling
0 likes · 6 min read
Tencent Cloud’s Intelligent Traffic Scheduling and High‑Redundancy Architecture Mitigate Shanghai Fiber‑Cut Outage
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Dec 8, 2017 · Operations

Alibaba Edge Network Traffic Optimization for Double 11: NetO System and BGP/SDN Strategies

The article explains how Alibaba’s Edge Network, using the NetO intelligent traffic scheduling system, combines real‑time internet quality monitoring, SDN‑enhanced BGP routing, and extensive peering to optimize both inbound and outbound paths for millions of global Double 11 shoppers, ensuring a seamless user experience.

AlibabaBGPEdge Network
0 likes · 5 min read
Alibaba Edge Network Traffic Optimization for Double 11: NetO System and BGP/SDN Strategies
ITPUB
ITPUB
Nov 25, 2015 · Operations

Why Meizu Adopted Multi‑Data‑Center Deployment and How It Works

Meizu moved from a single‑datacenter to a multi‑datacenter architecture to improve reliability, reduce latency, and meet user proximity demands, detailing technical challenges, traffic scheduling, read‑heavy and read‑write balanced services, and GSLB‑based routing solutions.

GSLBReliabilityTraffic Scheduling
0 likes · 10 min read
Why Meizu Adopted Multi‑Data‑Center Deployment and How It Works
Efficient Ops
Efficient Ops
Oct 22, 2015 · Operations

Unlock Hidden Savings: Optimizing Multi‑Data Center Bandwidth Costs

This article examines the characteristics and billing models of multi‑data‑center networks, analyzes external traffic patterns, identifies challenges in optimizing Internet‑facing bandwidth, and proposes practical scheduling strategies to better utilize idle bandwidth and reduce carrier costs.

Multi-Data CenterOperationsTraffic Scheduling
0 likes · 13 min read
Unlock Hidden Savings: Optimizing Multi‑Data Center Bandwidth Costs