Tagged articles
88 articles
Page 1 of 1
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Mar 27, 2026 · Cloud Native

How AutoMQ Transforms Kafka into a Cloud‑Native, Elastic Messaging Service

This article examines the limitations of traditional Kafka in large‑scale deployments and presents AutoMQ’s cloud‑native redesign—detailing its stateless architecture, storage separation, automatic scaling, read/write isolation, performance benchmarks, and real‑world migration case studies that demonstrate reduced latency, higher throughput, and lower resource costs.

AutoMQCloud NativeKafka
0 likes · 13 min read
How AutoMQ Transforms Kafka into a Cloud‑Native, Elastic Messaging Service
Architect's Guide
Architect's Guide
Feb 26, 2026 · Backend Development

8 Essential Software Architecture Patterns and When to Use Them

This article explains eight common software architecture patterns—from single‑database apps to microservices, caching, sharding, elastic scaling and multi‑datacenter deployment—detailing their designs, typical use cases, advantages, drawbacks, and practical implementation steps.

Design PatternsSoftware Architecturebackend scaling
0 likes · 23 min read
8 Essential Software Architecture Patterns and When to Use Them
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jan 26, 2026 · Cloud Native

How Kimi Scaled AI Agents with Alibaba Cloud’s Elastic Sandbox Architecture

Kimi built a high‑performance, low‑cost AI Agent infrastructure by combining Alibaba Cloud ACK node pools and the ACS Agent Sandbox, addressing challenges of instant sandbox response, state continuity, massive concurrency, cost efficiency, security isolation, and search‑memory integration for production‑grade agents.

AI AgentCloud NativeCost Optimization
0 likes · 18 min read
How Kimi Scaled AI Agents with Alibaba Cloud’s Elastic Sandbox Architecture
Alibaba Cloud Observability
Alibaba Cloud Observability
Dec 29, 2025 · Cloud Native

How to Seamlessly Import Massive S3 Logs into Alibaba Cloud SLS with Real‑Time Analysis

This article explains how to centralize and analyze massive multi‑cloud log data stored in object storage by moving AWS S3 logs into Alibaba Cloud Log Service (SLS) using dual‑mode file discovery, SQS event‑driven import, elastic scaling, and pre‑ingestion processing to achieve low latency, high reliability, and cost efficiency.

AWS S3Real-time Processingalibaba-sls
0 likes · 12 min read
How to Seamlessly Import Massive S3 Logs into Alibaba Cloud SLS with Real‑Time Analysis
IT Architects Alliance
IT Architects Alliance
Sep 7, 2025 · Cloud Native

Mastering Elastic Scaling on Kubernetes: Cut Costs While Handling Traffic Peaks

This article explains how to design elastic scaling architectures on cloud platforms—combining horizontal, vertical, and functional scaling, leveraging Kubernetes autoscaling features, predictive scaling, mixed instance strategies, and cost‑monitoring practices—to handle traffic spikes while minimizing expenses.

Cloud Cost OptimizationDevOpsautoscaling
0 likes · 9 min read
Mastering Elastic Scaling on Kubernetes: Cut Costs While Handling Traffic Peaks
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 5, 2025 · Cloud Native

Using Fluid Cloud‑Native Data Caching to Boost Performance and Elasticity of a Quantitative Research Platform on Alibaba Cloud

This article describes how JoinQuant built a cloud‑native quantitative research platform on Alibaba Cloud, identified performance, cost, data‑management, and security challenges, and solved them with Fluid’s JindoRuntime data‑caching, elastic scaling, and Python‑driven workflows, achieving dramatic speed and cost improvements.

Cloud NativeData CachingFluid
0 likes · 18 min read
Using Fluid Cloud‑Native Data Caching to Boost Performance and Elasticity of a Quantitative Research Platform on Alibaba Cloud
Architecture and Beyond
Architecture and Beyond
Feb 6, 2025 · Operations

Analyzing DeepSeek’s Availability Issues and Applying Traditional Internet Reliability Strategies to AIGC

This article examines DeepSeek’s frequent service interruptions, contrasts the inherent reliability challenges of AIGC products with traditional internet applications, and proposes adopting proven isolation, rate‑limiting, and elastic‑scaling techniques to improve AI service availability and user experience.

AIGCAvailabilityDeepSeek
0 likes · 12 min read
Analyzing DeepSeek’s Availability Issues and Applying Traditional Internet Reliability Strategies to AIGC
DataFunSummit
DataFunSummit
Feb 6, 2025 · Big Data

Migrating Big Data Workloads to Cloud‑Native Kubernetes: Challenges, Solutions, and Lessons from OPPO

This article describes how OPPO's big‑data team transitioned from traditional IDC and EMR environments to a cloud‑native Kubernetes architecture, detailing the motivations, design principles, elastic scaling challenges, custom solutions, and future directions for large‑scale data processing on the cloud.

Cloud NativeKuberneteselastic scaling
0 likes · 18 min read
Migrating Big Data Workloads to Cloud‑Native Kubernetes: Challenges, Solutions, and Lessons from OPPO
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 23, 2025 · Big Data

How Alibaba Cloud DataWorks Leverages Flink CDC for Scalable Data Lake Integration

Alibaba Cloud DataWorks’ Data Integration platform, built on Flink CDC, offers a comprehensive, serverless solution for real‑time and batch data lake ingestion, detailing its architecture, elastic scaling, productized use cases, and future roadmap, including AI‑driven diagnostics and expanded source support.

Big DataData IntegrationData Lake
0 likes · 12 min read
How Alibaba Cloud DataWorks Leverages Flink CDC for Scalable Data Lake Integration
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jan 17, 2025 · Artificial Intelligence

Elastic Scaling of Large Language Model Inference on Alibaba Cloud ACK with Knative, ResourcePolicy, and Fluid

This article explains how to reduce inference cost and improve performance for large language models on Alibaba Cloud ACK by using Knative's request‑based autoscaling, custom ResourcePolicy priority scheduling, and Fluid data‑caching to achieve elastic scaling, resource pre‑emption, and faster model loading.

FluidInferenceKnative
0 likes · 22 min read
Elastic Scaling of Large Language Model Inference on Alibaba Cloud ACK with Knative, ResourcePolicy, and Fluid
Alibaba Cloud Native
Alibaba Cloud Native
Jan 2, 2025 · Cloud Native

Unlocking Serverless Elastic Scaling: ElasticWorkload, WorkloadSpread, UnitedDeployment & ResourcePolicy Explained

This article explains how Alibaba Cloud ACK’s four configurable plugins—ElasticWorkload, WorkloadSpread, UnitedDeployment, and ResourcePolicy—provide flexible, on‑demand resource scaling for serverless workloads, compares their architectures, outlines usage scenarios, shows real‑world examples, and discusses their strengths and limitations.

ACKKubernetesOpenKruise
0 likes · 33 min read
Unlocking Serverless Elastic Scaling: ElasticWorkload, WorkloadSpread, UnitedDeployment & ResourcePolicy Explained
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jan 1, 2025 · Industry Insights

How Cloud‑Native Is Reshaping China’s Game Industry and What Elastic Strategies Developers Need

The article analyzes the rapid growth of China's game cloud market, explains why cloud‑native adoption has become industry‑wide, and details practical application‑layer and resource‑layer elasticity strategies—including OpenKruiseGame, state‑aware scaling, and Alibaba Cloud node‑scaling options—to improve performance and reduce costs.

Alibaba CloudCloud NativeGame Development
0 likes · 14 min read
How Cloud‑Native Is Reshaping China’s Game Industry and What Elastic Strategies Developers Need
Yum! Tech Team
Yum! Tech Team
Nov 28, 2024 · Cloud Native

Elastic Scaling Architecture for a Smart Delivery System During Peak Holiday Traffic

The article describes how an operations engineer transforms a complex, multi‑language smart delivery platform into an elastic, container‑native system that automatically scales, registers, and logs services during the high‑load Chinese New Year period using Kubernetes, Docker, init containers, and a configuration center.

Configuration ManagementDockerKubernetes
0 likes · 13 min read
Elastic Scaling Architecture for a Smart Delivery System During Peak Holiday Traffic
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 30, 2024 · Cloud Computing

Using Alibaba Cloud ACK One Registration Cluster for Elastic Hybrid Cloud Deployment

This guide explains how enterprises can overcome IDC data‑center capacity limits by leveraging Alibaba Cloud ACK One registration clusters to achieve flexible, cost‑effective elastic scaling, detailing architecture, registration steps, node‑pool creation, virtual nodes, multi‑level scheduling, and associated command‑line examples.

ACKCloud NativeKubernetes
0 likes · 10 min read
Using Alibaba Cloud ACK One Registration Cluster for Elastic Hybrid Cloud Deployment
JD Cloud Developers
JD Cloud Developers
Aug 23, 2024 · Cloud Native

Scaling Across Clusters: JD Cloud’s Large‑Scale Application Management Practices

At KubeCon + CloudNativeCon 2024 in Hong Kong, JD Cloud presented its cross‑cluster, large‑scale application management practice, detailing a federated Serverless model that oversees over 10,000 nodes, improves resource utilization, simplifies multi‑cluster scheduling, and offers efficient elastic scaling solutions.

Cloud NativeJD CloudKubernetes
0 likes · 3 min read
Scaling Across Clusters: JD Cloud’s Large‑Scale Application Management Practices
Baidu Geek Talk
Baidu Geek Talk
Aug 12, 2024 · Cloud Computing

How Baidu’s Cloud HPC Transforms Hybrid Computing for Enterprises

The article explains how Baidu Cloud's CHPC platform offers a full‑stack, hybrid‑cloud HPC solution with modern hardware, dynamic scheduling, elastic scaling, and performance tuning, enabling enterprises in life sciences and manufacturing to cut costs, accelerate innovation, and efficiently manage compute workloads.

Dynamic SchedulingHPCcase study
0 likes · 8 min read
How Baidu’s Cloud HPC Transforms Hybrid Computing for Enterprises
Huolala Tech
Huolala Tech
Aug 1, 2024 · Big Data

How Huolala’s Big Data Team Cut Costs and Boosted Efficiency with an Elastic Architecture

Huolala’s three‑year‑old big data team shares how they tackled cost, operations, and analysis inefficiencies by building a layered, elastic infrastructure, adopting ARM servers, automating workflows, embracing cloud‑native practices, and implementing multi‑engine routing, achieving 20‑30% cost savings and higher performance.

Cloud NativeCost Optimizationelastic scaling
0 likes · 12 min read
How Huolala’s Big Data Team Cut Costs and Boosted Efficiency with an Elastic Architecture
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
May 23, 2024 · Cloud Native

Cloud-Native Architecture and Tiered Storage for Xiaohongshu Kafka: Cost Reduction, Elastic Migration, and Performance Optimization

Xiaohongshu's big-data storage team built cloud-native architecture with tiered storage, containerized Kafka, and custom load balancer, cutting storage costs up to 60%, enabling minute‑level elastic migration, improving scaling efficiency tenfold, and boosting performance via caching and batch reads.

Cost OptimizationKafkaelastic scaling
0 likes · 20 min read
Cloud-Native Architecture and Tiered Storage for Xiaohongshu Kafka: Cost Reduction, Elastic Migration, and Performance Optimization
DataFunSummit
DataFunSummit
May 20, 2024 · Big Data

Real-Time High-Performance Analytics on Data Lakes with CloudLakehouse Multi-Cluster Architecture

This article explains how CloudLakehouse’s Multi‑Cluster elastic architecture enables high‑concurrency, low‑latency real‑time analytics on data lakes by addressing storage‑compute separation, dynamic caching, and automated scaling, providing a cost‑effective solution for customer‑facing data products.

Cloud NativeMulti-ClusterReal-time analytics
0 likes · 18 min read
Real-Time High-Performance Analytics on Data Lakes with CloudLakehouse Multi-Cluster Architecture
Tencent Cloud Developer
Tencent Cloud Developer
Nov 15, 2023 · Game Development

Case Study: KMS Game Company’s Cloud‑Native Architecture and Elastic Microservice Deployment on Tencent Cloud

Japanese game developer KMS migrated from Azure to Tencent Cloud, adopting a cloud‑native architecture with Tencent’s Elastic Microservice platform that provides timed and metric‑based scaling, CI/CD pipelines, and batch upgrades, resulting in roughly 50% cost savings, 15% performance gains and 50% latency reduction.

CI/CDGame DevelopmentMicroservices
0 likes · 9 min read
Case Study: KMS Game Company’s Cloud‑Native Architecture and Elastic Microservice Deployment on Tencent Cloud
HelloTech
HelloTech
Aug 1, 2023 · Cloud Native

Elastic Scaling Practices in Cloud‑Native Kubernetes Environments

To overcome native HPA limits and business‑specific constraints in a fully containerized, cloud‑native Kubernetes environment, we implemented a dual‑threshold water‑level and scheduled scaling engine, hybrid‑cloud ClusterAutoScale, mixed‑deployment resource prioritization, and comprehensive Prometheus‑based observability, achieving higher utilization, lower costs, and a roadmap toward deeper optimization and AIOps.

Auto ScalingCloud NativeKubernetes
0 likes · 10 min read
Elastic Scaling Practices in Cloud‑Native Kubernetes Environments
Tencent Cloud Developer
Tencent Cloud Developer
May 8, 2023 · Cloud Native

Modernizing Tencent Cloud Log Service (CLS): Cloud‑Native Architecture, Challenges, and Benefits

Tencent Cloud Log Service was modernized by migrating over 95 % of its components to a cloud‑native stack of containers, Kubernetes, and declarative APIs, addressing chaotic infrastructure, stateful‑to‑stateless conversion, configuration drift, upgrade risk, elastic scaling, traffic protection and observability, which cut costs by more than 20 million CNY, reduced scaling latency by 90 %, and achieved over 99.99 % availability with petabyte‑scale burst handling.

Configuration ManagementLog Servicearchitecture
0 likes · 15 min read
Modernizing Tencent Cloud Log Service (CLS): Cloud‑Native Architecture, Challenges, and Benefits
Bilibili Tech
Bilibili Tech
Mar 28, 2023 · Operations

Bilibili's Capacity Management Platform: Design, Implementation, and S12 Event Support

Bilibili's capacity management platform integrates foundational data, VPA/HPA scaling, quota control, and visual dashboards to streamline resource usage, cut costs, and boost stability, delivering event‑specific support such as for S12 that slashes release issues by 80% and online failures by 90%, while planning predictive scaling and risk control.

BilibiliResource OptimizationSRE
0 likes · 13 min read
Bilibili's Capacity Management Platform: Design, Implementation, and S12 Event Support
Architecture & Thinking
Architecture & Thinking
Mar 19, 2023 · Cloud Native

How Baidu Feed Achieved Serverless Scaling with Multi‑Dimensional Service Profiles

This article explains how Baidu's Feed recommendation backend adopted a serverless approach, building elastic, traffic, and capacity portraits for each micro‑service to enable predictive, load‑feedback, and timed scaling, thereby reducing resource waste and operational costs in a cloud‑native environment.

Cloud NativeServerlessService Profiling
0 likes · 17 min read
How Baidu Feed Achieved Serverless Scaling with Multi‑Dimensional Service Profiles
Baidu Tech Salon
Baidu Tech Salon
Mar 15, 2023 · Industry Insights

How Baidu Feed Scales Millions of Users with Serverless: A Multi‑Dimensional Elasticity Blueprint

This article details Baidu Feed's serverless transformation, describing how multi‑dimensional service profiling (elasticity, traffic, capacity) and three elastic strategies—predictive, load‑feedback, and timed—enable automatic scaling that reduces resource waste while maintaining 24/7 stability for billions of users.

Baidu FeedCloud NativeOperations
0 likes · 19 min read
How Baidu Feed Scales Millions of Users with Serverless: A Multi‑Dimensional Elasticity Blueprint
Baidu Geek Talk
Baidu Geek Talk
Mar 15, 2023 · Industry Insights

How Baidu Feed Scaled to Serverless with Multi‑Dimensional Service Profiles

This article explains how Baidu Feed’s backend services were transformed to a serverless model by building elastic, traffic, and capacity profiles for each service, enabling predictive, load‑feedback, and timed scaling strategies that automatically adjust resources with traffic fluctuations, reduce costs, and maintain stability.

Cloud NativeServerlessService Profiling
0 likes · 19 min read
How Baidu Feed Scaled to Serverless with Multi‑Dimensional Service Profiles
Huolala Tech
Huolala Tech
Sep 29, 2022 · Big Data

How Huolala Cuts Big Data Costs with Hybrid Cloud Strategies

This article details Huolala's comprehensive big‑data cost‑control system—covering data‑asset measurement, budgeting, auxiliary governance, storage tiering, and elastic compute management—to dramatically reduce both storage and compute expenses while maintaining service quality across diverse workloads.

Big Dataelastic scalingresource budgeting
0 likes · 21 min read
How Huolala Cuts Big Data Costs with Hybrid Cloud Strategies
Tencent Cloud Middleware
Tencent Cloud Middleware
Aug 9, 2022 · Cloud Native

How to Safeguard Microservices with Smart Rate‑Limiting Strategies

This article explains why service rate limiting is essential for protecting backend systems from traffic spikes, outlines global, tag‑based and dynamic throttling models, compares common algorithms, shows TSF’s architecture and configuration, and provides practical testing and scaling guidance for high‑traffic e‑commerce scenarios.

MicroservicesPerformance Testingelastic scaling
0 likes · 18 min read
How to Safeguard Microservices with Smart Rate‑Limiting Strategies
Cloud Native Technology Community
Cloud Native Technology Community
Jun 22, 2022 · Industry Insights

How to Slash Cloud‑Native Costs: Practical Steps for Better Resource Utilization

This article analyzes the low server utilization problem in modern cloud‑native environments, presents industry survey data, and outlines a four‑step framework—including observability, optimal public‑cloud usage, elasticity sharing, and remote deployment—to help enterprises dramatically reduce cloud costs while maintaining performance.

Cloud NativeCost OptimizationKubernetes
0 likes · 23 min read
How to Slash Cloud‑Native Costs: Practical Steps for Better Resource Utilization
IT Architects Alliance
IT Architects Alliance
Jun 14, 2022 · Backend Development

8 Essential Backend Architecture Patterns Every Engineer Should Master

This article explores eight common backend architecture patterns—from simple single‑database setups to microservices, caching, sharding, elastic scaling, and multi‑region deployments—detailing their design principles, typical use cases, advantages, drawbacks, and practical implementation steps.

Backend ArchitectureDesign PatternsMicroservices
0 likes · 23 min read
8 Essential Backend Architecture Patterns Every Engineer Should Master
Alibaba Cloud Developer
Alibaba Cloud Developer
May 5, 2022 · Cloud Computing

How Reserved Instances and Idle Billing Slash Serverless Costs by 30%

This article walks through a developer's journey from traditional and Kubernetes architectures to Alibaba Cloud Function Compute, explaining how reserved instances, pre‑warming, and the new idle‑billing feature together deliver high elasticity, zero‑maintenance operation, and up to 30% cost reduction for latency‑sensitive online services.

Idle BillingServerlesselastic scaling
0 likes · 9 min read
How Reserved Instances and Idle Billing Slash Serverless Costs by 30%
Top Architect
Top Architect
Apr 30, 2022 · Backend Development

Scaling Strategies, Hardware Expansion, and Distributed ID Generation in Backend Systems

The article explains why capacity expansion is needed, compares whole‑machine and component‑level scaling, introduces the AKF splitting principle, discusses challenges of distributed architectures, and reviews database clustering and distributed ID generation techniques such as UUID and Snowflake.

Backend Architecturedistributed-idelastic scaling
0 likes · 12 min read
Scaling Strategies, Hardware Expansion, and Distributed ID Generation in Backend Systems
ITPUB
ITPUB
Apr 27, 2022 · Artificial Intelligence

How 58’s WPAI Platform Boosted AI Resource Utilization by Over 50%

This article details the design and optimization of 58.com’s WPAI machine learning platform, covering background, training‑task scheduling, elastic inference scaling, offline‑online resource mixing, and model‑inference acceleration, and shows how these techniques collectively raised GPU usage by 51% and CPU usage by 38% while cutting costs.

AI PlatformGPU utilizationInference Acceleration
0 likes · 26 min read
How 58’s WPAI Platform Boosted AI Resource Utilization by Over 50%
Architect
Architect
Apr 25, 2022 · Cloud Native

Designing a Cloud‑Native Intelligent Data Architecture for Baidu Search Platform

This article presents a cloud‑native redesign of Baidu's search middle‑platform that introduces intelligent data management, elastic scaling, on‑demand resource allocation, precise fan‑out, and localized computation to address efficiency, cost, stability, and performance challenges of large‑scale search workloads.

Data ManagementSearch Architecturecloud-native
0 likes · 14 min read
Designing a Cloud‑Native Intelligent Data Architecture for Baidu Search Platform
IT Architects Alliance
IT Architects Alliance
Feb 4, 2022 · Backend Development

Designing a Scalable Architecture for Million‑Level DAU Systems

The article outlines a comprehensive backend architecture for handling million‑to‑tens‑of‑million daily active users, covering DNS routing, L4/L7 load balancing, monolithic versus microservice deployment, caching, database sharding, hybrid‑cloud strategies, elastic scaling, and multi‑level degradation mechanisms.

Microservicesdatabase shardingelastic scaling
0 likes · 11 min read
Designing a Scalable Architecture for Million‑Level DAU Systems
Top Architect
Top Architect
Feb 1, 2022 · Backend Development

Designing a Scalable Backend Architecture for Millions of Daily Active Users

The article outlines a comprehensive backend architecture for handling millions of daily active users, covering DNS routing, layer‑4/7 load balancing, monolithic versus microservice deployment, caching, database sharding, hybrid‑cloud strategies, elastic scaling, and multi‑level degradation mechanisms.

Backend ArchitectureScalabilityelastic scaling
0 likes · 12 min read
Designing a Scalable Backend Architecture for Millions of Daily Active Users
58 Tech
58 Tech
Jan 10, 2022 · Artificial Intelligence

Resource Utilization Optimization Practices for the 58.com Machine Learning Platform (WPAI)

This article details the 58.com WPAI machine learning platform's architecture and the optimizations applied to training task scheduling, inference service elastic scaling, and offline‑online resource mixing, demonstrating how these techniques significantly improve GPU/CPU utilization and inference performance across both GPU and CPU environments.

AIInference AccelerationKubernetes
0 likes · 27 min read
Resource Utilization Optimization Practices for the 58.com Machine Learning Platform (WPAI)
Tencent Architect
Tencent Architect
Dec 30, 2021 · Databases

Practices and Exploration of Disaster Recovery in Tencent Cloud‑Native Database TDSQL‑C (formerly CynosDB)

This article examines the architecture differences between cloud‑native TDSQL‑C and traditional MySQL, outlines TDSQL‑C’s elastic, serverless, low‑latency features, compares MySQL disaster‑recovery models, and details the multi‑dimensional disaster‑recovery system and its cross‑AZ/Region challenges and solutions.

TDSQL-Ccloud-native databasedisaster recovery
0 likes · 9 min read
Practices and Exploration of Disaster Recovery in Tencent Cloud‑Native Database TDSQL‑C (formerly CynosDB)
Baidu Geek Talk
Baidu Geek Talk
Dec 15, 2021 · Cloud Native

Cloud-Native Intelligent Data Management Architecture for Baidu Search Platform

Cloud-native redesign of Baidu's search middle platform introduces partition, shard, replica, and addressing controllers that enable elastic scaling, on-demand resource allocation, precise fan‑out, and localized computation, reducing capacity adjustment time from weeks to hours, cutting costs by 30‑80%, raising availability above 99.9% and halving query latency.

Data ManagementSearch Architecturecloud-native
0 likes · 17 min read
Cloud-Native Intelligent Data Management Architecture for Baidu Search Platform
IT Architects Alliance
IT Architects Alliance
Nov 9, 2021 · Operations

Why Scale and How: Hardware Expansion, AKF Splitting Principle, Distributed ID Generation, and Elastic Scaling

The article explains the reasons for scaling, outlines hardware and component expansion strategies, introduces the AKF splitting principle for distributed systems, discusses database clustering and distributed ID generation methods such as UUID and Snowflake, and describes elastic scaling challenges and solutions.

Distributed SystemsID generationcapacity planning
0 likes · 14 min read
Why Scale and How: Hardware Expansion, AKF Splitting Principle, Distributed ID Generation, and Elastic Scaling
Architecture Digest
Architecture Digest
Nov 9, 2021 · Operations

Scaling Strategies: Hardware Expansion, AKF Partitioning, and Distributed ID Generation

This article explains why scaling is necessary, outlines hardware and component expansion strategies, introduces the AKF partitioning principle for horizontal and vertical scaling, discusses challenges after splitting, and reviews database clustering and distributed ID generation techniques such as UUID and Snowflake, highlighting their advantages and drawbacks.

Distributed SystemsID generationdatabase clustering
0 likes · 15 min read
Scaling Strategies: Hardware Expansion, AKF Partitioning, and Distributed ID Generation
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Nov 8, 2021 · Operations

How to Scale Your System: From Hardware Expansion to Distributed ID Strategies

This article explains why capacity expansion is necessary, outlines hardware and component scaling strategies, introduces the AKF splitting principle for Redis clusters, discusses challenges of distributed scaling such as data consistency and high concurrency, and reviews database clustering and distributed ID generation methods like UUID and Snowflake.

AKF principlecapacity planningdatabase clustering
0 likes · 14 min read
How to Scale Your System: From Hardware Expansion to Distributed ID Strategies
Java High-Performance Architecture
Java High-Performance Architecture
Nov 1, 2021 · Operations

Why Scaling Matters: Hardware Expansion, Distributed ID & Elastic Capacity Strategies

The article explains why performance optimization has limits and outlines practical scaling methods—including whole‑machine and component upgrades, AKF splitting, database clustering, distributed ID generation (UUID and Snowflake), and elastic scaling—while also discussing the challenges each approach introduces.

ID generationcapacity planningdatabase clustering
0 likes · 14 min read
Why Scaling Matters: Hardware Expansion, Distributed ID & Elastic Capacity Strategies
21CTO
21CTO
Oct 30, 2021 · Operations

Scaling Systems: Hardware Expansion, Distributed IDs, and Elastic Capacity

This article explains why capacity expansion is necessary, outlines hardware and component scaling strategies, introduces AKF splitting principles, discusses database clustering and distributed ID generation methods such as UUID and Snowflake, and highlights the benefits and challenges of elastic scaling.

capacity planningdistributed-idelastic scaling
0 likes · 13 min read
Scaling Systems: Hardware Expansion, Distributed IDs, and Elastic Capacity
DataFunTalk
DataFunTalk
Oct 17, 2021 · Databases

Databend: A Cloud‑Native Modern Data Warehouse Architecture

This article explains how Databend, a cloud‑native OLAP data warehouse, addresses modern data‑warehouse challenges by separating storage and compute, providing elastic scaling, multi‑cloud support, and efficient query planning and execution to deliver low‑cost, on‑demand analytics.

DatabendOLAParchitecture
0 likes · 12 min read
Databend: A Cloud‑Native Modern Data Warehouse Architecture
DataFunSummit
DataFunSummit
Oct 17, 2021 · Databases

Databend: Cloud‑Native Modern Data Warehouse Architecture and Features

This article explains how Databend, a cloud‑native data warehouse, addresses modern OLAP requirements through storage‑compute separation, elastic scaling, multi‑cloud support, advanced query planning, and serverless‑ready design, contrasting it with traditional data warehouse limitations.

DatabendQuery Planningcloud-native
0 likes · 11 min read
Databend: Cloud‑Native Modern Data Warehouse Architecture and Features
Tencent Database Technology
Tencent Database Technology
Sep 6, 2021 · Cloud Native

Cloud‑Native ClickHouse Architecture and Design Overview

This article presents a comprehensive design of a cloud‑native ClickHouse OLAP system, detailing its three‑layer architecture, storage‑compute separation, unified metadata management, high‑availability mechanisms, elastic scaling, cost reductions, and future enhancements for multi‑replica and MPP query support.

Cloud NativeDistributed SystemsOLAP
0 likes · 19 min read
Cloud‑Native ClickHouse Architecture and Design Overview
dbaplus Community
dbaplus Community
Jun 17, 2021 · Cloud Native

How Dada Achieved Seamless Elastic Scaling for Massive Delivery Peaks

Facing surges during holidays and major shopping events, Dada’s DevOps team built a cloud‑native elastic scaling system that combines fine‑grained capacity management, multi‑cloud support, metric‑driven auto‑scaling, and extreme‑scale down strategies, delivering stable delivery performance while cutting costs.

Auto ScalingOperationscapacity management
0 likes · 17 min read
How Dada Achieved Seamless Elastic Scaling for Massive Delivery Peaks
High Availability Architecture
High Availability Architecture
May 3, 2021 · Operations

Meituan Elastic Scaling System: Evolution, Challenges, and Business Enablement

This article introduces Meituan's elastic scaling platform, detailing its evolution from version 1.0 to 2.0, the technical and operational challenges faced, the strategies adopted for promotion and resource management, and several real‑world business scenarios where elastic scaling reduces cost and improves reliability.

MeituanOperationsResource Management
0 likes · 24 min read
Meituan Elastic Scaling System: Evolution, Challenges, and Business Enablement
Meituan Technology Team
Meituan Technology Team
Apr 22, 2021 · Cloud Native

Meituan Serverless Platform: Architecture, Practices, and Optimization

Meituan’s Nest Serverless platform, built on native Kubernetes with Knative‑inspired components, delivers elastic scaling, rapid cold‑start reduction, multi‑region high availability, and integrated developer tools, enabling higher resource utilization, lower costs, and up to 40 % faster development across diverse business scenarios.

Cloud NativeFunction as a ServiceKubernetes
0 likes · 30 min read
Meituan Serverless Platform: Architecture, Practices, and Optimization
Dada Group Technology
Dada Group Technology
Apr 19, 2021 · Operations

Exploring Elastic Capacity and Automated Scaling Architecture at Dada Group

This article presents Dada Group's comprehensive approach to elastic capacity management and automated scaling, detailing the challenges faced during traffic spikes, the design of a cloud‑native auto‑scaler, multi‑metric observability, decision‑making logic, execution mechanisms, extreme scaling practices, and future optimization directions.

Auto ScalingCloud NativeSRE
0 likes · 15 min read
Exploring Elastic Capacity and Automated Scaling Architecture at Dada Group
High Availability Architecture
High Availability Architecture
Apr 15, 2021 · Cloud Native

Meituan Elastic Scaling System: Architecture, Challenges, and Business Enablement

This article presents Meituan's elastic scaling platform, detailing its evolution from Hulk 1.0 to Hulk 2.0, the technical and operational challenges faced, the solutions implemented for resource management and multi‑tenant scaling, and real‑world business scenarios such as holiday, peak‑hour, and emergency capacity provisioning.

MeituanOperationsResource Management
0 likes · 22 min read
Meituan Elastic Scaling System: Architecture, Challenges, and Business Enablement
Meituan Technology Team
Meituan Technology Team
Apr 1, 2021 · Cloud Native

Meituan Elastic Scaling System: Architecture, Challenges, and Business Enablement

Meituan's elastic scaling system evolved from Hulk 1.0 on OpenStack to Hulk 2.0 on Kubernetes, adding micro‑services, quota management, hybrid‑cloud pools, and automated scheduling, thereby delivering cost savings, high‑availability handling of holiday peaks, delivery spikes, anti‑scraping needs, and SaaS releases, while future plans target stability, usability, and emerging technologies.

Cloud NativeKubernetesMeituan
0 likes · 21 min read
Meituan Elastic Scaling System: Architecture, Challenges, and Business Enablement
Alibaba Cloud Native
Alibaba Cloud Native
Mar 27, 2021 · Cloud Native

Why Knative? Simplifying Serverless on Kubernetes with ASK Integration

This article explains why Knative is needed to simplify Kubernetes‑based serverless workloads, describes its core modules and traffic‑based gray release capabilities, and shows how Alibaba Cloud's ASK platform integrates with Knative to reduce operational complexity, improve elasticity, and lower costs.

ASKCloud NativeKnative
0 likes · 10 min read
Why Knative? Simplifying Serverless on Kubernetes with ASK Integration
Volcano Engine Developer Services
Volcano Engine Developer Services
Mar 23, 2021 · Cloud Native

How Douyin Handled 70B Red Packet Interactions in 27 Days with Cloud‑Native Magic

In just 27 days, Douyin and Volcano Engine's cloud‑native team built a Kubernetes‑based, elastically scalable infrastructure that supported 703 billion red‑packet interactions and over a trillion live‑stream views during the 2021 Spring Festival Gala, ensuring zero downtime and seamless user experience.

Cloud NativeEdge Computingdistributed storage
0 likes · 12 min read
How Douyin Handled 70B Red Packet Interactions in 27 Days with Cloud‑Native Magic
Xianyu Technology
Xianyu Technology
Dec 17, 2020 · Cloud Native

Elastic Scaling in Serverless Cloud‑Native Applications

Elastic scaling, a cornerstone of Xianyu’s shift to serverless cloud-native architecture, leverages Kubernetes autoscaling components—Cluster‑Autoscaler, HPA, VPA—to dynamically adjust resources via reactive thresholds or predictive models, yet faces challenges like cold‑starts, lack of scale‑to‑zero, and optimal pod‑pool buffering, prompting ongoing research for faster, smarter, safer scaling.

Auto ScalingCloud-nativeKubernetes
0 likes · 19 min read
Elastic Scaling in Serverless Cloud‑Native Applications
dbaplus Community
dbaplus Community
Aug 5, 2020 · Databases

How JIMKV Unifies Cache and Storage to Power High‑Performance Distributed Databases

The article details JD Retail's JIMKV distributed database, explaining its unified cache‑storage architecture, fault‑detection and elastic‑scaling mechanisms, hot‑cold data tiering, read/write amplification mitigation, real‑world product‑detail use case, and future plans for intelligent operations and OLAP support.

KV Storecache storage integrationdistributed database
0 likes · 18 min read
How JIMKV Unifies Cache and Storage to Power High‑Performance Distributed Databases
Youku Technology
Youku Technology
Jul 16, 2020 · Operations

How Alibaba Entertainment Automates Capacity Management and Elastic Scaling

Alibaba Entertainment transformed its capacity management from manual, experience‑based decisions to a fully automated system that continuously evaluates single‑machine performance, identifies performance and success‑rate breakpoints, and drives elastic scaling, dramatically improving resource utilization, availability, and development efficiency across all its applications.

OperationsPerformance Testingautomation
0 likes · 10 min read
How Alibaba Entertainment Automates Capacity Management and Elastic Scaling
DataFunTalk
DataFunTalk
Jun 20, 2020 · Cloud Native

Automated Elastic Scaling for Million‑Scale Core Services and Mixed Workloads on ByteDance's Private Cloud Platform

This article presents ByteDance's private cloud platform TCE architecture and explains how automated elastic scaling, dynamic over‑commit, and mixed‑workload deployment are used to improve resource utilization for millions of services, balancing online peak demand with offline batch tasks.

Cloud NativeKuberneteselastic scaling
0 likes · 25 min read
Automated Elastic Scaling for Million‑Scale Core Services and Mixed Workloads on ByteDance's Private Cloud Platform
Tencent Cloud Developer
Tencent Cloud Developer
May 21, 2020 · Game Development

How Tencent’s Game Server Engine Tackles Low Latency and Cost in Multiplayer Games

This article analyzes the challenges of low‑latency, stable, and cost‑effective online multiplayer games and explains how Tencent's Game Server Engine (GSE) provides elastic scaling, near‑by scheduling, stateful shrinkage, multi‑region disaster recovery, and zero‑downtime updates to meet those demands.

Low latencyTencent GSEcloud gaming
0 likes · 11 min read
How Tencent’s Game Server Engine Tackles Low Latency and Cost in Multiplayer Games
Alibaba Cloud Native
Alibaba Cloud Native
Mar 20, 2020 · Cloud Native

Build Elastic, Low‑Cost Serverless Video Processing on Alibaba Cloud

This article explains how to design and implement an elastic, high‑availability video‑on‑demand solution on Alibaba Cloud using Function Compute and Function Flow, covering simple and full‑featured architectures, performance and cost comparisons, and practical deployment steps.

Cost OptimizationFunction ComputeFunction Flow
0 likes · 16 min read
Build Elastic, Low‑Cost Serverless Video Processing on Alibaba Cloud
Meituan Technology Team
Meituan Technology Team
Sep 12, 2019 · Cloud Native

Meituan HULK: Cloud‑Native Container Cluster Management and Scheduling Practices

Meituan’s HULK platform evolved from an OpenStack‑based scheduler to a Kubernetes‑native container cluster manager, integrating service governance, release, CMDB, and monitoring to automate VM‑to‑container migration, improve resource utilization, and deliver elastic, policy‑driven scheduling and scaling with reduced troubleshooting time and higher SLA compliance.

Cloud NativeCluster SchedulingKubernetes
0 likes · 13 min read
Meituan HULK: Cloud‑Native Container Cluster Management and Scheduling Practices
Tencent Cloud Developer
Tencent Cloud Developer
Aug 20, 2019 · Cloud Native

Why Serverless Is the Next Revolution in Cloud Computing

The article explains what Serverless (or Function as a Service) is, outlines its technical characteristics, business benefits, and ideal application scenarios, and argues that Serverless represents a fundamental shift toward cloud‑native architectures in modern software development.

Cloud NativeEvent-drivenUse Cases
0 likes · 10 min read
Why Serverless Is the Next Revolution in Cloud Computing
Architecture Digest
Architecture Digest
May 29, 2019 · Backend Development

Design and Solutions for High Availability and High Concurrency in Weibo Short Video Service

The article presents a detailed analysis of Weibo's short‑video platform architecture, covering team background, business scenarios, micro‑service design, feed‑pull model, multi‑level distributed caching, multi‑datacenter HA deployment, circuit‑breaker mechanisms, and elastic scaling to achieve high availability under unpredictable traffic spikes.

Backend ArchitectureWeibodistributed cache
0 likes · 12 min read
Design and Solutions for High Availability and High Concurrency in Weibo Short Video Service
Architects' Tech Alliance
Architects' Tech Alliance
Jul 28, 2018 · Databases

Technical Requirements and Architectural Directions for Cloud Databases

The article explains the key technical requirements of cloud databases, such as elastic scaling, compute‑storage separation, multi‑model support and self‑management, and discusses emerging architectural trends like storage‑SQL separation, multi‑model engines, and disaster‑recovery/multi‑active designs for various enterprise scenarios.

cloud databasedbPaaSelastic scaling
0 likes · 16 min read
Technical Requirements and Architectural Directions for Cloud Databases
Architecture Digest
Architecture Digest
Feb 2, 2018 · Cloud Computing

Design and Implementation of an Elastic Scaling Service on Alibaba ECS

This article explains why elastic scaling is needed for variable web traffic, describes how to build a cost‑effective, automatically adjustable service on Alibaba ECS using message queues, service refactoring, Docker deployment, logging, and a real‑time allocation algorithm, and shares practical lessons learned.

Alibaba ECSAllocation AlgorithmDocker
0 likes · 9 min read
Design and Implementation of an Elastic Scaling Service on Alibaba ECS
Efficient Ops
Efficient Ops
Jan 3, 2018 · Operations

How QQ Space Photo Album Handled a 4‑Fold Traffic Surge on New Year’s Day

On December 30, 2017, a sudden wave of users uploading and downloading their 18‑year‑old photos caused QQ Space's album service to experience a four‑times spike in download traffic and a twelve‑times surge in post activity, prompting the operations and development teams to employ capacity monitoring, elastic scaling, flexible architecture, and targeted optimizations to maintain service stability and user experience.

OperationsQQ Spacecapacity planning
0 likes · 10 min read
How QQ Space Photo Album Handled a 4‑Fold Traffic Surge on New Year’s Day
Tencent Cloud Developer
Tencent Cloud Developer
Nov 3, 2017 · Cloud Computing

Handling Massive Traffic Spikes for a Forum Using Elastic Scaling, CDN, and Staticization on Tencent Cloud

To cope with a sudden, X5‑driven traffic surge that overwhelmed a migrated forum’s servers, the team enabled elastic scaling, redirected missing avatars with soft‑404 responses, off‑loaded avatar and post requests to a CDN, and created a pseudo‑static domain, allowing a single modest server to sustain massive loads.

BackendCDNelastic scaling
0 likes · 8 min read
Handling Massive Traffic Spikes for a Forum Using Elastic Scaling, CDN, and Staticization on Tencent Cloud
Qunar Tech Salon
Qunar Tech Salon
Oct 13, 2017 · Operations

WeChat Operational Practices: Elastic Scaling, Cloud Management, Capacity Management, and Automated Scheduling

This article describes WeChat's operational standards, cloud‑native management, capacity planning, and automated scheduling techniques, covering configuration file conventions, name‑service design, cloud migration decisions, hardware‑metric based capacity evaluation, stress‑testing methods, and dynamic resource allocation to ensure efficient, reliable service scaling.

capacity managementcloud automationelastic scaling
0 likes · 25 min read
WeChat Operational Practices: Elastic Scaling, Cloud Management, Capacity Management, and Automated Scheduling
Efficient Ops
Efficient Ops
Apr 11, 2017 · Cloud Computing

How JD Built a Scalable Elastic Cloud Platform: Architecture, Challenges, and Lessons

This article details JD.com's Elastic Cloud 1.0 platform—its massive container deployment, four‑principle philosophy, architectural design, operational challenges, performance optimizations, and the roadmap toward Elastic Cloud 2.0—offering practical insights for large‑scale cloud engineering.

JD.comcloud computingcontainer orchestration
0 likes · 17 min read
How JD Built a Scalable Elastic Cloud Platform: Architecture, Challenges, and Lessons
Ctrip Technology
Ctrip Technology
Mar 17, 2017 · Cloud Computing

Ctrip Container Cloud: Architecture, Elastic Scaling, and Monitoring Practices

This article details Ctrip's journey in building a private container cloud to support rapid business growth, covering elasticity challenges, container deployment principles, orchestration platform choices, network design, operational issues, custom executors, monitoring solutions, and the overarching CDOS system.

DockerMesoscdos
0 likes · 16 min read
Ctrip Container Cloud: Architecture, Elastic Scaling, and Monitoring Practices
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 16, 2017 · Databases

AliCloudDB’s Secrets for Scaling During Double‑11 Traffic

This article explains how AliCloudDB supports the massive traffic of Alibaba’s Double‑11 shopping festival through elastic scaling (both in‑place and cross‑machine upgrades), secure and standard access paths, robust architecture design, read‑write separation, engine and index optimization, high‑availability configurations, performance tuning, and disaster‑recovery strategies.

AliCloudDBelastic scalinghigh availability
0 likes · 12 min read
AliCloudDB’s Secrets for Scaling During Double‑11 Traffic
dbaplus Community
dbaplus Community
Jan 15, 2017 · Databases

How JD’s JIMDB Achieves Zero‑Downtime Scaling and Automatic Failover for Massive Caches

JIMDB is JD’s in‑house distributed cache platform that combines automatic fault detection, seamless online scaling, multi‑language support, and containerized deployment to replace traditional Memcached/Redis solutions, offering features such as one‑click cluster creation, elastic expansion, lossless scaling, and comprehensive monitoring for high‑traffic e‑commerce services.

CacheDistributed Systemselastic scaling
0 likes · 23 min read
How JD’s JIMDB Achieves Zero‑Downtime Scaling and Automatic Failover for Massive Caches
Qunar Tech Salon
Qunar Tech Salon
Dec 25, 2015 · Backend Development

Design and Implementation of Elastic-Job: A Distributed Job Scheduling Framework

Elastic-Job is a Java-based, decentralized distributed job scheduling framework that addresses limitations of existing solutions by providing features such as distributed coordination via Zookeeper, parallel task execution, elastic scaling, centralized management, customizable workflow tasks, and robust non‑functional requirements, with future plans for multi‑language support and enhanced monitoring.

Distributed SchedulingZooKeeperelastic scaling
0 likes · 14 min read
Design and Implementation of Elastic-Job: A Distributed Job Scheduling Framework

Designing a High‑Availability, Auto‑Scaling KV Storage System Based on Memcached and Redis

This article examines common NoSQL key‑value stores such as Memcached and Redis, compares their strengths and limitations, and proposes a distributed architecture with routing, storage, management, and migration nodes that achieves high availability, automatic fault‑tolerance, load balancing, and elastic scaling.

KV StoreMemcachedelastic scaling
0 likes · 15 min read
Designing a High‑Availability, Auto‑Scaling KV Storage System Based on Memcached and Redis
Baidu Tech Salon
Baidu Tech Salon
Mar 10, 2014 · Cloud Computing

Building a Scalable App Engine Platform: Architecture, Open‑Source Tools & Best Practices

This article provides a comprehensive overview of App Engine as a PaaS solution, analyzes leading platforms such as CloudFoundry and SAE, outlines architectural requirements, and presents practical implementation guidance using open‑source tools like Nginx, Scribe, and Storm for elastic, scalable cloud services.

App EnginePaaSarchitecture
0 likes · 20 min read
Building a Scalable App Engine Platform: Architecture, Open‑Source Tools & Best Practices