Tag

elastic scaling

0 views collected around this technical thread.

Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 5, 2025 · Cloud Native

Using Fluid Cloud‑Native Data Caching to Boost Performance and Elasticity of a Quantitative Research Platform on Alibaba Cloud

This article describes how JoinQuant built a cloud‑native quantitative research platform on Alibaba Cloud, identified performance, cost, data‑management, and security challenges, and solved them with Fluid’s JindoRuntime data‑caching, elastic scaling, and Python‑driven workflows, achieving dramatic speed and cost improvements.

Data CachingFluidKubernetes
0 likes · 18 min read
Using Fluid Cloud‑Native Data Caching to Boost Performance and Elasticity of a Quantitative Research Platform on Alibaba Cloud
Architecture and Beyond
Architecture and Beyond
Feb 6, 2025 · Operations

Analyzing DeepSeek’s Availability Issues and Applying Traditional Internet Reliability Strategies to AIGC

This article examines DeepSeek’s frequent service interruptions, contrasts the inherent reliability challenges of AIGC products with traditional internet applications, and proposes adopting proven isolation, rate‑limiting, and elastic‑scaling techniques to improve AI service availability and user experience.

AIGCDeepSeekRate Limiting
0 likes · 12 min read
Analyzing DeepSeek’s Availability Issues and Applying Traditional Internet Reliability Strategies to AIGC
DataFunSummit
DataFunSummit
Feb 6, 2025 · Big Data

Migrating Big Data Workloads to Cloud‑Native Kubernetes: Challenges, Solutions, and Lessons from OPPO

This article describes how OPPO's big‑data team transitioned from traditional IDC and EMR environments to a cloud‑native Kubernetes architecture, detailing the motivations, design principles, elastic scaling challenges, custom solutions, and future directions for large‑scale data processing on the cloud.

Big DataKubernetesMulti-Cloud
0 likes · 18 min read
Migrating Big Data Workloads to Cloud‑Native Kubernetes: Challenges, Solutions, and Lessons from OPPO
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jan 17, 2025 · Artificial Intelligence

Elastic Scaling of Large Language Model Inference on Alibaba Cloud ACK with Knative, ResourcePolicy, and Fluid

This article explains how to reduce inference cost and improve performance for large language models on Alibaba Cloud ACK by using Knative's request‑based autoscaling, custom ResourcePolicy priority scheduling, and Fluid data‑caching to achieve elastic scaling, resource pre‑emption, and faster model loading.

FluidInferenceKnative
0 likes · 22 min read
Elastic Scaling of Large Language Model Inference on Alibaba Cloud ACK with Knative, ResourcePolicy, and Fluid
Yum! Tech Team
Yum! Tech Team
Nov 28, 2024 · Cloud Native

Elastic Scaling Architecture for a Smart Delivery System During Peak Holiday Traffic

The article describes how an operations engineer transforms a complex, multi‑language smart delivery platform into an elastic, container‑native system that automatically scales, registers, and logs services during the high‑load Chinese New Year period using Kubernetes, Docker, init containers, and a configuration center.

Configuration ManagementDockerKubernetes
0 likes · 13 min read
Elastic Scaling Architecture for a Smart Delivery System During Peak Holiday Traffic
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 30, 2024 · Cloud Computing

Using Alibaba Cloud ACK One Registration Cluster for Elastic Hybrid Cloud Deployment

This guide explains how enterprises can overcome IDC data‑center capacity limits by leveraging Alibaba Cloud ACK One registration clusters to achieve flexible, cost‑effective elastic scaling, detailing architecture, registration steps, node‑pool creation, virtual nodes, multi‑level scheduling, and associated command‑line examples.

AckAlibaba CloudKubernetes
0 likes · 10 min read
Using Alibaba Cloud ACK One Registration Cluster for Elastic Hybrid Cloud Deployment
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
May 23, 2024 · Cloud Native

Cloud-Native Architecture and Tiered Storage for Xiaohongshu Kafka: Cost Reduction, Elastic Migration, and Performance Optimization

Xiaohongshu's big-data storage team built cloud-native architecture with tiered storage, containerized Kafka, and custom load balancer, cutting storage costs up to 60%, enabling minute‑level elastic migration, improving scaling efficiency tenfold, and boosting performance via caching and batch reads.

ContainerizationCost OptimizationKafka
0 likes · 20 min read
Cloud-Native Architecture and Tiered Storage for Xiaohongshu Kafka: Cost Reduction, Elastic Migration, and Performance Optimization
DataFunSummit
DataFunSummit
May 20, 2024 · Big Data

Real-Time High-Performance Analytics on Data Lakes with CloudLakehouse Multi-Cluster Architecture

This article explains how CloudLakehouse’s Multi‑Cluster elastic architecture enables high‑concurrency, low‑latency real‑time analytics on data lakes by addressing storage‑compute separation, dynamic caching, and automated scaling, providing a cost‑effective solution for customer‑facing data products.

Big Datacloud nativedata lake
0 likes · 18 min read
Real-Time High-Performance Analytics on Data Lakes with CloudLakehouse Multi-Cluster Architecture
Tencent Cloud Developer
Tencent Cloud Developer
Nov 15, 2023 · Game Development

Case Study: KMS Game Company’s Cloud‑Native Architecture and Elastic Microservice Deployment on Tencent Cloud

Japanese game developer KMS migrated from Azure to Tencent Cloud, adopting a cloud‑native architecture with Tencent’s Elastic Microservice platform that provides timed and metric‑based scaling, CI/CD pipelines, and batch upgrades, resulting in roughly 50% cost savings, 15% performance gains and 50% latency reduction.

CI/CDGame developmentTencent Cloud
0 likes · 9 min read
Case Study: KMS Game Company’s Cloud‑Native Architecture and Elastic Microservice Deployment on Tencent Cloud
ByteDance Cloud Native
ByteDance Cloud Native
Oct 30, 2023 · Cloud Native

Unlocking Elastic Resource Sharing: TikTok’s Cloud‑Native Mix‑Mode Scaling

This article explains how TikTok’s cloud‑native platform leverages elastic scaling, monitoring, and quota systems to dynamically share resources between online, latency‑sensitive services and offline, batch workloads, improving utilization while preserving service stability across tidal traffic patterns.

Kubernetescloud nativeelastic scaling
0 likes · 19 min read
Unlocking Elastic Resource Sharing: TikTok’s Cloud‑Native Mix‑Mode Scaling
HelloTech
HelloTech
Aug 1, 2023 · Cloud Native

Elastic Scaling Practices in Cloud‑Native Kubernetes Environments

To overcome native HPA limits and business‑specific constraints in a fully containerized, cloud‑native Kubernetes environment, we implemented a dual‑threshold water‑level and scheduled scaling engine, hybrid‑cloud ClusterAutoScale, mixed‑deployment resource prioritization, and comprehensive Prometheus‑based observability, achieving higher utilization, lower costs, and a roadmap toward deeper optimization and AIOps.

Kubernetesauto scalingcloud native
0 likes · 10 min read
Elastic Scaling Practices in Cloud‑Native Kubernetes Environments
Tencent Cloud Developer
Tencent Cloud Developer
May 8, 2023 · Cloud Native

Modernizing Tencent Cloud Log Service (CLS): Cloud‑Native Architecture, Challenges, and Benefits

Tencent Cloud Log Service was modernized by migrating over 95 % of its components to a cloud‑native stack of containers, Kubernetes, and declarative APIs, addressing chaotic infrastructure, stateful‑to‑stateless conversion, configuration drift, upgrade risk, elastic scaling, traffic protection and observability, which cut costs by more than 20 million CNY, reduced scaling latency by 90 %, and achieved over 99.99 % availability with petabyte‑scale burst handling.

Configuration ManagementObservabilityarchitecture
0 likes · 15 min read
Modernizing Tencent Cloud Log Service (CLS): Cloud‑Native Architecture, Challenges, and Benefits
Bilibili Tech
Bilibili Tech
Mar 28, 2023 · Operations

Bilibili's Capacity Management Platform: Design, Implementation, and S12 Event Support

Bilibili's capacity management platform integrates foundational data, VPA/HPA scaling, quota control, and visual dashboards to streamline resource usage, cut costs, and boost stability, delivering event‑specific support such as for S12 that slashes release issues by 80% and online failures by 90%, while planning predictive scaling and risk control.

BilibiliSREcapacity visualization
0 likes · 13 min read
Bilibili's Capacity Management Platform: Design, Implementation, and S12 Event Support
Architecture & Thinking
Architecture & Thinking
Mar 19, 2023 · Cloud Native

How Baidu Feed Achieved Serverless Scaling with Multi‑Dimensional Service Profiles

This article explains how Baidu's Feed recommendation backend adopted a serverless approach, building elastic, traffic, and capacity portraits for each micro‑service to enable predictive, load‑feedback, and timed scaling, thereby reducing resource waste and operational costs in a cloud‑native environment.

Backend ServicesService Profilingcloud native
0 likes · 17 min read
How Baidu Feed Achieved Serverless Scaling with Multi‑Dimensional Service Profiles
Architecture & Thinking
Architecture & Thinking
Nov 10, 2022 · Backend Development

Mastering Traffic Spikes: Rate Limiting Strategies for Resilient Services

This article explores how sudden traffic surges can cause service avalanches and presents cloud‑native scaling, various rate‑limiting algorithms (fixed window, sliding window, token bucket, leaky bucket) and practical fallback techniques to protect backend systems and ensure graceful degradation.

Rate Limitingbackendcircuit breaker
0 likes · 10 min read
Mastering Traffic Spikes: Rate Limiting Strategies for Resilient Services
Tencent Cloud Developer
Tencent Cloud Developer
Sep 30, 2022 · Cloud Computing

Understanding GPU Computing and Cloud-Based GPU Solutions

The article explains how massive parallel pixel calculations demand GPUs, whose high cost and inflexibility are solved by Tencent Cloud’s elastic, virtualized GPU services—including vGPU, qGPU, TACO abstraction, and spot instances—delivering up to 16 EFLOPS for AI, scientific, graphics, and video workloads.

Cloud GPUGPU computingParallel Computing
0 likes · 5 min read
Understanding GPU Computing and Cloud-Based GPU Solutions
Top Architect
Top Architect
Apr 30, 2022 · Backend Development

Scaling Strategies, Hardware Expansion, and Distributed ID Generation in Backend Systems

The article explains why capacity expansion is needed, compares whole‑machine and component‑level scaling, introduces the AKF splitting principle, discusses challenges of distributed architectures, and reviews database clustering and distributed ID generation techniques such as UUID and Snowflake.

ScalingUUIDbackend architecture
0 likes · 12 min read
Scaling Strategies, Hardware Expansion, and Distributed ID Generation in Backend Systems
Architect
Architect
Apr 25, 2022 · Cloud Native

Designing a Cloud‑Native Intelligent Data Architecture for Baidu Search Platform

This article presents a cloud‑native redesign of Baidu's search middle‑platform that introduces intelligent data management, elastic scaling, on‑demand resource allocation, precise fan‑out, and localized computation to address efficiency, cost, stability, and performance challenges of large‑scale search workloads.

cloud nativedata managementelastic scaling
0 likes · 14 min read
Designing a Cloud‑Native Intelligent Data Architecture for Baidu Search Platform
IT Architects Alliance
IT Architects Alliance
Feb 4, 2022 · Backend Development

Designing a Scalable Architecture for Million‑Level DAU Systems

The article outlines a comprehensive backend architecture for handling million‑to‑tens‑of‑million daily active users, covering DNS routing, L4/L7 load balancing, monolithic versus microservice deployment, caching, database sharding, hybrid‑cloud strategies, elastic scaling, and multi‑level degradation mechanisms.

Load Balancingdatabase shardingelastic scaling
0 likes · 11 min read
Designing a Scalable Architecture for Million‑Level DAU Systems
Top Architect
Top Architect
Feb 1, 2022 · Backend Development

Designing a Scalable Backend Architecture for Millions of Daily Active Users

The article outlines a comprehensive backend architecture for handling millions of daily active users, covering DNS routing, layer‑4/7 load balancing, monolithic versus microservice deployment, caching, database sharding, hybrid‑cloud strategies, elastic scaling, and multi‑level degradation mechanisms.

Load Balancingbackend architectureelastic scaling
0 likes · 12 min read
Designing a Scalable Backend Architecture for Millions of Daily Active Users