Tagged articles
11 articles
Page 1 of 1
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Dec 30, 2025 · Cloud Native

How HBox Boosts GPU Utilization with Multi‑Pool and NUMA‑Aware Scheduling

The HBox scheduling platform tackles large‑scale AI cluster challenges by introducing a three‑pool resource model, priority‑based preemptive scheduling, network‑topology and NUMA‑aware dispatch, and GPU virtualization techniques like MIG and vGPU, dramatically improving GPU utilization, SLA guarantees, and overall cluster efficiency.

AI clustersGPU schedulingGPU virtualization
0 likes · 24 min read
How HBox Boosts GPU Utilization with Multi‑Pool and NUMA‑Aware Scheduling
Bilibili Tech
Bilibili Tech
Jul 4, 2025 · Operations

Solving CPU Performance Layering in Heterogeneous Data Centers: A Practical Guide

This article explains why heterogeneous servers cause CPU performance layering, describes how to detect the issue using metrics such as NUMA hit/miss rates, cache miss ratios and frequency states, and provides step‑by‑step remediation techniques—including NUMA binding, cache isolation, recompilation and frequency locking—to improve resource pooling efficiency in modern data centers.

CPU performanceData centerNUMA
0 likes · 24 min read
Solving CPU Performance Layering in Heterogeneous Data Centers: A Practical Guide
Alibaba Cloud Developer
Alibaba Cloud Developer
Nov 28, 2024 · Artificial Intelligence

Mooncake: Open-Source KVCache-Centric Architecture Boosting Large-Model Inference

Mooncake, an open-source KVCache-centric inference architecture co-developed by Alibaba Cloud and Tsinghua University's MADSys lab, dramatically improves large-model throughput and reduces cost by decoupling resources, standardizing cache pooling, and integrating with frameworks like vLLM, sparking broad industry interest.

AI InfrastructureKVCachelarge language models
0 likes · 4 min read
Mooncake: Open-Source KVCache-Centric Architecture Boosting Large-Model Inference
AntTech
AntTech
Apr 23, 2024 · Databases

The Cloud Era of Databases: Insights from OceanBase Chief Scientist Yang Zhenkun

In his OceanBase developer conference keynote, chief scientist Yang Zhenkun explains how cloud resource pooling enables distributed databases to achieve elastic compute and storage, discusses the evolution of databases, the challenges of transaction processing, and envisions fully shared, on‑demand cloud database services.

Distributed SystemsOceanBasecloud computing
0 likes · 7 min read
The Cloud Era of Databases: Insights from OceanBase Chief Scientist Yang Zhenkun
dbaplus Community
dbaplus Community
Jun 24, 2023 · Operations

How Bilibili Scales Capacity: VPA, HPA, and Cost‑Saving Strategies

This article summarizes Zhang He’s Bilibili SRE talk on building a capacity‑management system that visualizes resource usage, reduces costs, improves stability, and leverages Kubernetes VPA, HPA, pooling, and quota management to support massive live‑stream events and rapid feature releases.

Cost OptimizationHPAKubernetes
0 likes · 21 min read
How Bilibili Scales Capacity: VPA, HPA, and Cost‑Saving Strategies
Open Source Linux
Open Source Linux
Dec 12, 2021 · Cloud Computing

What Is Cloud Computing? A Visual Journey Through Its History and Benefits

This comic‑style article explains cloud computing by tracing its evolution from the first computer in 1946, through networks, servers and data centers, to modern services like Amazon EC2, highlighting its resource‑pooling, elasticity, and security advantages over traditional computing.

cloud computingelasticityhistory
0 likes · 6 min read
What Is Cloud Computing? A Visual Journey Through Its History and Benefits
Efficient Ops
Efficient Ops
Jun 29, 2021 · Cloud Computing

What Is Cloud Computing? A Visual Journey from ENIAC to Modern Cloud

This article traces the evolution of cloud computing from the first computer ENIAC through the rise of networks, servers, and data centers, explains how Amazon and Google pioneered the term, and highlights its three core features—resource pooling, elastic scalability, and reliable security—showing why it’s reshaping IT today.

IT transformationVirtualizationcloud computing
0 likes · 6 min read
What Is Cloud Computing? A Visual Journey from ENIAC to Modern Cloud
Qunar Tech Salon
Qunar Tech Salon
Sep 16, 2020 · Operations

Noah: A Test Environment Governance Platform for Efficient Development and Testing

Noah is a test environment governance platform that uses infrastructure‑as‑code principles, resource pooling, soft routing, and containerization to automate the creation, management, and teardown of complex testing environments, dramatically improving developer productivity and reducing operational costs.

AutomationDevOpsInfrastructure as Code
0 likes · 12 min read
Noah: A Test Environment Governance Platform for Efficient Development and Testing
Efficient Ops
Efficient Ops
Aug 13, 2018 · Databases

How NetEase Scaled DBA Automation: From Manual Ops to Self‑Service Platforms

From 2015 to now, NetEase’s DBA team transformed from manual maintenance to a fully automated, platform‑driven system that supports multi‑active database architectures, real‑time monitoring, automated alarm handling, MHA management, resource pooling, rapid migration, and self‑service SQL review, dramatically reducing downtime and operational overhead.

DBA automationMHA managementSQL Review
0 likes · 21 min read
How NetEase Scaled DBA Automation: From Manual Ops to Self‑Service Platforms
21CTO
21CTO
Sep 8, 2017 · Operations

How Tencent CDN Handles Tb‑Level Traffic Bursts with a Docker‑Powered Burst Pool

This article explains how Tencent CDN tackles ever‑growing Tb‑scale traffic spikes by virtualizing resources into a shared Docker‑based burst pool, detailing the challenges, architectural solutions, technical optimizations, and the resulting cost savings and rapid scaling capabilities.

CDNDockerburst traffic
0 likes · 10 min read
How Tencent CDN Handles Tb‑Level Traffic Bursts with a Docker‑Powered Burst Pool