Tagged articles
14 articles
Page 1 of 1
Architects' Tech Alliance
Architects' Tech Alliance
Jan 13, 2026 · Artificial Intelligence

Inside Google’s Massive TPU SuperPod: How Scale‑Up and Scale‑Out Build a 9,216‑Chip AI Engine

The article explains Google’s TPU data‑center architecture, detailing the vertical Scale‑Up strategy within a SuperPod, the horizontal Scale‑Out across SuperPods, the 3D Torus topology with Twisted variants, and the multi‑layer network design that enables petabyte‑scale AI training and inference.

AI hardwareData centerScale‑Up
0 likes · 8 min read
Inside Google’s Massive TPU SuperPod: How Scale‑Up and Scale‑Out Build a 9,216‑Chip AI Engine
Architects' Tech Alliance
Architects' Tech Alliance
Oct 24, 2025 · Artificial Intelligence

How xPU Scale‑Up Networks Are Redefining AI Training Efficiency

As AI models grow to massive scales, the demand for ultra‑high‑performance, low‑latency networking in xPU clusters intensifies, prompting a shift from dense to MoE architectures and driving the evolution of Scale‑up networks, where Alibaba Cloud’s UPN design tackles bandwidth, cost, and reliability challenges.

AIMoEScale‑Up
0 likes · 13 min read
How xPU Scale‑Up Networks Are Redefining AI Training Efficiency
Architects' Tech Alliance
Architects' Tech Alliance
Oct 9, 2025 · Artificial Intelligence

Unlocking AI Scale‑Up: Inside SUE, OISA, ALS and ETH+ High‑Performance Interconnects

This article introduces four cutting‑edge AI networking technologies—SUE, OISA, ALS, and ETH+—detailing their backgrounds, architectural designs, and performance enhancements that enable ultra‑high bandwidth, low‑latency, and scalable interconnects for modern AI compute clusters.

AI networkingHigh‑performance computingScale‑Up
0 likes · 13 min read
Unlocking AI Scale‑Up: Inside SUE, OISA, ALS and ETH+ High‑Performance Interconnects
Architects' Tech Alliance
Architects' Tech Alliance
Sep 18, 2025 · Artificial Intelligence

How AI Model Training Is Redefining Data Center Scaling Strategies

Large‑scale AI model training now demands unprecedented bandwidth and latency performance, forcing data centers to adopt three scaling approaches—Scale‑up, Scale‑out, and Scale‑Across—while leveraging optical I/O, CPO, and optical circuit switching to overcome power, distance, and bandwidth limits.

AI InfrastructureScale‑Updata center scaling
0 likes · 11 min read
How AI Model Training Is Redefining Data Center Scaling Strategies
Architects' Tech Alliance
Architects' Tech Alliance
Jun 29, 2025 · Artificial Intelligence

Scale-Up vs Scale-Out: Balancing Performance and Flexibility in AI Infrastructure

This article explains the technical definitions, core differences, and practical use cases of Scale‑Up and Scale‑Out networking in AI systems, highlighting how they impact latency, bandwidth, and cost, and illustrates their combined application through NVIDIA's NVL72 supernode case study.

AI InfrastructureGPU networkingHigh‑performance computing
0 likes · 14 min read
Scale-Up vs Scale-Out: Balancing Performance and Flexibility in AI Infrastructure
Architects' Tech Alliance
Architects' Tech Alliance
May 31, 2025 · Artificial Intelligence

GPU Cluster Scaling: Understanding Scale‑Up and Scale‑Out for AI Pods

This article explains the concepts of AI Pods and GPU clusters, compares vertical (scale‑up) and horizontal (scale‑out) expansion, describes XPU types, discusses internal and inter‑pod communication, and evaluates the benefits and drawbacks of each scaling approach along with relevant networking technologies.

AI PodsGPUInfiniBand
0 likes · 10 min read
GPU Cluster Scaling: Understanding Scale‑Up and Scale‑Out for AI Pods
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Dec 2, 2024 · Cloud Computing

Alibaba Cloud Showcases ALS System and AI‑Driven Storage Innovations at CCF China Storage Conference 2024

At the CCF China Storage Conference 2024 in Guangzhou, Alibaba Cloud’s research team presented the ALS (ALink System) ecosystem, discussed Scale‑Up interconnect protocols, and demonstrated multi‑layer storage innovations for AI workloads, highlighting hardware‑software integration, high‑bandwidth networking, and future CXL/PIM research.

AI storageALSScale‑Up
0 likes · 7 min read
Alibaba Cloud Showcases ALS System and AI‑Driven Storage Innovations at CCF China Storage Conference 2024
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Jan 2, 2024 · Backend Development

Cache Design and Optimization Practices for High‑Concurrency Music Library Service

The article details NetEase Cloud Music’s high‑concurrency cache architecture—using lazy‑load, hole‑wrapped objects for penetration protection, placeholder values for missing data, horizontal and vertical scaling with consistent hashing, and asynchronous binlog‑driven invalidation—to achieve sub‑millisecond reads for a read‑heavy, write‑light music library.

Distributed SystemsScale‑Upcache invalidation
0 likes · 12 min read
Cache Design and Optimization Practices for High‑Concurrency Music Library Service
Architects' Tech Alliance
Architects' Tech Alliance
Jan 30, 2019 · Databases

SAP HANA Overview: Deployment Options, Use Cases, Scale‑Up/Scale‑Out, TDI, HA and Architecture

This article provides a comprehensive overview of SAP HANA, covering its role as an in‑memory database, deployment models (cloud, appliance, on‑premise), primary application scenarios, hardware certification, scale‑up versus scale‑out architectures, TDI integration, virtualization support, storage sizing, high‑availability options and node roles.

SAP HANAScale‑UpTDI
0 likes · 12 min read
SAP HANA Overview: Deployment Options, Use Cases, Scale‑Up/Scale‑Out, TDI, HA and Architecture
Architects' Tech Alliance
Architects' Tech Alliance
Jul 5, 2018 · Databases

Understanding SAP HANA Deployment Options, Scenarios, and High‑Availability Strategies

This article explains SAP HANA’s role as an in‑memory database platform, outlines its cloud and on‑premise deployment models, describes key business scenarios such as Business Warehouse on HANA and Business Suite on HANA, and details scale‑up vs. scale‑out, TDI, virtualization, storage sizing, and high‑availability configurations.

DeploymentSAP HANAScale‑Up
0 likes · 10 min read
Understanding SAP HANA Deployment Options, Scenarios, and High‑Availability Strategies
21CTO
21CTO
Jan 22, 2016 · Backend Development

Mastering Scale‑Up: How to Maximize Single‑Server Concurrency for Web Apps

This article explains why high‑concurrency design is essential for modern internet services, compares vertical (scale‑up) and horizontal (scale‑out) scaling, revisits the C10K/C10M challenges, and reviews process, thread, coroutine, and event‑driven models to help engineers build ultra‑scalable web servers.

Backend ArchitectureEvent-drivenScale‑Up
0 likes · 13 min read
Mastering Scale‑Up: How to Maximize Single‑Server Concurrency for Web Apps