Tag

AI compute


Alibaba Cloud Infrastructure
Nov 11, 2024 · Artificial Intelligence

Scale‑up x10 Drives a New Wave of AI Compute Cluster Network Architecture

At the CCF ChinaNet conference, Alibaba Cloud’s VP of R&D presented a vision of AI compute scaling to ten‑fold larger clusters, highlighting the shift from InfiniBand to high‑throughput Ethernet, the HPN7.0 architecture, emerging Scale‑up challenges, and the roadmap for high‑throughput Ethernet and the ENode+ super‑node system.

AI compute · Ethernet · HPN7.0
8 min read
Architects' Tech Alliance
Sep 12, 2024 · Artificial Intelligence

Comparison of InfiniBand and RoCEv2 Architectures for AI Compute Networks

This article examines the two dominant AI compute network architectures, InfiniBand and RoCEv2, detailing their designs, flow‑control mechanisms, performance, cost and scalability characteristics, and evaluates their respective advantages and limitations to guide network selection for AI data centers.

AI compute · InfiniBand · RDMA
9 min read
Architects' Tech Alliance
Jul 25, 2024 · Artificial Intelligence

NVIDIA H20 AI Chip Launch and the Rapid Growth of China's AI Chip Market

The article reviews NVIDIA's newly released H20 AI accelerator for China, compares its performance and pricing with domestic chips, and outlines the expanding Chinese AI chip ecosystem (including Huawei, Cambricon, HaiGuang, Alibaba, ByteDance, and Baidu). It also highlights market size growth, multi‑chip heterogeneity strategies, and the strong demand forecast through 2024.

AI chips · AI compute · GPU
8 min read
Architects' Tech Alliance
Jun 20, 2024 · Artificial Intelligence

Comparative Analysis of InfiniBand and RoCEv2 Architectures for AI Compute Networks

This article provides a detailed comparison of InfiniBand and RoCEv2 network architectures, examining their technical features, flow‑control mechanisms, performance, cost, and suitability for AI compute environments to guide designers in selecting the optimal solution.

AI compute · InfiniBand · Performance
9 min read
IT Architects Alliance
Jun 12, 2024 · Cloud Computing

Network Architecture Selection and Comparison for AI Compute Centers

The article analyzes traditional cloud data‑center networking challenges for AI workloads and compares two‑layer and three‑layer fat‑tree architectures, presenting high‑bandwidth, non‑blocking, and low‑latency designs such as AI‑Pool networks and offering practical deployment scales from hundreds to tens of thousands of GPUs.

AI compute · Fat Tree · High Bandwidth
11 min read
360 Smart Cloud
Feb 1, 2024 · Operations

AI Compute Era: Data Center Power, Cooling, and Space Requirements

The rapid growth of AI compute demand is forcing data centers to redesign cabinet power capacity, adopt advanced cooling solutions such as liquid cooling, and re‑evaluate space density and construction timelines to meet the high‑density, high‑power needs of modern AI workloads.

AI compute · Data Center Operations · cooling solutions
12 min read
Architects' Tech Alliance
Aug 21, 2023 · Artificial Intelligence

AI Compute Landscape: GPU Architectures, Tensor Cores, NVLink, and Scaling Challenges

The article surveys the AI compute ecosystem, explaining why CPUs are unsuitable for AI workloads, how heterogeneous CPU‑plus‑accelerator designs dominate, and detailing the evolution of NVIDIA GPUs, Tensor Cores, memory technologies, and inter‑GPU networking that enable large‑scale model training.

AI compute · AI hardware · GPU architecture
11 min read
Architects' Tech Alliance
Aug 8, 2023 · Cloud Computing

Design Principles and Practices for High‑Performance AI Compute Center Networks

The article analyzes the limitations of traditional data‑center networking for AI compute workloads and presents high‑bandwidth, non‑blocking, low‑latency design solutions for building scalable, efficient intelligent computing clusters, including two‑layer and three‑layer fat‑tree architectures, AI‑Pool concepts, and recommended configurations.

AI compute · Fat Tree · High Bandwidth
10 min read
Architects' Tech Alliance
Dec 28, 2021 · Artificial Intelligence

Understanding FLOPS, Benchmarks, and AI Compute Performance

This article explains the concept of FLOPS, its measurement units, common benchmarks such as Linpack and MLPerf, why traditional HPC benchmarks may not suit AI workloads, and provides a comprehensive overview of hardware performance figures from GFLOPS to PFLOPS across various modern processors and supercomputers.

AI compute · Benchmark · FLOPS
11 min read