Tagged articles
10 articles
Page 1 of 1
Architects' Tech Alliance
Architects' Tech Alliance
May 6, 2025 · Artificial Intelligence

Evolution of NVIDIA GPU Architectures for AI from Volta to Blackwell

The article reviews NVIDIA's GPU architecture progression—from Volta's pioneering Tensor Cores through Turing, Ampere, Hopper, and the latest Blackwell and Rubin designs—highlighting key innovations, performance gains for deep learning, and related resource updates for AI practitioners.

GPU architectureHigh‑Performance ComputingNvidia
0 likes · 9 min read
Evolution of NVIDIA GPU Architectures for AI from Volta to Blackwell
AntTech
AntTech
Mar 19, 2025 · Artificial Intelligence

Award-Winning HPCA 2025 Papers on Near‑DRAM Processing (UniNDP) and GPU‑Accelerated Fully Homomorphic Encryption (WarpDrive)

At HPCA 2025, two standout papers—UniNDP, a unified compilation and simulation tool for near‑DRAM processing architectures, and WarpDrive, a GPU‑based fully homomorphic encryption accelerator leveraging Tensor and CUDA cores—demonstrate significant performance gains for AI workloads and privacy‑preserving computation.

AI accelerationFully Homomorphic EncryptionGPU
0 likes · 5 min read
Award-Winning HPCA 2025 Papers on Near‑DRAM Processing (UniNDP) and GPU‑Accelerated Fully Homomorphic Encryption (WarpDrive)
Architects' Tech Alliance
Architects' Tech Alliance
Aug 18, 2024 · Artificial Intelligence

RDMA, InfiniBand, RoCE, and iWARP: High‑Performance Networking for Large‑Scale Generative AI Model Training

The article explains how RDMA technologies—including InfiniBand, RoCE, and iWARP—provide high‑throughput, low‑latency, CPU‑free data transfer for massive generative AI model training, compares their architectures, and discusses modern network designs and load‑balancing strategies to optimize AI‑focused data‑center networks.

AI trainingHigh‑Performance ComputingInfiniBand
0 likes · 11 min read
RDMA, InfiniBand, RoCE, and iWARP: High‑Performance Networking for Large‑Scale Generative AI Model Training
Architects' Tech Alliance
Architects' Tech Alliance
Jul 7, 2024 · Operations

Designing High‑Performance Cluster Networks for AI Large Models: InfiniBand vs RoCE

The article analyzes the networking challenges of AI super‑large models, comparing InfiniBand and RoCE technologies, and presents design guidelines for ultra‑scale, high‑bandwidth, low‑latency, and highly stable cluster interconnects to maximize GPU utilization and overall training efficiency.

AIGPU interconnectHigh‑Performance Computing
0 likes · 14 min read
Designing High‑Performance Cluster Networks for AI Large Models: InfiniBand vs RoCE
Architects' Tech Alliance
Architects' Tech Alliance
Apr 21, 2024 · Fundamentals

Understanding RDMA: InfiniBand, RoCE, and Their Role in High‑Performance AI Model Training

This article explains how Remote Direct Memory Access (RDMA) technologies such as InfiniBand and RoCE bypass OS kernels to achieve ultra‑low latency and high bandwidth, discusses their hardware implementations, cost considerations, and their critical impact on large‑scale AI model training and HPC network design.

AIGPUHigh‑Performance Computing
0 likes · 11 min read
Understanding RDMA: InfiniBand, RoCE, and Their Role in High‑Performance AI Model Training
Architects' Tech Alliance
Architects' Tech Alliance
Mar 20, 2023 · Fundamentals

Dragonfly Network Topology and Routing Algorithms for High‑Performance Data Centers

The article explains the Dragonfly network topology, its hierarchical structure, key parameters, routing algorithms (Minimal, Non‑Minimal, UGAL variants) and deadlock avoidance techniques, highlighting how modern data‑center networks address latency bottlenecks in high‑performance computing environments.

Data centerDragonflyHigh‑Performance Computing
0 likes · 9 min read
Dragonfly Network Topology and Routing Algorithms for High‑Performance Data Centers
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
May 21, 2021 · Cloud Computing

2021 Data Center High‑Quality Development Conference Highlights Liquid‑Cooling Innovations and Industry Standards

The 2021 Data Center High‑Quality Development Conference in Beijing showcased the release of industry indexes and low‑carbon white papers, awarded Alibaba Cloud for liquid‑cooled cloud computing innovations, and detailed China's rapid data‑center growth, emerging liquid‑cooling technologies, and collaborative standard‑setting efforts.

Alibaba CloudData centerHigh‑Performance Computing
0 likes · 5 min read
2021 Data Center High‑Quality Development Conference Highlights Liquid‑Cooling Innovations and Industry Standards
Architects' Tech Alliance
Architects' Tech Alliance
Apr 26, 2021 · Artificial Intelligence

GPU Market Overview and Industry Applications

The article provides a comprehensive overview of GPU technology, its architecture, rapid market growth, segmentation by type, device and industry, cloud deployment trends, competitive landscape, and diverse applications ranging from high‑performance computing and AI to automotive, AR/VR, and IoT.

GPUHigh‑Performance ComputingMarket analysis
0 likes · 9 min read
GPU Market Overview and Industry Applications
Architects' Tech Alliance
Architects' Tech Alliance
Mar 11, 2019 · Fundamentals

Understanding Mellanox InfiniBand Technology and Its Role in High‑Performance Computing

The article explains Nvidia's $6.9 billion acquisition of Mellanox, outlines Mellanox's history and product portfolio, and provides a detailed overview of InfiniBand architecture, network topologies, protocols, and related software stacks such as OFED, highlighting their importance for data‑center, HPC, and cloud environments.

Data centerHigh‑Performance ComputingInfiniBand
0 likes · 14 min read
Understanding Mellanox InfiniBand Technology and Its Role in High‑Performance Computing
Architects' Tech Alliance
Architects' Tech Alliance
Feb 6, 2019 · Artificial Intelligence

Intel's $5.5 Billion Bid for Mellanox: Implications for Data Center AI and High‑Performance Computing

Intel has reportedly offered $5.5 billion to acquire Mellanox Technologies, a leading provider of high‑performance networking hardware, sparking interest from multiple tech giants and analysts who see the deal as a potential boost for data‑center AI, big‑data analytics, and HPC capabilities.

AcquisitionData centerHigh‑Performance Computing
0 likes · 5 min read
Intel's $5.5 Billion Bid for Mellanox: Implications for Data Center AI and High‑Performance Computing