Tagged articles

12 articles

Page 1 of 1

Aug 10, 2025 · Artificial Intelligence

How Huawei Ascend 910 Redefines AI Training Performance

The Huawei Ascend 910 AI processor, built on the Da Vinci architecture with 7nm+ EUV technology, delivers 256 TFLOPS FP16 and 512 TOPS INT8 performance, superior energy efficiency, and a full-stack software ecosystem, making it ideal for large‑scale AI training, HPC, and cloud AI services.

AI processorAscend 910Da Vinci architecture

0 likes · 13 min read

How Huawei Ascend 910 Redefines AI Training Performance

Kuaishou Tech

Jul 17, 2025 · Artificial Intelligence

How DHPS Boosted Online Inference Throughput by 270% with RDMA

This article details the design and evolution of DHPS, Kuaishou's load‑balanced, RDMA‑based high‑performance service architecture, explaining its network, storage, and traffic‑scheduling innovations that deliver over 270% query‑throughput improvement, lower latency, reduced CPU usage, and near‑five‑nine availability for large‑scale AI inference workloads.

RDMAStorage Enginedistributed systems

0 likes · 17 min read

How DHPS Boosted Online Inference Throughput by 270% with RDMA

Architects' Tech Alliance

May 12, 2025 · Artificial Intelligence

Comparison of Fat-Tree, Dragonfly, and Torus Network Topologies for AI and High‑Performance Computing

The article reviews Fat‑Tree, Dragonfly, and Torus network topologies, analyzing their bandwidth, scalability, latency, routing algorithms, and cost trade‑offs for AI‑driven high‑performance computing clusters, and highlights each design's strengths and limitations in large‑scale deployments.

AI computingDragonflyFat-Tree

0 likes · 12 min read

Comparison of Fat-Tree, Dragonfly, and Torus Network Topologies for AI and High‑Performance Computing

Architects' Tech Alliance

Jul 1, 2024 · Industry Insights

Why Fat-Tree, Dragonfly, and Torus Topologies Matter for HPC Networks

The article analyzes three major high‑performance‑computing network topologies—Fat‑Tree, Dragonfly, and Torus—detailing their design principles, scalability formulas, routing strategies, advantages, and limitations to help architects choose the most suitable architecture for large‑scale GPU clusters.

DragonflyFat-TreeGPU clusters

0 likes · 13 min read

Why Fat-Tree, Dragonfly, and Torus Topologies Matter for HPC Networks

21CTO

May 30, 2024 · Fundamentals

How Gordon Bell’s Vision Shaped Modern Computing: From PDP to Bell’s Law

Renowned computer architect Gordon Bell, whose pioneering work on DEC’s PDP series, the creation of Bell’s Law, and leadership in supercomputing and high‑performance computing institutions, left an enduring legacy that continues to influence modern systems, AI, and the evolution of computing technology.

Computer ArchitectureGordon BellSystems

0 likes · 11 min read

How Gordon Bell’s Vision Shaped Modern Computing: From PDP to Bell’s Law

Architects' Tech Alliance

May 3, 2024 · Fundamentals

From OSI Model to RDMA: High‑Performance Networking, Leaf‑Spine Architecture, and Switch Selection

This article examines the evolution of network protocols from the OSI seven‑layer model and TCP/IP to RDMA technologies such as InfiniBand and RoCE, compares traditional three‑tier and leaf‑spine data‑center designs, and evaluates Ethernet, InfiniBand, and RoCE switches for high‑throughput, low‑latency HPC environments.

Data center architectureInfiniBandLeaf-Spine

0 likes · 13 min read

From OSI Model to RDMA: High‑Performance Networking, Leaf‑Spine Architecture, and Switch Selection

Alimama Tech

Mar 20, 2024 · Artificial Intelligence

Dolphin VectorDB: A High-Performance Vector Database for AI Applications

Dolphin VectorDB, created by Alibaba’s Alimama team, is a high‑performance, scalable vector database that delivers fast, cost‑effective AI‑driven vector storage and real‑time updates, supporting multiple query modes and powering applications such as content risk control, marketing Q&A, and audience selection, with ongoing enhancements for multimodal computing.

AI applicationsReal-time UpdatesVector Database

0 likes · 13 min read

Dolphin VectorDB: A High-Performance Vector Database for AI Applications

Architects' Tech Alliance

Jun 18, 2023 · Fundamentals

Analysis of Advanced High‑Performance Processors for Exascale Computing: Fujitsu A64FX, NVIDIA H100, AMD MI250X, and Intel PonteVecchio

This article examines four leading exascale‑grade high‑performance processors—Fujitsu A64FX, NVIDIA H100, AMD MI250X, and Intel PonteVecchio—detailing their core architectures, compute resources, memory hierarchies, specialized accelerators, process technologies, performance metrics, and trends to inform future domestic processor development.

AMD MI250XExascaleFujitsu A64FX

0 likes · 11 min read

Analysis of Advanced High‑Performance Processors for Exascale Computing: Fujitsu A64FX, NVIDIA H100, AMD MI250X, and Intel PonteVecchio

Architects' Tech Alliance

Sep 30, 2022 · Fundamentals

High‑Performance Computing: Principles, Evolution, Applications, and Market Landscape

This article explains the concept and history of high‑performance computing (HPC), its serial and parallel processing architectures, performance metrics such as FLOPS, major application domains, and the rapid market growth and competitive landscape in China driven by national policies and industry investment.

China HPCHPC ApplicationsSupercomputers

0 likes · 14 min read

High‑Performance Computing: Principles, Evolution, Applications, and Market Landscape

Baidu Intelligent Cloud Tech Hub

Jul 21, 2022 · Cloud Computing

How Baidu’s Cloud Storage Powers High‑Performance Computing and AI Workloads

This article explains the storage challenges of high‑performance computing—including traditional HPC, AI‑driven HPC, and HPDA—then details Baidu’s unified storage platform, object storage BOS, and runtime solutions PFS and RapidFS, illustrating their architecture, features, and a real‑world autonomous‑driving customer case.

AI trainingData Lakecloud storage

0 likes · 29 min read

How Baidu’s Cloud Storage Powers High‑Performance Computing and AI Workloads

Alibaba Cloud Infrastructure

Jul 5, 2022 · Fundamentals

High‑Performance Chiplet and Interconnect Architectures: Insights from the HiPChips Workshop at ISCA 2022

The HiPChips workshop at ISCA 2022 gathered leading academia and industry experts to discuss the motivations, recent research breakthroughs, technical challenges, and ecosystem efforts surrounding high‑performance chiplet and interconnect architectures for future computing systems.

ChipletComputer ArchitectureHardware Design

0 likes · 10 min read

High‑Performance Chiplet and Interconnect Architectures: Insights from the HiPChips Workshop at ISCA 2022

Tencent Cloud Developer

Jul 10, 2018 · Cloud Computing

Building a Containerized Scientific Computing Platform on the Cloud

The talk details XtraPi’s journey from early PBS‑based supercomputers to a modern Kubernetes‑driven, multi‑cloud platform that uses Tencent Cloud TKE to run massive containerized drug‑discovery simulations, describing scaling strategies, image optimization, CI pipelines, checkpoint‑restart, and future serverless and bare‑metal enhancements.

HPCKubernetesMesos

0 likes · 28 min read

Building a Containerized Scientific Computing Platform on the Cloud