Architects' Tech Alliance
Architects' Tech Alliance
Aug 10, 2025 · Artificial Intelligence

How Huawei Ascend 910 Redefines AI Training Performance

The Huawei Ascend 910 AI processor, built on the Da Vinci architecture with 7nm+ EUV technology, delivers 256 TFLOPS FP16 and 512 TOPS INT8 performance, superior energy efficiency, and a full-stack software ecosystem, making it ideal for large‑scale AI training, HPC, and cloud AI services.

AI processorAscend 910Da Vinci architecture
0 likes · 13 min read
How Huawei Ascend 910 Redefines AI Training Performance
Kuaishou Tech
Kuaishou Tech
Jul 17, 2025 · Artificial Intelligence

How DHPS Boosted Online Inference Throughput by 270% with RDMA

This article details the design and evolution of DHPS, Kuaishou's load‑balanced, RDMA‑based high‑performance service architecture, explaining its network, storage, and traffic‑scheduling innovations that deliver over 270% query‑throughput improvement, lower latency, reduced CPU usage, and near‑five‑nine availability for large‑scale AI inference workloads.

RDMAStorage Enginedistributed-systems
0 likes · 17 min read
How DHPS Boosted Online Inference Throughput by 270% with RDMA
Architects' Tech Alliance
Architects' Tech Alliance
May 12, 2025 · Artificial Intelligence

Comparison of Fat-Tree, Dragonfly, and Torus Network Topologies for AI and High‑Performance Computing

The article reviews Fat‑Tree, Dragonfly, and Torus network topologies, analyzing their bandwidth, scalability, latency, routing algorithms, and cost trade‑offs for AI‑driven high‑performance computing clusters, and highlights each design's strengths and limitations in large‑scale deployments.

AI computingDragonflyTorus
0 likes · 12 min read
Comparison of Fat-Tree, Dragonfly, and Torus Network Topologies for AI and High‑Performance Computing
Architects' Tech Alliance
Architects' Tech Alliance
Jul 1, 2024 · Industry Insights

Why Fat-Tree, Dragonfly, and Torus Topologies Matter for HPC Networks

The article analyzes three major high‑performance‑computing network topologies—Fat‑Tree, Dragonfly, and Torus—detailing their design principles, scalability formulas, routing strategies, advantages, and limitations to help architects choose the most suitable architecture for large‑scale GPU clusters.

DragonflyGPU clustersTorus
0 likes · 13 min read
Why Fat-Tree, Dragonfly, and Torus Topologies Matter for HPC Networks
21CTO
21CTO
May 30, 2024 · Fundamentals

How Gordon Bell’s Vision Shaped Modern Computing: From PDP to Bell’s Law

Renowned computer architect Gordon Bell, whose pioneering work on DEC’s PDP series, the creation of Bell’s Law, and leadership in supercomputing and high‑performance computing institutions, left an enduring legacy that continues to influence modern systems, AI, and the evolution of computing technology.

Gordon BellSystemscomputer architecture
0 likes · 11 min read
How Gordon Bell’s Vision Shaped Modern Computing: From PDP to Bell’s Law
Architects' Tech Alliance
Architects' Tech Alliance
May 3, 2024 · Fundamentals

From OSI Model to RDMA: High‑Performance Networking, Leaf‑Spine Architecture, and Switch Selection

This article examines the evolution of network protocols from the OSI seven‑layer model and TCP/IP to RDMA technologies such as InfiniBand and RoCE, compares traditional three‑tier and leaf‑spine data‑center designs, and evaluates Ethernet, InfiniBand, and RoCE switches for high‑throughput, low‑latency HPC environments.

Data center architectureInfinibandLeaf-Spine
0 likes · 13 min read
From OSI Model to RDMA: High‑Performance Networking, Leaf‑Spine Architecture, and Switch Selection
Alimama Tech
Alimama Tech
Mar 20, 2024 · Artificial Intelligence

Dolphin VectorDB: A High-Performance Vector Database for AI Applications

Dolphin VectorDB, created by Alibaba’s Alimama team, is a high‑performance, scalable vector database that delivers fast, cost‑effective AI‑driven vector storage and real‑time updates, supporting multiple query modes and powering applications such as content risk control, marketing Q&A, and audience selection, with ongoing enhancements for multimodal computing.

AI applicationsReal-time Updatescontent risk control
0 likes · 13 min read
Dolphin VectorDB: A High-Performance Vector Database for AI Applications
Architects' Tech Alliance
Architects' Tech Alliance
Jun 18, 2023 · Fundamentals

Analysis of Advanced High‑Performance Processors for Exascale Computing: Fujitsu A64FX, NVIDIA H100, AMD MI250X, and Intel PonteVecchio

This article examines four leading exascale‑grade high‑performance processors—Fujitsu A64FX, NVIDIA H100, AMD MI250X, and Intel PonteVecchio—detailing their core architectures, compute resources, memory hierarchies, specialized accelerators, process technologies, performance metrics, and trends to inform future domestic processor development.

AMD MI250XExascaleFujitsu A64FX
0 likes · 11 min read
Analysis of Advanced High‑Performance Processors for Exascale Computing: Fujitsu A64FX, NVIDIA H100, AMD MI250X, and Intel PonteVecchio
Architects' Tech Alliance
Architects' Tech Alliance
Sep 30, 2022 · Fundamentals

High‑Performance Computing: Principles, Evolution, Applications, and Market Landscape

This article explains the concept and history of high‑performance computing (HPC), its serial and parallel processing architectures, performance metrics such as FLOPS, major application domains, and the rapid market growth and competitive landscape in China driven by national policies and industry investment.

China HPCHPC ApplicationsSupercomputers
0 likes · 14 min read
High‑Performance Computing: Principles, Evolution, Applications, and Market Landscape
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jul 21, 2022 · Cloud Computing

How Baidu’s Cloud Storage Powers High‑Performance Computing and AI Workloads

This article explains the storage challenges of high‑performance computing—including traditional HPC, AI‑driven HPC, and HPDA—then details Baidu’s unified storage platform, object storage BOS, and runtime solutions PFS and RapidFS, illustrating their architecture, features, and a real‑world autonomous‑driving customer case.

AI trainingcloud storagedata lake
0 likes · 29 min read
How Baidu’s Cloud Storage Powers High‑Performance Computing and AI Workloads
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jul 5, 2022 · Fundamentals

High‑Performance Chiplet and Interconnect Architectures: Insights from the HiPChips Workshop at ISCA 2022

The HiPChips workshop at ISCA 2022 gathered leading academia and industry experts to discuss the motivations, recent research breakthroughs, technical challenges, and ecosystem efforts surrounding high‑performance chiplet and interconnect architectures for future computing systems.

ChipletInterconnectcomputer architecture
0 likes · 10 min read
High‑Performance Chiplet and Interconnect Architectures: Insights from the HiPChips Workshop at ISCA 2022
Tencent Cloud Developer
Tencent Cloud Developer
Jul 10, 2018 · Cloud Computing

Building a Containerized Scientific Computing Platform on the Cloud

The talk details XtraPi’s journey from early PBS‑based supercomputers to a modern Kubernetes‑driven, multi‑cloud platform that uses Tencent Cloud TKE to run massive containerized drug‑discovery simulations, describing scaling strategies, image optimization, CI pipelines, checkpoint‑restart, and future serverless and bare‑metal enhancements.

HPCKubernetesMesos
0 likes · 28 min read
Building a Containerized Scientific Computing Platform on the Cloud