Tagged articles

AI compute

53 articles

Jun 29, 2026 · Industry Insights

Google Limits Meta's Access to Gemini Model, Disrupting AI Projects

According to the Financial Times and Reuters, Google told Meta in March that it could not provide the full compute capacity for the Gemini AI model, a restriction that has delayed several internal Meta AI projects and prompted the company to urge more efficient use of AI tokens.

AI compute · Cloud · Gemini

0 likes · 3 min read

Jun 28, 2026 · Artificial Intelligence

When the Memory Wall Locks AI Compute, Is HBM the Key or Another Lock?

The article analyzes how the growing memory‑wall bottleneck forces GPUs to idle while waiting for data, compares on‑chip SRAM and high‑bandwidth memory (HBM) as remedies, and examines HBM’s technical advantages, supply constraints, and divergent manufacturing routes that may turn it into a new limitation.

AI compute · GPU · HBM

0 likes · 6 min read

Jun 20, 2026 · Artificial Intelligence

Free Model Weights, Yet No Free Intelligence: The AI Compute Debate

A lively debate sparked by a tweet reveals that while open‑source model weights may be free, achieving useful AI still demands costly GPU compute, exposing a gap between benchmark scores, real‑world utility, and the economics of hosting large language models.

AI compute · GPU infrastructure · Open-source AI

0 likes · 5 min read

Architects' Tech Alliance

Jun 10, 2026 · Industry Insights

The Illusion Behind China’s AI Compute Boom

Although public statistics show domestic AI accelerator shipments soaring to over 55% market share and high penetration in key sectors, on‑site data‑center surveys reveal that less than 10% actually deploy Chinese chips, and hidden total‑cost‑of‑ownership issues make most enterprises still prefer Nvidia solutions.

AI compute · China AI hardware · Software Ecosystem

0 likes · 10 min read

Architects' Tech Alliance

Jun 6, 2026 · Industry Insights

2026 Blueprint for Super‑Scale AI Compute Centers: Architecture, Cooling, Power

Facing trillion‑parameter models and soaring AI token usage, the 2026 generation of AI compute centers will abandon traditional x86 servers, air cooling, and Ethernet spine‑leaf networks in favor of tightly coupled, vertically integrated supernodes with up to 8,192 NPU/GPU cards, heterogeneous chip pools, and cabinet‑level liquid cooling powered by green electricity, achieving a linear speedup ratio above 88% and a PUE of 1.10‑1.15.

AI compute · PUE · green power

0 likes · 5 min read

Lao Guo's Learning Space

Jun 3, 2026 · Industry Insights

Can Apple’s M5 Ultra Still Compete After NVIDIA’s RTX Spark Launch?

The RTX Spark desktop processor delivers 1 PFLOP of AI compute—about 14 times the M5 Ultra—while the M5 Ultra retains a three‑times higher memory bandwidth and twice the memory capacity, making it superior for certain inference workloads; the article breaks down specs, benchmarks, ecosystem differences, pricing and market positioning to show how each platform fits distinct AI use cases.

AI compute · Apple M5 Ultra · CUDA

0 likes · 12 min read

Machine Learning Algorithms & Natural Language Processing

May 29, 2026 · Industry Insights

SpaceX Switches Large‑Model Training Stack from JAX to C, Claiming Ten‑fold Speedup

SpaceX has replaced JAX with a C‑based training stack that Elon Musk says speeds up large‑model training by an order of magnitude, while simultaneously building the 1‑GW Colossus II supercomputer, listing AI infrastructure as a core business, and offering short‑term compute rentals such as a 180‑day lease to Anthropic.

AI compute · Anthropic · C language

0 likes · 6 min read

Baidu Intelligent Cloud Tech Hub

May 29, 2026 · Industry Insights

How Baidu’s Hanhai U Series Cuts 3 Million Yuan Cost for 10 MW High‑Density AI Data Centers

The article analyzes the power‑supply challenges of high‑density AI data centers, compares traditional UPS and 800 V DC architectures, and shows how Baidu’s Hanhai U series redesign delivers precise capacity matching, up to 2.5× higher power density, 55% space reduction and up to 15% cost savings.

AI compute · Baidu · Hanhai U series

0 likes · 11 min read

Architects' Tech Alliance

May 19, 2026 · Industry Insights

Why Did the Nvidia H100 GPU Vanish in 2026?

In 2026 the Nvidia H100 GPU became virtually unavailable as export bans, a locked‑down supply chain, and aggressive capacity reservations by cloud giants drove rental prices up 40%, lead times beyond a year, and forced small AI teams to seek niche clouds or spot instances.

AI compute · CoWoS packaging · GPU shortage

0 likes · 10 min read

May 19, 2026 · Industry Insights

Where Did the NVIDIA H100 Go? Memory and Packaging Bottlenecks Explained

The article analyzes why NVIDIA H100 GPUs have vanished from cloud and direct‑purchase channels in 2026, tracing the shortage to HBM memory and CoWoS packaging constraints, detailing price spikes, the role of mega‑buyers, impacts on small teams, and emerging mitigation strategies.

AI compute · CoWoS packaging · GPU shortage

0 likes · 15 min read

May 16, 2026 · Industry Insights

Why Compute Is the Lifeline of AI: Anthropic CFO Reveals the Industry’s Harsh Truth

Anthropic’s new credit‑based pricing for Claude, massive multi‑chip compute investments, and the CFO’s warning that buying too little or too much compute is dangerous together illustrate how rising hardware costs and Jevons paradox are driving the AI industry’s ever‑increasing subscription fees.

AI compute · AI industry · Anthropic

0 likes · 7 min read

Java Tech Enthusiast

May 13, 2026 · Industry Insights

Musk Allocates 220,000 GPUs to Claude, Doubling 5‑Hour Limits and Building Space‑Based Compute

Elon Musk's SpaceX AI has handed over its Colossus 1 supercomputer, with more than 220,000 Nvidia GPUs drawing over 300 MW of power, to Anthropic for Claude, instantly doubling the model's five‑hour usage limits while reshaping the AI compute market and fueling upcoming IPO narratives.

AI compute · Anthropic · Claude

0 likes · 6 min read

Architects' Tech Alliance

May 9, 2026 · Industry Insights

PCIe 8.0 Draft Unveiled: Toward a 1 TB/s Ultra‑Fast Era

The PCI‑SIG has released the PCIe 8.0 draft (0.5), promising 256 GT/s (1 TB/s per x16 link) that doubles PCIe 7.0, remains backward‑compatible, and aims to eliminate the bandwidth bottleneck for AI, GPUs, SSDs and CXL, with a spec expected in 2028 and market rollout around 2029‑30.

AI compute · Data Center · High-speed interconnect

0 likes · 6 min read

Machine Learning Algorithms & Natural Language Processing

May 7, 2026 · Industry Insights

Global AI New King Emerges: Anthropic’s $1.2 Trillion Valuation Tops OpenAI

Anthropic’s pre‑IPO valuation surged to $1.2 trillion—about 20% above OpenAI—driven by an 80‑fold revenue jump, massive compute contracts, a $200 billion Google cloud deal, and a surprise 220,000‑GPU boost from Elon Musk’s SpaceX, sparking both excitement and sustainability concerns in the AI industry.

AI compute · AI industry · AI valuation

0 likes · 8 min read

May 7, 2026 · Industry Insights

OpenAI’s $10B Deployment Company and Pentagon Deal Power the $50B AI Compute Arms Race

OpenAI has launched a $10 billion ‘Deployment Company’ with major PE backers and secured a Pentagon partnership to embed its models in classified networks, creating a dual‑track compute strategy that turns AI compute into geopolitical power and accelerates a $50 billion industry arms race.

AI compute · AI industry arms race · Artificial Intelligence

0 likes · 7 min read

Architects' Tech Alliance

Apr 28, 2026 · Information Security

Why Compute Power Gets You In, but Security Determines Survival—HaiGuang’s Two Game‑Changing Moves

The article analyzes the rapid expansion of AI compute demand, the shift toward domestic chip dominance, emerging security threats such as data poisoning, and HaiGuang’s hardware‑level “intrinsic security” architecture—including a full‑stack cryptographic platform and a trusted data space—to make AI systems both usable and secure for critical industries.

AI compute · Chinese semiconductor · data poisoning

0 likes · 6 min read

Architects' Tech Alliance

Apr 24, 2026 · Industry Insights

Full-Stack Software‑Hardware Co‑Design Redefines China's AI Compute Landscape

The 2026 HaiGuang AI Software Ecosystem Summit in Zhengzhou revealed a decisive industry shift from peak‑performance chip bragging to system‑level effective compute, emphasizing full‑stack software‑hardware collaboration, heterogeneous scheduling, and open architecture as the key to unlocking trillion‑parameter AI models.

AI compute · China AI ecosystem · MLPerf

0 likes · 5 min read

Apr 15, 2026 · Industry Insights

2026 AI Index: China‑US Model Race, Compute Surge & Data Trends

Based on Stanford HAI’s AI Index 2026, this analysis highlights how the US‑China model performance gap has vanished, global AI compute has exploded 3.3‑fold, data bottlenecks are easing through synthetic data and curation, while transparency, supply‑chain concentration, and environmental impact raise new challenges.

AI Index 2026 · AI compute · AI trends

0 likes · 8 min read

Lao Guo's Learning Space

Apr 12, 2026 · Artificial Intelligence

Nvidia N1 vs N1X: 20‑Core ARM CPUs and Blackwell GPUs Power the Next AI‑Focused PC

Nvidia's newly announced N1 and N1X ARM‑based Windows‑on‑Arm processors combine up to 20 CPU cores, Blackwell GPUs with 6144 CUDA cores, and 180‑200 TOPS of AI compute, promising desktop‑class AI performance in laptops while facing power, cooling, and software ecosystem challenges.

AI PC · AI compute · Arm

0 likes · 12 min read

Apr 7, 2026 · Industry Insights

Anthropic Secures Multi‑Gigawatt TPU Power with Google and Broadcom to Fuel Claude’s Explosive Growth

Anthropic has signed a multi‑year agreement with Google and Broadcom to lock in multiple gigawatts of next‑generation TPU capacity starting in 2027, a move driven by Claude’s soaring demand, revenue surpassing $30 billion and a rapid doubling of high‑spending enterprise customers.

AI compute · Anthropic · Claude

0 likes · 5 min read

Mar 25, 2026 · Industry Insights

Why Meta Jumped on Arm’s First In‑House Neoverse Chip

Arm has shifted from pure IP licensing to launching its own Neoverse processor, securing Meta as the first customer, a move that could reshape semiconductor power dynamics by compressing the value chain and intensifying competition among chip designers and server manufacturers.

AI compute · Arm · Meta

0 likes · 7 min read

Mar 23, 2026 · Industry Insights

Elon Musk’s Terawatt Compute Factory: Powering AI Arms Race and Interstellar Exploration

Elon Musk’s newly announced TERAFAB project aims to produce over one terawatt of compute power annually, representing a vertically integrated chip‑making venture that could reshape AI hardware supply chains, intensify the global semiconductor race, and provide space‑grade computing for Earth‑based AI systems and future interplanetary missions.

AI compute · Elon Musk · Terafab

0 likes · 7 min read

MeowKitty Programming

Mar 16, 2026 · Industry Insights

Why Some Developers Double Their Salary in the AI‑Coding Era

The article examines how AI coding tools are reshaping software development, showing that while routine coding skills are devalued and junior employment drops, engineers who master AI‑driven workflows, system design, and judgment can command double salaries and become indispensable, illustrated by real data and a three‑tier AI collaboration model.

AI coding · AI compute · Career

0 likes · 10 min read

Mar 11, 2026 · Industry Insights

Jensen Huang and Former OpenAI Executives Target a Gigawatt‑Scale AI Supercomputer

Jensen Huang teams up with former OpenAI leaders to launch a 1‑gigawatt AI supercomputing platform next year, a move that could reshape AI infrastructure, accelerate breakthrough applications, and raise sustainability and centralization challenges for the industry.

AI Infrastructure · AI compute · Gigawatt supercomputer

0 likes · 6 min read

Architecture & Thinking

Mar 1, 2026 · Artificial Intelligence

Why DeepSeek V4 Prioritizes Chinese Chips Over Nvidia – A Game‑Changer for AI Compute

DeepSeek’s upcoming V4 model breaks industry norms by prioritizing Huawei’s Ascend chips over Nvidia GPUs, offering over 30% performance gains, ultra‑long context windows, native multimodal abilities, and dramatically lower inference costs, signaling a shift toward autonomous AI compute in China.

AI compute · AI models · Chinese chips

0 likes · 6 min read

Feb 27, 2026 · Industry Insights

OpenAI Secures Record $110 B Private Funding to Scale AI for Everyone

OpenAI announced a historic $110 billion private financing round led by Amazon, Nvidia and SoftBank, a 50% valuation jump to $730 billion, 900 million weekly active ChatGPT users, massive Nvidia compute deals, an exclusive AWS distribution partnership, and a global expansion centered on its London research hub.

AI compute · AI financing · AWS

0 likes · 6 min read

Architects' Tech Alliance

Nov 3, 2025 · Artificial Intelligence

What Nvidia’s New Blackwell & Rubin GPUs Reveal About the Future of AI Compute

Nvidia’s latest GTC briefing details the Blackwell and Rubin GPU roadmaps, highlighting massive GPU shipments, new NVLink 6.0 interconnects, 448 Gbps SerDes, and architectural innovations aimed at boosting AI compute performance, efficiency, and scalability across data‑center workloads.

AI compute · Blackwell · GPU architecture

0 likes · 6 min read

Sep 24, 2025 · Artificial Intelligence

How OpenAI’s Quest for a Compute Empire Is Reshaping the AI Landscape

In a single week, OpenAI secured a $300 billion Oracle cloud deal, loosened its exclusive tie‑up with Microsoft, announced massive AI infrastructure projects, and revealed its own chip development, highlighting a strategic shift toward building an independent compute empire amid mounting financial and competitive pressures.

AI Infrastructure · AI compute · Industry Analysis

0 likes · 22 min read

Sep 12, 2025 · Artificial Intelligence

How Alibaba and Baidu Are Building Homegrown AI Chips to Challenge Nvidia

Amid escalating US export restrictions, Chinese tech giants Alibaba and Baidu are accelerating the development of their own AI chips—Alibaba's self‑designed processors and Baidu's Kunlun P800—to reduce reliance on Nvidia’s H100 and A100, signaling a potential shift in the global AI compute landscape.

AI chips · AI compute · Alibaba

0 likes · 5 min read

Architects' Tech Alliance

Mar 27, 2025 · Industry Insights

GPU Industry Deep Dive: Market Trends, Competitive Landscape, and Future Outlook

This article provides a comprehensive analysis of the GPU industry, covering product classifications, key characteristics, market size evolution, competitive dynamics among major players such as NVIDIA, AMD, and Huawei, policy influences, and future growth projections driven by AI and high‑performance computing demands.

AI compute · GPU · Industry Analysis

0 likes · 14 min read

Architects' Tech Alliance

Mar 9, 2025 · Industry Insights

How DeepSeek’s LLMs Slash Training Costs and Reshape China’s Compute Landscape

DeepSeek’s three‑model LLM lineup—V3, R1‑Zero and R1—delivers high performance while cutting training expenses to under $600 k, a fraction of the $0.6‑1 B typical for comparable models, signaling a major shift in China’s AI compute demand and supply chain dynamics.

AI compute · China · DeepSeek

0 likes · 3 min read

Java Web Project

Jan 29, 2025 · Industry Insights

How DeepSeek’s Low‑Cost AI Model Is Redrawing the Compute Landscape and Salary Benchmarks

DeepSeek’s ability to deliver top‑tier model performance on modest hardware sparked a US‑stock flash crash, challenged the high‑GPU demand narrative, and revealed unusually high salary tiers for AI researchers, prompting a reassessment of compute economics and talent compensation in the industry.

AI compute · Artificial Intelligence · DeepSeek

0 likes · 5 min read

Architects' Tech Alliance

Nov 14, 2024 · Industry Insights

Why GPUs Still Dominate AI Compute and What’s Driving the Next Chip Upgrade

The article analyzes how AI compute centers rely on GPUs and emerging AI chips, examines the booming demand for HBM memory, the scarcity of advanced CoWoS packaging, and the rising need for sophisticated backside power delivery as AI models scale.

AI compute · CoWoS packaging · GPU dominance

0 likes · 6 min read

Alibaba Cloud Infrastructure

Nov 11, 2024 · Artificial Intelligence

Scale‑up x10 Drives a New Wave of AI Compute Cluster Network Architecture

At the CCF ChinaNet conference, Alibaba Cloud’s VP of R&D presented a vision of AI compute scaling to ten‑fold larger clusters, highlighting the shift from InfiniBand to high‑throughput Ethernet, the HPN7.0 architecture, emerging Scale‑up challenges, and the roadmap for high‑throughput Ethernet and the ENode+ super‑node system.

AI compute · Ethernet · HPN7.0

0 likes · 8 min read

Architects' Tech Alliance

Nov 10, 2024 · Industry Insights

AI Compute Infrastructure: Trends, Scaling Laws, and the Rise of Massive Clusters

The article analyzes the development of AI compute infrastructure, detailing the three‑level architecture from chip to cluster, the scaling law linking model parameters to compute demand, the rapid growth of massive “ten‑thousand‑card” clusters worldwide, and the emerging demand for inference workloads driving new deployment and scheduling strategies.

AI compute · Industry Trends · Inference Demand

0 likes · 15 min read

Architects' Tech Alliance

Sep 15, 2024 · Industry Insights

How to Build a Super‑Scale AI Cluster: From GPU Power to DPU‑Driven Architecture

This article analyzes the technical roadmap for upgrading AI super‑large GPU clusters to support trillion‑parameter multimodal models, covering single‑chip performance, super‑node scaling, DPU‑based compute fusion, energy‑efficient designs, converged storage, high‑throughput networking, and fault‑tolerant checkpoint strategies.

AI compute · DPU · GPU clusters

0 likes · 18 min read

Architects' Tech Alliance

Sep 12, 2024 · Artificial Intelligence

Comparison of InfiniBand and RoCEv2 Architectures for AI Compute Networks

This article examines the two dominant AI compute network architectures, InfiniBand and RoCEv2, detailing their designs, flow‑control mechanisms, performance, cost and scalability characteristics, and evaluates their respective advantages and limitations to guide network selection for AI data centers.

AI compute · InfiniBand · Network Architecture

0 likes · 9 min read

Architects' Tech Alliance

Aug 5, 2024 · Industry Insights

What Drives the AI Compute Chip Market? GPUs, ASICs, and the Rise of Chinese Players

This article examines the AI compute chip ecosystem, covering GPU, FPGA, and ASIC technologies, market share trends, key performance metrics such as TOPS, power and die area, and provides a detailed overview of major global and Chinese vendors and their flagship products.

AI compute · ASIC · Chinese AI chips

0 likes · 12 min read

Architects' Tech Alliance

Jul 30, 2024 · Industry Insights

How Market Segments Define the 2024 Automotive SoC Chip Landscape

The 2024 automotive SoC report breaks down vehicle price tiers and their AI compute needs into three chip categories—small (2.5‑20 TOPS), medium (20‑80 TOPS), and large (≥100 TOPS)—and analyzes demand, market size, and the competitive share of domestic versus foreign suppliers, highlighting growth opportunities for Chinese manufacturers.

2024 · AI compute · Automotive SoC

0 likes · 10 min read

Architects' Tech Alliance

Jul 25, 2024 · Artificial Intelligence

NVIDIA H20 AI Chip Launch and the Rapid Growth of China's AI Chip Market

The article reviews NVIDIA's newly released H20 AI accelerator for China, compares its performance and pricing with domestic chips, outlines the expanding Chinese AI chip ecosystem—including Huawei, Cambricon, HaiGuang, Alibaba, ByteDance, and Baidu—while highlighting market size growth, multi‑chip heterogeneity strategies, and the strong demand forecast through 2024.

AI chips · AI compute · China

0 likes · 8 min read

Architects' Tech Alliance

Jun 30, 2024 · Industry Insights

Why 400G/800G/1.6T Optical Modules Are the Next Frontier for AI Data Centers

The rapid growth of AI, HPC, and cloud workloads is driving exponential demand for 400G, 800G, and even 1.6T optical modules, prompting a shift in packaging technologies, modulation schemes, and parallel‑lane architectures to meet higher bandwidth and lower‑cost requirements in modern data centers.

AI compute · Data Center · High-Speed Networking

0 likes · 8 min read

Architects' Tech Alliance

Jun 20, 2024 · Artificial Intelligence

Comparative Analysis of InfiniBand and RoCEv2 Architectures for AI Compute Networks

This article provides a detailed comparison of InfiniBand and RoCEv2 network architectures, examining their technical features, flow‑control mechanisms, performance, cost, and suitability for AI compute environments to guide designers in selecting the optimal solution.

AI compute · InfiniBand · Network Architecture

0 likes · 9 min read

IT Architects Alliance

Jun 12, 2024 · Cloud Computing

Network Architecture Selection and Comparison for AI Compute Centers

The article analyzes traditional cloud data‑center networking challenges for AI workloads and compares two‑layer and three‑layer fat‑tree architectures, presenting high‑bandwidth, non‑blocking, and low‑latency designs such as AI‑Pool networks and offering practical deployment scales from hundreds to tens of thousands of GPUs.

AI compute · Fat-Tree · High Bandwidth

0 likes · 11 min read

Architects' Tech Alliance

May 1, 2024 · Industry Insights

How CXL Can Break the AI Memory Wall and Boost Data‑Center Performance

The rapid growth of AI models is widening the gap between compute power and memory bandwidth, but the emerging Compute Express Link (CXL) interconnect offers lower latency, memory sharing, and flexible device topologies that can alleviate the memory‑wall bottleneck and reshape future data‑center architectures.

AI compute · CXL · Data Center

0 likes · 10 min read

Architects' Tech Alliance

Mar 30, 2024 · Industry Insights

How NVIDIA’s B200 GPU Redefines AI Compute and What It Means for the Chip Market

The article analyzes the latest AI‑compute announcements from NVIDIA, AMD and Intel—including NVIDIA’s B200 GPU with 20 petaFLOPS FP4 performance, AMD’s MI300/MI400 roadmap, and Intel’s Gaudi 3 and Falcon Shores—while examining pricing, launch timelines, supply‑chain capacity, and the shifting market share among major cloud providers.

AI compute · AMD · GPU

0 likes · 10 min read

Architects' Tech Alliance

Mar 17, 2024 · Industry Insights

Why GPUs Remain the Dominant AI Compute Engine: Trends, Risks, and Future Outlook

The article analyzes current AI hardware options, explains why GPUs continue to dominate model training due to architectural compatibility, ecosystem support, and market maturity, and outlines emerging trends such as model miniaturization, optical interconnects, and chiplet technology that will shape the next generation of AI compute.

AI compute · Chiplet · GPU

0 likes · 6 min read

360 Smart Cloud

Feb 1, 2024 · Operations

AI Compute Era: Data Center Power, Cooling, and Space Requirements

The rapid growth of AI compute demand is forcing data centers to redesign cabinet power capacity, adopt advanced cooling solutions such as liquid cooling, and re‑evaluate space density and construction timelines to meet the high‑density, high‑power needs of modern AI workloads.

AI compute · cooling solutions · data center operations

0 likes · 12 min read

Architects' Tech Alliance

Aug 21, 2023 · Artificial Intelligence

AI Compute Landscape: GPU Architectures, Tensor Cores, NVLink, and Scaling Challenges

The article surveys the AI compute ecosystem, explaining why CPUs are unsuitable for AI workloads, how heterogeneous CPU‑plus‑accelerator designs dominate, and detailing the evolution of NVIDIA GPUs, Tensor Cores, memory technologies, and inter‑GPU networking that enable large‑scale model training.

AI compute · GPU clustering · NVLink

0 likes · 11 min read

Architects' Tech Alliance

Aug 10, 2023 · Industry Insights

InfiniBand vs RoCEv2: Which Network Powers AI Model Training?

This article examines the architecture of AI compute clusters, explaining offline training and inference pipelines, the role of RDMA, and the technical differences between InfiniBand and RoCEv2—including latency, bandwidth, scalability, cost, and vendor considerations—to help engineers choose the optimal high‑performance network for large‑model training.

AI compute · InfiniBand · RDMA

0 likes · 13 min read

Architects' Tech Alliance

Aug 8, 2023 · Cloud Computing

Design Principles and Practices for High‑Performance AI Compute Center Networks

The article analyzes the limitations of traditional data‑center networking for AI compute workloads and presents high‑bandwidth, non‑blocking, low‑latency design solutions—including two‑layer and three‑layer fat‑tree architectures, AI‑Pool concepts, and recommended configurations—for building scalable, efficient intelligent computing clusters.

AI compute · Fat-Tree · High Bandwidth

0 likes · 10 min read

Architects' Tech Alliance

Apr 1, 2023 · Industry Insights

Why GPUs Lag Behind Big AI Models and How In‑Memory Computing Helps

The article examines the growing bottlenecks of large‑scale AI model training caused by the separation of storage and compute, analyzes why conventional GPU architectures cannot keep pace with exponential model growth, and presents in‑memory and near‑memory computing, as well as storage‑compute integration, as promising solutions to boost performance, energy efficiency, and scalability for cloud and edge deployments.

AI compute · Cloud Computing · GPU bottleneck

0 likes · 10 min read

Open Source Linux

Jan 30, 2023 · Artificial Intelligence

How Liquid‑Cooled Servers Power the AI Future Inspired by “The Wandering Earth 2”

The article explores how the sci‑fi blockbuster “The Wandering Earth 2” highlights cutting‑edge liquid‑cooling server technology, linking it to the massive AI compute demands of the future, the push for greener data centers, and China’s Sugon innovations that could dramatically cut energy use.

AI compute · Sugon · data center efficiency

0 likes · 6 min read

Architects' Tech Alliance

Dec 28, 2021 · Artificial Intelligence

Understanding FLOPS, Benchmarks, and AI Compute Performance

This article explains the concept of FLOPS, its measurement units, common benchmarks such as Linpack and MLPerf, why traditional HPC benchmarks may not suit AI workloads, and provides a comprehensive overview of hardware performance figures from GFLOPS to PFLOPS across various modern processors and supercomputers.

AI compute · FLOPS · HPC

0 likes · 11 min read
