Tag

AI hardware

1 views collected around this technical thread.

Architects' Tech Alliance
Architects' Tech Alliance
Apr 18, 2025 · Artificial Intelligence

Evolution and Architecture of Google TPU Chips

This article outlines the development of Google's Tensor Processing Units (TPU) from the first generation to the latest seventh‑generation chip, detailing architectural improvements, performance specifications, integration into data‑center pods and mobile devices, and concludes with references to related AI‑hardware resources and promotional material.

AI hardwareGoogleTPU
0 likes · 10 min read
Evolution and Architecture of Google TPU Chips
Architects' Tech Alliance
Architects' Tech Alliance
Mar 28, 2025 · Artificial Intelligence

Evolution of NVIDIA GPU Architectures for Deep Learning: From Volta to Blackwell and Rubin

The article traces NVIDIA’s GPU architecture evolution from the Volta era’s pioneering Tensor Cores through Turing, Ampere, Hopper, and the latest Blackwell and Rubin designs, highlighting key innovations such as mixed‑precision support, sparsity, NVLink, and their impact on deep‑learning performance.

AI hardwareGPUNVIDIA
0 likes · 10 min read
Evolution of NVIDIA GPU Architectures for Deep Learning: From Volta to Blackwell and Rubin
Code Mala Tang
Code Mala Tang
Mar 21, 2025 · Artificial Intelligence

What Are the Four Waves of AI and How NVIDIA Is Shaping the Future?

NVIDIA’s GTC 2025 keynote outlines the four AI waves—from perception to physical AI—while highlighting the company’s latest Blackwell chips, DGX Spark/Station computers, Dynamo inference accelerator, robotics collaborations, GM autonomous‑driving partnership, and AI‑native 6G efforts, underscoring massive data‑center investment and future challenges.

AI hardwareArtificial IntelligenceNVIDIA
0 likes · 11 min read
What Are the Four Waves of AI and How NVIDIA Is Shaping the Future?
DataFunSummit
DataFunSummit
Mar 3, 2025 · Artificial Intelligence

DeepSeek Open Source Week: Seven Core Technologies Reshaping Large‑Model Training

The DeepSeek open‑source week introduced seven breakthrough technologies—FlashMLA, DeepGEMM, DeepEP, DualPipe, EPLB, 3FS, and Smallpond—that together overhaul data flow, algorithmic complexity, hardware utilization, MoE communication, and resource balancing, dramatically improving large‑model training efficiency and lowering entry barriers for the AI industry.

AI hardwareDeepSeekLarge Models
0 likes · 17 min read
DeepSeek Open Source Week: Seven Core Technologies Reshaping Large‑Model Training
Model Perspective
Model Perspective
Feb 27, 2025 · Artificial Intelligence

Why AI Model Cost Cuts Trigger a New Wave of Nvidia Demand

The article explains how DeepSeek’s low‑cost large‑language‑model training reduces GPU price pressure, yet paradoxically fuels greater demand for Nvidia hardware by lowering entry barriers, illustrating the modern Jevons paradox and its broader economic and societal implications.

AI hardwareDeepSeekGPU demand
0 likes · 8 min read
Why AI Model Cost Cuts Trigger a New Wave of Nvidia Demand
Code Mala Tang
Code Mala Tang
Feb 10, 2025 · Artificial Intelligence

How Much Does It Really Cost to Run a Full‑Scale DeepSeek AI Locally?

This article breaks down the hardware and software expenses required to deploy a complete DeepSeek large‑language model on‑premises, revealing a total cost of roughly $110,000 and explaining why such an investment is prohibitive for most individual developers but may be justified for well‑funded research or corporate projects.

AI hardwareDeepSeekDeployment
0 likes · 4 min read
How Much Does It Really Cost to Run a Full‑Scale DeepSeek AI Locally?
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Aug 31, 2024 · Artificial Intelligence

Apple Intelligence and the Scaling Landscape of Large Language Models: Trends, Costs, and Deployment Considerations

An in‑depth analysis of Apple Intelligence and the broader LLM ecosystem, covering recent model scaling breakthroughs, data and compute requirements, pricing dynamics, hardware trends, on‑device versus cloud deployment, and strategic implications for developers, product managers, and AI practitioners.

AI hardwareApple IntelligenceLLM scaling
0 likes · 58 min read
Apple Intelligence and the Scaling Landscape of Large Language Models: Trends, Costs, and Deployment Considerations
Architects' Tech Alliance
Architects' Tech Alliance
May 14, 2024 · Fundamentals

Fundamentals of GPU Computing: PCIe, NVLink, NVSwitch, and HBM

This article provides a comprehensive overview of the core components and terminology of large‑scale GPU computing, covering GPU server architecture, PCIe interconnects, NVLink generations, NVSwitch, high‑bandwidth memory (HBM), and bandwidth unit considerations for AI and HPC workloads.

AI hardwareGPU computingHBM
0 likes · 11 min read
Fundamentals of GPU Computing: PCIe, NVLink, NVSwitch, and HBM
Architects' Tech Alliance
Architects' Tech Alliance
Dec 22, 2023 · Artificial Intelligence

AI Server Architecture, Market Trends, and Competitive Landscape in 2023

An in‑depth overview of AI server components, market growth, AIGC‑driven demand, heterogeneous computing architectures, major vendors, and future trends, highlighting hardware composition, cost breakdown, competitive rankings, and the impact of GPU, CPU, and emerging AI accelerators on the industry.

AI ServersAI hardwareCPU
0 likes · 14 min read
AI Server Architecture, Market Trends, and Competitive Landscape in 2023
Architects' Tech Alliance
Architects' Tech Alliance
Dec 3, 2023 · Artificial Intelligence

Overview of the AI Chip Market: Architectures, Companies, and Performance Comparisons

The rapidly growing multi‑billion‑dollar AI chip market in 2023 is categorized by architecture (GPGPU, FPGA, ASIC, compute‑in‑memory) and deployment location (cloud, edge, terminal), with Chinese vendors advancing training and inference chips but still lagging behind leading Nvidia products in performance and bandwidth.

AI chipsAI hardwareASIC
0 likes · 8 min read
Overview of the AI Chip Market: Architectures, Companies, and Performance Comparisons
Architects' Tech Alliance
Architects' Tech Alliance
Aug 21, 2023 · Artificial Intelligence

AI Compute Landscape: GPU Architectures, Tensor Cores, NVLink, and Scaling Challenges

The article surveys the AI compute ecosystem, explaining why CPUs are unsuitable for AI workloads, how heterogeneous CPU‑plus‑accelerator designs dominate, and detailing the evolution of NVIDIA GPUs, Tensor Cores, memory technologies, and inter‑GPU networking that enable large‑scale model training.

AI computeAI hardwareGPU architecture
0 likes · 11 min read
AI Compute Landscape: GPU Architectures, Tensor Cores, NVLink, and Scaling Challenges
DataFunTalk
DataFunTalk
Mar 18, 2023 · Artificial Intelligence

Review of Deep Learning Model Evolution, Current Limitations, and Future Trends

The article reviews the historical development of deep learning models, highlights scaling limits, universality, interpretability challenges, and hardware constraints, and then outlines future directions such as efficient architectures, self‑supervised training, broader applications, and emerging AI hardware, while also promoting a related ebook.

AI TrendsAI hardwareModel Scaling
0 likes · 6 min read
Review of Deep Learning Model Evolution, Current Limitations, and Future Trends
DataFunTalk
DataFunTalk
Mar 16, 2023 · Artificial Intelligence

Review of Deep Learning Model Evolution and Future Trends

The article reviews the past six years of deep learning model development, highlighting scaling limits, universality of Transformers, challenges in interpretability and control, and predicts future trends such as efficient architectures, multimodal capabilities, reinforcement learning in virtual worlds, and novel AI hardware, while also promoting a new deep‑learning practice ebook.

AI TrendsAI hardwareModel Scaling
0 likes · 6 min read
Review of Deep Learning Model Evolution and Future Trends
DataFunSummit
DataFunSummit
Feb 15, 2023 · Artificial Intelligence

ChatGPT Boom Fuels Surge in AI Chip Demand, Boosting Nvidia, Samsung, and SK Hynix

The explosive growth of ChatGPT and other AI chatbots is driving unprecedented demand for high‑performance AI chips and high‑bandwidth memory, positioning Nvidia as the primary beneficiary while also creating significant market opportunities for Samsung, SK Hynix, and other semiconductor manufacturers.

AI chipsAI hardwareChatGPT
0 likes · 11 min read
ChatGPT Boom Fuels Surge in AI Chip Demand, Boosting Nvidia, Samsung, and SK Hynix
Architects' Tech Alliance
Architects' Tech Alliance
Jan 27, 2023 · Artificial Intelligence

Challenges and Future Directions of GPU in AI Computing: A Comparison with TPU and FPGA

The article analyzes how GPUs, once dominant in accelerating AI workloads, now face limitations in precision, energy efficiency, and on‑chip networking, prompting a shift toward specialized accelerators like Google's TPU and FPGA solutions, while also exploring emerging GPU‑friendly scenarios such as VR/AR, cloud gaming, and military applications.

AI hardwareFPGAGPU
0 likes · 11 min read
Challenges and Future Directions of GPU in AI Computing: A Comparison with TPU and FPGA
Architects' Tech Alliance
Architects' Tech Alliance
Jun 29, 2021 · Artificial Intelligence

Evolution and Future Trends of Automotive Chips for Autonomous Driving

The article reviews the historical shift from CPU‑based ECUs to GPU‑centric and ASIC‑centric automotive processors, analyzes current GPU dominance, examines key industry players, and discusses why ASICs are expected to become the primary solution for future autonomous‑driving chips.

AI hardwareASICGPU
0 likes · 16 min read
Evolution and Future Trends of Automotive Chips for Autonomous Driving
Architects Research Society
Architects Research Society
Dec 1, 2020 · Cloud Computing

Pervasive Computing – Data Centers, Cloud, 5G, and Edge: Recent Industry Updates

This article summarizes recent developments across data‑center, cloud, 5G, edge, AI, security, automotive and aerospace sectors, including new partnerships, product launches, government contracts, and emerging threats shaping the pervasive computing landscape.

5GAI hardwareEdge Computing
0 likes · 8 min read
Pervasive Computing – Data Centers, Cloud, 5G, and Edge: Recent Industry Updates
Architects' Tech Alliance
Architects' Tech Alliance
Apr 2, 2019 · Artificial Intelligence

Breaking the Storage Wall: In‑Memory Computing and Integrated Compute‑Storage Architectures for AI

The article examines the growing bottlenecks of traditional compute architectures, explains why breaking the storage wall through high‑bandwidth communication, near‑data processing, and in‑memory compute is essential for AI workloads, and surveys the principles, advantages, challenges, future directions, and key industry players of integrated compute‑storage chips.

AI chipsAI hardwarecompute architecture
0 likes · 13 min read
Breaking the Storage Wall: In‑Memory Computing and Integrated Compute‑Storage Architectures for AI
Architects' Tech Alliance
Architects' Tech Alliance
Nov 27, 2018 · Artificial Intelligence

Comparative Analysis of AI Server Types and Guidelines for Selecting GPU Servers

This article compares CPU, GPU, FPGA, TPU, and ASIC based AI servers on performance and programmability, explains selection factors such as power, cost, precision, and memory, and provides practical guidelines for choosing appropriate GPU server architectures and models.

AI hardwareASICFPGA
0 likes · 5 min read
Comparative Analysis of AI Server Types and Guidelines for Selecting GPU Servers
Architects Research Society
Architects Research Society
Oct 7, 2018 · Artificial Intelligence

The Rise of Deep Neural Networks: From Research Breakthroughs to Industry Adoption

Deep neural networks, propelled by breakthroughs such as AlexNet and advances in GPU and TPU hardware, are rapidly moving from academic research into diverse applications—including earthquake prediction, medical imaging, and autonomous driving—driving massive industry investment, new semiconductor designs, and intense competition among tech giants and startups.

AI hardwareGPUTPU
0 likes · 9 min read
The Rise of Deep Neural Networks: From Research Breakthroughs to Industry Adoption