Tagged articles

NVIDIA

235 articles · Page 2 of 3

Aug 20, 2025 · Artificial Intelligence

Nvidia Unveils Nemotron‑Nano‑9B‑v2: Tiny Open‑Source LLM with Switchable Reasoning

Nvidia’s newly released Nemotron‑Nano‑9B‑v2, a 9‑billion‑parameter open‑source LLM optimized for a single Nvidia A10 GPU, introduces a toggleable reasoning mode and budget controls, delivering up to six‑fold speed gains, multilingual support, and strong benchmark results across various tasks.

AI inference · Mamba · NVIDIA

0 likes · 5 min read

Refining Core Development Skills

Aug 7, 2025 · Fundamentals

Why NVIDIA’s First Data‑Center GPU Revolutionized Computing: Inside the Tesla G80 Architecture

This article explains how NVIDIA transitioned from gaming graphics cards to general‑purpose GPUs with the first data‑center Tesla GPU, detailing the unified shader architecture, the internal components of TPCs and SMs, CUDA 1.0 programming basics, and performance calculations that illustrate the massive computational advantage over contemporary CPUs.

CUDA · GPGPU · GPU architecture

0 likes · 23 min read

AI Cyberspace

Aug 4, 2025 · Artificial Intelligence

From Tesla to Hopper: How NVIDIA GPU Architectures Powered the AI Revolution

This article traces the evolution of NVIDIA GPU architectures—from the early Tesla series through Fermi, Kepler, Maxwell, Pascal, Volta, Turing, Ampere, Hopper, and up to the upcoming Blackwell—explaining their hardware innovations, CUDA programming model, and how each generation enabled breakthroughs in high‑performance computing, deep learning, and AI applications.

AI · CUDA · GPU

0 likes · 67 min read

Architects' Tech Alliance

Jul 29, 2025 · Artificial Intelligence

Why NVIDIA Spectrum‑X and Quantum InfiniBand Are Redefining AI Data Center Networks

The article explains how AI‑driven data center networks must handle massive distributed workloads, why traditional Ethernet falls short, and how NVIDIA’s Spectrum‑X Ethernet and Quantum InfiniBand use loss‑less RDMA, dynamic routing, advanced congestion control, and hardware‑accelerated collective communication to deliver the bandwidth, latency, and scalability required for generative AI and large‑scale model training.

AI · InfiniBand · NVIDIA

0 likes · 8 min read

Open Source Linux

Jul 16, 2025 · Artificial Intelligence

How Huawei’s New AI Chip Aims to Rival Nvidia and AMD GPUs

Huawei is developing a new AI‑focused GPU‑style chip that mirrors Nvidia and AMD architectures, aiming to ease Chinese developers’ shift from Nvidia hardware, but still faces software compatibility hurdles due to reliance on CUDA and ongoing U.S. export restrictions.

AI chip · CUDA · GPU

0 likes · 3 min read

AIWalker

Jun 18, 2025 · Artificial Intelligence

SeNaTra: Nvidia’s Spatial Grouping Layer Pushes Semantic Segmentation Past Swin Transformer

Nvidia introduces SeNaTra, a native‑segmentation vision transformer that replaces uniform down‑sampling with a content‑aware spatial grouping layer, delivering superior zero‑shot and supervised segmentation performance while cutting parameters and FLOPs compared with Swin Transformer and other backbones.

NVIDIA · Semantic Segmentation · Vision Transformer

0 likes · 29 min read

Ops Development Stories

Jun 12, 2025 · Cloud Native

One-Click GPU-Enabled Kind Cluster Setup for Running Large AI Models

This tutorial walks you through using a one‑click script to create a GPU‑enabled Kind Kubernetes cluster, evenly distribute GPU resources across nodes with nvkind, install the necessary drivers and toolkits, deploy a vLLM‑served large language model, and verify its operation, all in a local or cloud environment.

AI model deployment · Docker · GPU

0 likes · 23 min read

Architects' Tech Alliance

Jun 9, 2025 · Artificial Intelligence

What Makes Nvidia’s Blackwell GPUs a Game-Changer for AI Performance?

In March 2024 Nvidia unveiled the Blackwell GPU family and the GB200 NVL72 architecture, featuring 3‑4 nm processes, redesigned CUDA cores, next‑gen ray‑tracing, upgraded DLSS, massive FP16/FP8 compute gains, 8 TB/s memory bandwidth, and NVLink Gen5, while also presenting complex power, cooling, and packaging challenges for large‑scale AI deployments.

AI acceleration · Blackwell · GPU

0 likes · 6 min read

Architects' Tech Alliance

Jun 6, 2025 · Artificial Intelligence

B30 vs H20: Which NVIDIA GPU Wins for AI Workloads and Budgets?

This article compares NVIDIA’s China‑specific B30 and high‑end H20 GPUs, detailing their CPU/GPU architecture updates, memory technologies, architectural differences, performance metrics, power and cooling characteristics, and price positioning, to help enterprises and developers choose the most suitable accelerator for AI and deep‑learning tasks.

AI acceleration · B30 · GPU

0 likes · 13 min read

Python Programming Learning Circle

Jun 2, 2025 · Artificial Intelligence

NVIDIA Adds Native Python Support to CUDA – What It Means for Developers

NVIDIA announced at GTC 2025 that CUDA will now natively support Python, allowing developers to write GPU‑accelerated code directly in Python without C/C++ knowledge, introducing new APIs, libraries, JIT compilation, performance tools, and a tile‑based programming model that aligns with Python’s array‑centric workflow.

AI · CUDA · GPU

0 likes · 7 min read

ShiZhen AI

May 26, 2025 · Industry Insights

Nvidia Plans Cheaper Blackwell AI Chip for China Amid Export Restrictions

Nvidia is reportedly preparing a lower‑cost Blackwell GPU for the Chinese market, priced at $6,500‑$8,000 and featuring 1.7 TB/s GDDR7 memory, while OpenAI’s o3 model uncovered a Linux kernel zero‑day (CVE‑2025‑37899), a study showed AI models can sabotage shutdown commands, and a tutorial demonstrates creating animated 3D icons with ChatGPT and Freepik tools.

3D icon creation · AI hardware · AI safety

0 likes · 8 min read

Architects' Tech Alliance

May 23, 2025 · Artificial Intelligence

Analysis of Nvidia’s China‑Specific Cut‑Down GPUs: H20, B20, and B40

This article examines the impact of U.S. export restrictions on Nvidia’s China‑specific GPU lineup, detailing the specifications and architectural changes of the H20, B20, and B40 chips, while also discussing domestic alternatives and the broader implications for AI compute in China.

AI chips · B20 · B40

0 likes · 10 min read

Alibaba Cloud Big Data AI Platform

May 22, 2025 · Artificial Intelligence

Deploy NVIDIA Cosmos Reason-1: Zero‑Code Physical AI on Alibaba Cloud PAI

Cosmos Reason-1, a customizable multimodal physical AI model from NVIDIA, can be deployed quickly on Alibaba Cloud’s PAI‑Model Gallery with zero code, offering automatic cloud resource adaptation, ready‑to‑use APIs, enterprise‑grade security, and demonstrated superior reasoning on video tasks, while upcoming tools will enable fine‑tuning via SFT and RL.

Alibaba Cloud · NVIDIA · Zero‑Code Deployment

0 likes · 8 min read

AI Product Manager Community

May 20, 2025 · Industry Insights

How Nvidia Is Shaping the Future of AI Infrastructure and Physical AI

At the 2025 Taipei International Computer Expo, Nvidia CEO Jensen Huang outlined the company's shift from a chipmaker to an AI infrastructure leader, introduced the concept of physical AI, and detailed upcoming hardware, software, and strategic initiatives that could reshape data centers, robotics, and autonomous driving.

AI Infrastructure · NVIDIA · artificial-intelligence

0 likes · 7 min read

MaGe Linux Operations

May 16, 2025 · Operations

How to Install NVIDIA GPU Drivers on Ubuntu 22.04: Complete Step-by-Step Guide

Learn how to fully install NVIDIA graphics drivers on Ubuntu 22.04, covering system updates, disabling Nouveau, removing old drivers, two installation methods (GUI and command line), verification steps, manual installation options, troubleshooting tips, and important precautions to ensure a stable GPU setup.

GPU drivers · Installation · NVIDIA

0 likes · 6 min read

Architects' Tech Alliance

May 13, 2025 · Industry Insights

How NVIDIA Builds AI Supercomputers: From H100 to GH200 and GB200 SuperPods

This article analyzes NVIDIA's evolving AI supercomputer architectures—detailing the H100‑based 256‑GPU SuperPod, the GH200‑based 256‑GPU SuperPod with integrated Grace CPU, and the GB200‑based 576‑GPU SuperPod—examining their NVLink and InfiniBand topologies, bandwidth limits, and scalability challenges.

AI · GPU · HPC

0 likes · 11 min read

Java Tech Enthusiast

May 9, 2025 · Industry Insights

Why NVIDIA’s Native Python Support in CUDA Could Revolutionize GPU Computing

NVIDIA announced native Python support in its CUDA toolkit, enabling developers to write GPU‑accelerated code directly in Python, detailing the new programming model, JIT‑based architecture, performance benefits, and the broader impact on AI development and the developer ecosystem.

AI · CUDA · GPU

0 likes · 15 min read

DataFunTalk

May 8, 2025 · Artificial Intelligence

Anthropic’s Report on Lobsters, Pregnant Women, and Banned Chips: A Critical Look at US‑China AI Chip Policy

The article reviews Anthropic’s controversial report that links lobsters, pregnant women, and banned chips to illustrate absurd claims about China’s AI capabilities, arguing that US export restrictions on high‑performance GPUs are essential to maintain America’s lead in artificial intelligence.

Anthropic · Chip Policy · NVIDIA

0 likes · 8 min read

Architects' Tech Alliance

May 6, 2025 · Artificial Intelligence

Evolution of NVIDIA GPU Architectures for AI from Volta to Blackwell

The article reviews NVIDIA's GPU architecture progression—from Volta's pioneering Tensor Cores through Turing, Ampere, Hopper, and the latest Blackwell and Rubin designs—highlighting key innovations, performance gains for deep learning, and related resource updates for AI practitioners.

GPU architecture · High‑Performance Computing · NVIDIA

0 likes · 9 min read

Fighter's World

May 2, 2025 · Industry Insights

Token Economics Reveals Nvidia’s New AI Factory Narrative

The article analyses Nvidia’s shift from a chip supplier to a full‑stack AI infrastructure provider called AI Factory, explains the token‑economics framework that measures intelligent output, details the hardware‑software stack and network fabric, quantifies token consumption of advanced agents, and evaluates the strategic opportunities and risks for Nvidia.

AI Infrastructure · AI factory · Agentic AI

0 likes · 29 min read

Architects' Tech Alliance

Apr 28, 2025 · Artificial Intelligence

NVLink High‑Speed Interconnect: Architecture, Evolution, and Performance

NVLink, NVIDIA's high‑bandwidth interconnect introduced with the P100 GPU, replaces PCIe by offering significantly higher data rates and lower latency for GPU‑GPU and GPU‑CPU communication, and has evolved through multiple generations to support modern AI and high‑performance computing workloads.

AI acceleration · GPU interconnect · NVIDIA

0 likes · 9 min read

Liangxu Linux

Apr 23, 2025 · Fundamentals

Which GPU Wins on Linux: AMD’s Plug‑and‑Play Simplicity vs NVIDIA’s Performance Edge

This article objectively compares AMD and NVIDIA graphics cards for Linux users, covering out‑of‑the‑box driver support, Wayland compatibility, gaming performance, machine‑learning capabilities, and cost‑effectiveness to help readers choose the best GPU for their needs.

AMD · Driver Support · GPU

0 likes · 9 min read

Architects' Tech Alliance

Apr 13, 2025 · Industry Insights

Which NVIDIA GPU Wins for AI? Deep Dive into RTX & A‑Series Performance and Power

This article presents a detailed comparison of major NVIDIA GPUs—including RTX 4090, RTX 4090 D, RTX 3090, A10, A40, A100, and H100—covering memory size, bandwidth, Tensor BF16/FP16/FP32 throughput, FP16/FP32 performance, power draw and release dates, and explains how these specs affect AI workload efficiency.

AI workloads · GPU · Industry Analysis

0 likes · 9 min read

Architects' Tech Alliance

Apr 10, 2025 · Artificial Intelligence

Which NVIDIA GPU Is Right for Your AI Compute Center? A Deep Dive into A100, H100, A800, H800, and H20

This article analyzes NVIDIA's A100, H100, A800, H800, and H20 GPUs, compares their architectures, performance, and pricing, and provides a step‑by‑step guide for building a private AI compute center tailored to training, inference, and high‑performance computing workloads.

A100 · AI training · GPU

0 likes · 11 min read

AI Frontier Lectures

Apr 8, 2025 · Industry Insights

Nvidia’s GPU Names Explained: Ampere, Hopper, Blackwell, Rubin, Feynman

At the recent GTC conference Nvidia unveiled its roadmap of AI‑focused GPUs—Ampere, Hopper, Blackwell, Rubin and the upcoming Feynman—each named after a pioneering scientist, and this article explores the historical contributions of André‑Marie Ampère, Grace Hopper, David Blackwell, Vera Rubin and Richard Feynman, linking their legacies to the architectures’ innovations.

AI · GPU · NVIDIA

0 likes · 10 min read

Architects' Tech Alliance

Mar 28, 2025 · Artificial Intelligence

Evolution of NVIDIA GPU Architectures for Deep Learning: From Volta to Blackwell and Rubin

The article traces NVIDIA’s GPU architecture evolution from the Volta era’s pioneering Tensor Cores through Turing, Ampere, Hopper, and the latest Blackwell and Rubin designs, highlighting key innovations such as mixed‑precision support, sparsity, NVLink, and their impact on deep‑learning performance.

AI hardware · GPU · NVIDIA

0 likes · 10 min read

Infra Learning Club

Mar 23, 2025 · Artificial Intelligence

Getting Started with cuda‑python and an Introduction to cuTile

This article explains the cuda‑python ecosystem, including its core packages, installation via pip or conda, the experimental cuda.core API, a full Python‑to‑CUDA workflow with NVRTC compilation, a performance comparison with C++, and the APIs it covers, and gives an overview of NVIDIA's new cuTile programming model.

CUDA · GPU · NVIDIA

0 likes · 11 min read

Code Mala Tang

Mar 21, 2025 · Artificial Intelligence

What Are the Four Waves of AI and How NVIDIA Is Shaping the Future?

NVIDIA’s GTC 2025 keynote outlines the four AI waves—from perception to physical AI—while highlighting the company’s latest Blackwell chips, DGX Spark/Station computers, Dynamo inference accelerator, robotics collaborations, GM autonomous‑driving partnership, and AI‑native 6G efforts, underscoring massive data‑center investment and future challenges.

AI hardware · Data Center · NVIDIA

0 likes · 11 min read

Software Engineering 3.0 Era

Mar 21, 2025 · Artificial Intelligence

NVIDIA GTC 2025: How AI Is Shifting From a Niche Toy to a Mass‑Market Tool and Ushering in the Robot Era

At NVIDIA GTC 2025, the company unveiled the Isaac GR00T N1 foundation model and the Mega Omniverse Blueprint platform, showing how AI‑driven robotics are moving from specialist prototypes to everyday tools through dual‑system architecture, synthetic‑data generation, cloud‑based development, and broad industry collaborations.

AI · Cloud · Isaac GR00T

0 likes · 10 min read

Infra Learning Club

Mar 20, 2025 · Artificial Intelligence

How GPU Frequency, Power Consumption, and FLOPS Interrelate

The article explains the theoretical and practical relationships between GPU clock frequencies, power consumption, and FLOPS, describes key hardware metrics such as SM, memory, and video clocks, shows how to query and set these values with nvidia‑smi, and presents experiments on a Tesla P4 that reveal the non‑linear trade‑offs between performance, power, and temperature.

Clock Speed · DVFS · FLOPS

0 likes · 15 min read

Architects' Tech Alliance

Mar 19, 2025 · Industry Insights

What Drives Nvidia’s AI Dominance and How Huawei’s Ascend Chips Compete

This article analyzes Nvidia’s evolution from a graphics pioneer to an AI hardware leader and examines Huawei’s Ascend AI processor roadmap, detailing technical specifications, ecosystem strategies, recent product releases, and the potential impact on related technology stocks.

AI chips · AI hardware · Ascend

0 likes · 6 min read

Architects' Tech Alliance

Feb 28, 2025 · Industry Insights

Why Rubin288’s Orthogonal CLOS Architecture Beats Traditional Designs

The article analyzes NVIDIA's Rubin288 high‑density GPU cabinet, comparing its orthogonal CLOS architecture with the older non‑orthogonal designs, and explains how the new layout improves reliability, bandwidth, scalability, and cooling for modern data‑center HPC deployments.

CLOS · DataCenter · GPU

0 likes · 10 min read

Model Perspective

Feb 27, 2025 · Artificial Intelligence

Why AI Model Cost Cuts Trigger a New Wave of Nvidia Demand

The article explains how DeepSeek’s low‑cost large‑language‑model training reduces GPU price pressure, yet paradoxically fuels greater demand for Nvidia hardware by lowering entry barriers, illustrating the modern Jevons paradox and its broader economic and societal implications.

AI hardware · DeepSeek · GPU demand

0 likes · 8 min read

Architects' Tech Alliance

Feb 15, 2025 · Industry Insights

Choosing the Right NVIDIA GPU for AI: A100, H100, A800, H800 & H20 Explained

This article provides a detailed technical analysis of NVIDIA's A100, H100, A800, H800 and H20 GPUs, compares their architectures, performance and cost, and offers step‑by‑step guidance on building a private AI compute center, selecting hardware, software stacks and budgeting for different workloads.

AI training · GPU · Hardware Selection

0 likes · 11 min read

Infra Learning Club

Feb 12, 2025 · Fundamentals

Why Does Nvidia Report Less GPU Memory Than Specified?

The article investigates why Nvidia L40S and RTX A6000 GPUs report less memory in nvidia‑smi than their advertised 48 GB, shows that enabling ECC reserves a few gigabytes, and demonstrates the effect by toggling ECC on a Tesla T4 card.

ECC · GPU memory · L40S

0 likes · 4 min read

Software Engineering 3.0 Era

Jan 27, 2025 · Industry Insights

What Capital Currents Hide Behind DeepSeek’s R1 Model Surge?

The article analyzes how DeepSeek’s R1 model, touted as a low‑cost AI breakthrough, sparked Wall Street speculation, prompted a sharp Nvidia stock decline, and may be part of a broader quant‑driven strategy to manipulate market sentiment and capture short‑term capital gains.

AI hardware · DeepSeek · NVIDIA

0 likes · 8 min read

DataFunSummit

Jan 24, 2025 · Artificial Intelligence

Challenges and Debugging Strategies for FP8 Training of Large Models

The article explains the performance benefits of using FP8 for large‑model training, outlines three main categories of FP8‑related issues such as loss spikes, divergence, and downstream metric gaps, and introduces a dedicated FP8 debug tool with metrics like MSE, cosine similarity, underflow, and overflow to help diagnose and resolve these problems.

AI · FP8 · NVIDIA

0 likes · 9 min read

Infra Learning Club

Jan 22, 2025 · Fundamentals

User‑Mode vs Kernel‑Mode GPU Virtualization: Architecture, Benefits, and Limits

The article compares user‑mode and kernel‑mode GPU virtualization, detailing their layered architectures, how they intercept APIs, the advantages such as openness, isolation, and unified memory, and the drawbacks including API complexity, kernel intrusion, legal risks, and cross‑process limitations.

API interception · CUDA · GPU virtualization

0 likes · 5 min read

Infra Learning Club

Jan 21, 2025 · Cloud Native

Understanding Nvidia MIG: Concepts, Configuration, and Kubernetes Deployment

This article explains Nvidia's Multi‑Instance GPU (MIG) technology, compares it with vGPU, walks through enabling and partitioning MIG on A100 cards using nvidia‑smi commands, and shows how to expose MIG resources in Kubernetes with single and mixed strategies.

A100 · Device Plugin · GPU virtualization

0 likes · 15 min read

Software Engineering 3.0 Era

Jan 18, 2025 · Industry Insights

Is AI Self‑Programming and Recursive Self‑Improvement Signaling the Endgame?

The article examines Nvidia’s claim that AI can now write software and build an “AI factory,” analyzes OpenAI’s emerging o‑series models that purportedly achieve recursive self‑improvement, and surveys community reactions ranging from excitement to safety concerns about a potential AI “game over.”

AI safety · Industry Analysis · NVIDIA

0 likes · 8 min read

Architects' Tech Alliance

Jan 9, 2025 · Industry Insights

What Nvidia’s RTX 50 Series and Blackwell Architecture Mean for GPUs and Data Centers

The article details Nvidia’s upcoming RTX 50 consumer GPUs, the Blackwell‑based Grace NVLink72 data‑center super‑chip, and the pocket‑sized Project DIGITS AI system, highlighting specifications, performance claims, pricing expectations, and the broader impact on the GPU market.

Blackwell · Data Center · GPU

0 likes · 6 min read

Java Tech Enthusiast

Jan 9, 2025 · Cloud Native

Configuring NVIDIA Docker Plugin and GPU Access in Kubernetes

This guide walks through installing the NVIDIA container toolkit, configuring Docker to use the NVIDIA runtime, verifying GPU access, deploying the NVIDIA device plugin in Kubernetes, labeling GPU nodes, and running a GPU‑accelerated FFmpeg pod to confirm successful GPU integration.

Container Toolkit · Docker · GPU

0 likes · 12 min read

Liangxu Linux

Jan 8, 2025 · Cloud Native

Enable NVIDIA GPU Access in Docker and Kubernetes with the NVIDIA Container Toolkit

This guide walks through checking system and software environments, installing and configuring the NVIDIA Docker plugin, verifying GPU access in Docker containers, deploying the NVIDIA device plugin on a Kubernetes cluster, creating GPU‑enabled pods, and troubleshooting common issues, all with concrete commands and configuration examples.

Container Toolkit · FFmpeg · GPU

0 likes · 12 min read

21CTO

Jan 7, 2025 · Artificial Intelligence

Nvidia Reveals RTX 50 GPUs, Thor Auto Chip, and AI Supercomputer at CES 2025

At CES 2025, Nvidia CEO Jensen Huang announced the RTX 50 series GPUs built on the Blackwell architecture, the Thor automotive processor, the Project Digits personal AI supercomputer, new AI agents and robotics initiatives, detailing pricing, performance specs, and partnerships across automotive and AI ecosystems.

Automotive · CES 2025 · GPU

0 likes · 10 min read

Architects' Tech Alliance

Jan 6, 2025 · Industry Insights

How Nvidia’s GB300 GPU Is Shaping AI Inference and Cloud Supply Chains

The article provides a detailed technical analysis of Nvidia’s new GB300 and B300 GPUs, comparing their performance, memory architecture, and power consumption to previous generations, and examines how these changes affect AI inference workloads, NVL72 accelerator systems, and the supply‑chain strategies of major cloud providers.

AI inference · Cloud Computing · GPU

0 likes · 12 min read

Architects' Tech Alliance

Dec 10, 2024 · Industry Insights

Could Nvidia Face Up to $50 Billion in Chinese Antitrust Fines?

China’s market regulator has opened an antitrust investigation into Nvidia over alleged breaches of its 2020 Mellanox acquisition commitments, and analysts estimate that, based on the country’s Anti‑Monopoly Law, the company could be fined anywhere from $1 billion to as much as $50 billion, depending on the severity of the violation.

Antitrust · China · Market Analysis

0 likes · 6 min read

Architects' Tech Alliance

Dec 1, 2024 · Industry Insights

What Powers Nvidia’s AI Dominance? Inside Its Three‑Chip Strategy and Market Outlook

The article summarizes Nvidia’s AI industry development strategy report, highlighting its 98% GPU market share in data centers, the three‑chip (GPU‑CPU‑DPU) approach, the four core business segments, and future plans such as an AI factory and sovereign‑AI initiatives.

AI Industry Analysis · Future Outlook · GPU market share

0 likes · 7 min read

Architects' Tech Alliance

Nov 28, 2024 · Artificial Intelligence

Comprehensive Comparison of NVIDIA GPUs: A100, A800, H100, H200, H800, B100, B200, and L40S

This article provides an in‑depth overview of NVIDIA’s latest GPU families—including A100/A800, H100/H200/H800, B100/B200, and L40S—detailing their release backgrounds, key specifications, typical application scenarios, and pricing to help readers understand their performance and market positioning.

AI · Comparison · GPU

0 likes · 11 min read

Architects' Tech Alliance

Oct 26, 2024 · Industry Insights

Why NVIDIA’s Blackwell GB200 Outpaces H100: 5 Key Technical Advantages

The Blackwell GB200 series delivers a massive leap in AI compute power with 20 petaFLOPS FP4 performance, a dual‑chip N4P design, 192 GB HBM3E memory, modular MGX servers, and advanced copper DAC and liquid‑cooling solutions that together boost training speed up to 30‑fold over the H100.

Blackwell · GB200 · GPU

0 likes · 6 min read

DataFunSummit

Oct 2, 2024 · Artificial Intelligence

NVIDIA’s Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation

This article explains NVIDIA’s end‑to‑end stack for large language models, covering the NeMo Framework for data processing, training, and deployment, the open‑source TensorRT‑LLM inference accelerator, and the Retrieval‑Augmented Generation (RAG) technique that enriches model outputs with external knowledge.

NVIDIA · NeMo · RAG

0 likes · 17 min read

Architects' Tech Alliance

Sep 25, 2024 · Fundamentals

NVIDIA Quantum‑2 InfiniBand Platform: Technical Overview, Q&A, and Deployment Guidance

This article explains the growing demand for high‑performance computing, introduces NVIDIA's Quantum‑2 InfiniBand platform with its high‑speed, low‑latency capabilities, provides a curated list of related technical articles, and offers an extensive Q&A covering compatibility, cabling, UFM, PCIe limits, and best‑practice deployment for AI and HPC workloads.

AI · GPU · InfiniBand

0 likes · 11 min read

21CTO

Sep 11, 2024 · Artificial Intelligence

How Volvo’s Nvidia‑Powered Software Stack Will Redefine EV Costs

Volvo announced that its upcoming EX90 electric SUV will run on a unified software platform powered by Nvidia's Drive Orin AI chip, using megacasting manufacturing to cut costs while avoiding subscription‑based revenue models.

AI chips · Megacasting · NVIDIA

0 likes · 3 min read

Architects' Tech Alliance

Sep 8, 2024 · Industry Insights

How Nvidia’s Rapid GPU Cycle Is Shaping the Future of AI Super‑Scale Networking

The article analyzes Nvidia’s accelerated GPU rollout, highlighting the Blackwell series’ massive performance and energy gains, the company’s AI‑focused Ethernet Spectrum‑X roadmap, and the broader impact on NVLink, InfiniBand, and Ethernet interconnects for upcoming massive AI clusters.

AI Ethernet · GPU · NVIDIA

0 likes · 6 min read

DataFunSummit

Sep 5, 2024 · Artificial Intelligence

NVIDIA’s End‑to‑End Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation

This article introduces NVIDIA’s comprehensive solutions for large language models, covering the NeMo Framework’s full‑stack development pipeline, the open‑source TensorRT‑LLM inference accelerator, and Retrieval‑Augmented Generation techniques, while detailing data preprocessing, distributed training, model fine‑tuning, deployment, and performance optimizations.

NVIDIA · NeMo Framework · Retrieval-Augmented Generation

0 likes · 16 min read

Architects' Tech Alliance

Sep 3, 2024 · Industry Insights

How NVIDIA Grace Hopper Superchip Redefines HPC and AI Performance

The article provides an in‑depth technical overview of NVIDIA's Grace Hopper superchip, detailing its heterogeneous CPU‑GPU architecture, high‑bandwidth NVLink‑C2C interconnect, unified memory model, programming support, and system‑level scaling features that together deliver unprecedented performance for high‑performance computing and large‑scale AI workloads.

AI · Grace Hopper · HPC

0 likes · 20 min read

AsiaInfo Technology: New Tech Exploration

Aug 30, 2024 · Industry Insights

How GPU Virtualization Powers AI and Cloud Computing: Techniques, Challenges, and Future Directions

This article examines the rapid rise of GPU virtualization as a solution for efficient GPU resource utilization in AI, big data, and high‑performance computing, detailing its concepts, implementation methods across user, kernel, and hardware layers, Kubernetes integration, real‑world use cases, challenges, and emerging research trends.

Cloud Computing · Device Plugin · GPU virtualization

0 likes · 25 min read

Architects' Tech Alliance

Aug 29, 2024 · Industry Insights

How NVIDIA Builds 256‑GPU and 576‑GPU SuperPods with H100, GH200, and GB200 Interconnects

The article analyzes NVIDIA's DGX SuperPOD architectures across three GPU generations—H100, GH200, and GB200—detailing their NVLink/NVSwitch topologies, bandwidth calculations, scalability limits, and the practical challenges of constructing 256‑GPU and 576‑GPU supercomputing clusters.

Data Center · GPU · High-performance computing

0 likes · 11 min read

Linux Cloud Computing Practice

Aug 28, 2024 · Operations

How to Run Black Myth: Wukong Smoothly on deepin 23 – Full Installation Guide

This guide walks you through installing Steam, fixing NVIDIA driver issues, and configuring Proton on deepin 23 so that Black Myth: Wukong, a game steeped in Chinese culture, runs smoothly.

Black Myth · NVIDIA · Steam

0 likes · 6 min read

Ops Development Stories

Jul 26, 2024 · Cloud Native

How to Deploy NVIDIA GPU Operator on Kubernetes for GPU‑Accelerated Rendering

Learn step by step how to install NVIDIA’s GPU Operator on a Kubernetes cluster, configure GPU nodes, deploy a Blender workload for GPU‑accelerated rendering, monitor GPU metrics, and troubleshoot common issues, enabling seamless GPU scheduling and graphics rendering in cloud‑native environments.

Blender · Cloud Native · GPU rendering

0 likes · 8 min read

Architects' Tech Alliance

Jul 25, 2024 · Artificial Intelligence

NVIDIA H20 AI Chip Launch and the Rapid Growth of China's AI Chip Market

The article reviews NVIDIA's newly released H20 AI accelerator for China, compares its performance and pricing with domestic chips, outlines the expanding Chinese AI chip ecosystem—including Huawei, Cambricon, HaiGuang, Alibaba, ByteDance, and Baidu—while highlighting market size growth, multi‑chip heterogeneity strategies, and the strong demand forecast through 2024.

AI chips · AI compute · China

0 likes · 8 min read

Architects' Tech Alliance

Jul 19, 2024 · Industry Insights

What Nvidia’s Blackwell GPUs and PCIe 6/7 Mean for AI and Data Centers

The article analyzes Nvidia's Blackwell‑based GB200, HGX B200 and HGX B100 servers, their integration of Blackwell GPUs and Grace CPUs, the shift to PCIe 6.0/7.0, the accompanying Quantum‑X800 InfiniBand platform and 1.6 T optical modules, and projects rapid market growth driven by AI workloads.

AI · Blackwell · Data Center

0 likes · 9 min read

Open Source Linux

Jul 19, 2024 · Artificial Intelligence

How Much Is the PCB Inside an NVIDIA DGX A100 Worth? A Deep Dive

This article dissects the PCB composition of NVIDIA's DGX A100 AI server, detailing the GPU board, CPU motherboard, and auxiliary components to reveal their material area, cost breakdown, and overall value contribution in high‑performance computing systems.

AI server · DGX A100 · NVIDIA

0 likes · 11 min read

Architects' Tech Alliance

Jul 9, 2024 · Industry Insights

How Nvidia’s Accelerated GPU Roadmap Is Shaping AI‑Scale Networking

Nvidia plans to shorten its GPU generation cycle to one year, launching Blackwell Ultra in 2025, Rubin in 2026, and Rubin Ultra in 2027, while boosting token‑generation efficiency and introducing AI‑optimized Ethernet solutions like Spectrum‑X800, aiming to dominate large‑scale AI clusters and reshape the high‑performance networking market.

AI · Ethernet · GPU

0 likes · 6 min read

Architects' Tech Alliance

Jun 16, 2024 · Industry Insights

How Nvidia’s Blackwell GPUs Aim to Slash AI Training Costs and Power

The article analyzes Nvidia’s historic advantage, the massive performance and energy efficiency gains from Pascal to Blackwell GPUs, the economics of training large language models like GPT‑4, and the detailed roadmap of upcoming GPU, memory, and interconnect technologies shaping the future of data‑center AI.

AI · GPU · NVIDIA

0 likes · 14 min read

DevOps

Jun 13, 2024 · R&D Management

Jensen Huang on Management Philosophy, Team Structure, and Innovation at NVIDIA

In this interview, NVIDIA founder Jensen Huang shares his management philosophy, emphasizing the value of tackling difficult tasks, maintaining a small yet empowered team, avoiding layoffs, fostering a zero‑market mindset, navigating the early challenges of CUDA, and leveraging AI to drive future innovation.

AI · CUDA · Leadership

0 likes · 12 min read

Architects' Tech Alliance

Jun 13, 2024 · Industry Insights

How Nvidia’s New Blackwell GPUs and NVLink Redefine AI Acceleration in 2024

The article analyzes Nvidia's latest AI‑focused hardware and software breakthroughs showcased at Computex 2024, detailing how GPU‑CPU hybrid architectures, new libraries, and high‑speed interconnects like NVLink dramatically boost performance while keeping power and cost growth modest.

AI acceleration · Blackwell · DGX

0 likes · 12 min read

Architects' Tech Alliance

Jun 10, 2024 · Artificial Intelligence

NVLink vs PCIe GPUs: Which Nvidia AI Server Fits Your Workload?

This article compares Nvidia's NVLink (SXM) and PCIe GPU versions for AI servers, detailing their architectures, bandwidth, power consumption, and ideal use cases, helping readers choose the optimal configuration based on performance needs and budget constraints.

AI servers · GPU · NVIDIA

0 likes · 8 min read

21CTO

Jun 7, 2024 · Artificial Intelligence

Nvidia Beats Apple in Market Value: AI Chip Wars, New AMD Processors & More

This roundup highlights Nvidia surpassing Apple in market cap, AMD's next‑gen AI processors, Elon Musk shifting Nvidia chips to X, Microsoft’s latest layoffs and AI spending, Google’s new developer program, GitHub Actions Arm64 support, Ubuntu Core 24 for IoT, and the release of Zabbix 7.0.

AI hardware · Cloud Computing · Developer Tools

0 likes · 12 min read

IT Services Circle

Jun 6, 2024 · Artificial Intelligence

Nvidia Unveils Blackwell GPU and AI Supercomputing Roadmap

Nvidia’s latest Blackwell GPU, presented by Jensen Huang, promises unprecedented performance and energy efficiency for large‑scale AI models, while the company also showcases accelerated computing, NVLink interconnects, AI‑optimized DGX servers, the NIM platform for rapid LLM deployment, and ambitious projects such as Earth‑2 digital twins and next‑generation embodied AI robots.

AI · Blackwell · GPU

0 likes · 18 min read

Architects' Tech Alliance

May 7, 2024 · Industry Insights

Why GPUs Remain the Dominant AI Training Hardware: Trends and Challenges

The article analyzes why GPUs continue to dominate AI model training, comparing them with ASICs, CPUs, and other chips, and discusses ecosystem advantages, domestic development gaps, emerging edge‑AI demands, high‑bandwidth needs, and chiplet technology as future enablers.

AI hardware · Chiplet · GPU

0 likes · 5 min read

Architects' Tech Alliance

May 5, 2024 · Operations

Essential Q&A on NVIDIA Quantum‑2 InfiniBand: Compatibility, Cabling, and Performance

This article compiles detailed technical Q&A about NVIDIA's Quantum‑2 InfiniBand platform, covering compatibility of CX7 NDR ports, cabling options, switch connections, UFM deployment, PCIe bandwidth limits, and performance considerations for high‑performance computing clusters.

Cabling · HPC · InfiniBand

0 likes · 14 min read

Architects' Tech Alliance

May 1, 2024 · Industry Insights

How NVIDIA’s Blackwell Platform Redefines AI Supercomputing Networks

The article examines NVIDIA’s Blackwell platform network architecture, detailing the fifth‑generation NVLink, sixth‑generation PCIe, 800 Gb/s InfiniBand and Ethernet adapters, the DGX B200 and GB200 configurations, new IB and Ethernet switches, and the implications of increased optical module demands for large‑scale AI clusters.

AI supercomputing · Blackwell · DGX

0 likes · 10 min read

Architects' Tech Alliance

Apr 27, 2024 · Industry Insights

What We Know About Nvidia’s Upcoming Blackwell GPUs and Their Power Surge

Nvidia’s next‑generation GeForce RTX 50 (Blackwell) GPUs are rumored to retain a 384‑bit memory bus, possibly adopt GDDR7 for up to 1.5 TB/s bandwidth, and push power consumption toward 1 kW, while Dell’s COO hints at new AI accelerators without liquid cooling.

AI accelerator · Blackwell · GDDR7

0 likes · 9 min read

DataFunSummit

Apr 14, 2024 · Artificial Intelligence

TensorRT-LLM: NVIDIA’s Scalable LLM Inference Framework – Overview, Features, Workflow, Performance, and Future Directions

This article presents a comprehensive overview of NVIDIA’s TensorRT-LLM, detailing its product positioning as a scalable LLM inference solution, key features such as model support, low-precision and quantization techniques, parallelism strategies, the end-to-end usage workflow, performance highlights, future roadmap, and answers to common technical questions.

LLM Inference · NVIDIA · Quantization

0 likes · 13 min read

Architects' Tech Alliance

Apr 6, 2024 · Industry Insights

How NVIDIA’s Blackwell GB200 NVL72 Redefines AI Compute with 10 TB/s Interconnect

The article analyses NVIDIA’s new Blackwell platform, focusing on the GB200 NVL72 rack‑scale system and its 10 TB/s NVLink‑C2C interconnect, detailing massive training and inference speedups, rack‑level DGX SuperPOD architecture, copper‑cable trends, and the broader impact on AI‑driven data‑center workloads.

AI · Blackwell · GPU

0 likes · 13 min read

Architects' Tech Alliance

Apr 2, 2024 · Artificial Intelligence

Evolution and Forecast of Nvidia NVLink, NVLink C2C, and B100/X100 GPU Architectures

The article analyses the historical evolution of Nvidia's NVLink and NVLink C2C interconnect technologies, compares them with PCIe, Ethernet and InfiniBand, and uses these trends to predict future AI‑chip architectures such as the B100 and X100 GPUs, highlighting design trade‑offs and packaging challenges.

AI chip · B100 · GPU architecture

0 likes · 15 min read

Architects' Tech Alliance

Mar 30, 2024 · Industry Insights

How NVIDIA’s B200 GPU Redefines AI Compute and What It Means for the Chip Market

The article analyzes the latest AI‑compute announcements from NVIDIA, AMD and Intel—including NVIDIA’s B200 GPU with 20 petaFLOPS FP4 performance, AMD’s MI300/MI400 roadmap, and Intel’s Gaudi 3 and Falcon Shores—while examining pricing, launch timelines, supply‑chain capacity, and the shifting market share among major cloud providers.

AI compute · AMD · GPU

0 likes · 10 min read

Sohu Tech Products

Mar 27, 2024 · Artificial Intelligence

NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions

NVIDIA’s comprehensive LLM ecosystem combines the full‑stack NeMo Framework for data curation, distributed training, fine‑tuning, inference acceleration with TensorRT‑LLM and Triton, plus Retrieval‑Augmented Generation and Guardrails, enabling efficient, low‑latency, knowledge‑grounded model deployment across clusters.

AI acceleration · Model Training · NVIDIA

0 likes · 16 min read

Architects' Tech Alliance

Mar 26, 2024 · Artificial Intelligence

Analysis and Forecast of Nvidia AI Chip Roadmap: From H100 to X100

The article analyzes Nvidia's AI chip evolution, assumes consistent storage‑compute‑interconnect ratios and predictable process scaling, and projects the architectures of H200, B100 and X100, highlighting the limits of chiplet packaging and the critical role of low‑latency, high‑reliability interconnect technologies for future AI compute scaling.

AI chips · Chiplet · Future Predictions

0 likes · 12 min read

Architects' Tech Alliance

Mar 24, 2024 · Artificial Intelligence

NVLink vs PCIe GPUs: Which NVIDIA Server GPU Wins for Your AI Workload?

This article compares NVIDIA's NVLink (SXM) and PCIe GPU versions for AI servers, detailing their architectures, bandwidth, power consumption, and ideal use cases, and provides guidance on selecting the right GPU based on workload size, flexibility, and cost considerations.

AI servers · GPU · Hardware Comparison

0 likes · 9 min read

Architects' Tech Alliance

Mar 22, 2024 · Industry Insights

Can Groq’s LPU Outsmart Nvidia GPUs in AI Inference?

The article examines Groq’s new LPU AI chip, comparing its inference speed and architecture to Nvidia GPUs, discusses the company’s market positioning, recent CEO statements, and the broader AI‑hardware race, while questioning whether Groq can become the go‑to accelerator for startups by the end of 2024.

AI chips · AI hardware · Groq

0 likes · 9 min read

Architects' Tech Alliance

Mar 20, 2024 · Industry Insights

What Nvidia’s B100 and GB200 Reveal About the Future of AI GPUs

The GTC 2024 recap highlights Nvidia’s upcoming B100 and GB200 GPUs, their Blackwell architecture, performance breakthroughs, embodied‑intelligence initiatives, and the expanding AI application ecosystem across industries, offering a clear view of the next wave in accelerated computing.

AI · B100 · Embodied Intelligence

0 likes · 7 min read

21CTO

Mar 20, 2024 · Artificial Intelligence

Nvidia Unveils Blackwell GPU: A Quantum Leap for Generative AI

Nvidia introduced the Blackwell GPU architecture at GTC, highlighting six breakthrough technologies, a 4nm process, massive performance gains, and its integration into DGX SuperPOD systems that promise to accelerate generative AI, data processing, and high‑performance computing across industries.

AI · Blackwell · GPU

0 likes · 14 min read

DataFunTalk

Mar 15, 2024 · Artificial Intelligence

NVIDIA’s NeMo Framework and TensorRT‑LLM: Full‑Stack Solutions for Large Language Models and Retrieval‑Augmented Generation

This article explains NVIDIA’s end‑to‑end ecosystem for large language models, covering the NeMo Framework’s data processing, distributed training, model fine‑tuning, inference acceleration with TensorRT‑LLM, deployment via Triton, and Retrieval‑Augmented Generation (RAG) techniques that enhance model reliability and performance.

AI · NVIDIA · NeMo

0 likes · 16 min read

Architects' Tech Alliance

Mar 12, 2024 · Industry Insights

What’s Nvidia’s 2024‑2025 AI Chip Roadmap? A Deep Dive into GPUs, CPUs, and Interconnects

The article analyzes Nvidia’s 2023 investor‑meeting roadmap, revealing an annual GPU release cadence with H200, B100 and X100 chips, a unified "One Architecture" strategy spanning x86 and ARM, accelerated interconnects like NVLink‑C2C, and competitive pressures shaping its AI ecosystem.

AI hardware · GPU roadmap · Industry Analysis

0 likes · 20 min read

21CTO

Mar 9, 2024 · Artificial Intelligence

Can AI Really Replace Programmers? A Critical Look at Jensen Huang’s Predictions

The article examines Jensen Huang’s claim that AI will make programming obsolete, discusses existing AI coding tools, highlights their limitations, and argues that human expertise in design, reasoning, and error‑checking remains essential for software development.

AI · NVIDIA · code generation

0 likes · 10 min read

DataFunTalk

Jan 31, 2024 · Artificial Intelligence

Introduction to NVIDIA TensorRT-LLM Inference Framework

TensorRT-LLM is NVIDIA's scalable inference framework for large language models that combines TensorRT compilation, fast kernels, multi‑GPU parallelism, low‑precision quantization, and a PyTorch‑like API to deliver high‑performance LLM serving with extensive customization and future‑focused enhancements.

GPU Acceleration · LLM Inference · NVIDIA

0 likes · 12 min read

Architects' Tech Alliance

Jan 25, 2024 · Industry Insights

Why Chinese Tech Giants Are Dropping Nvidia GPUs for Domestic Chips

Amid tightening U.S. export controls, Chinese cloud providers like Tencent, Alibaba, Baidu and ByteDance are cutting orders for Nvidia's downgraded AI GPUs and turning to domestic alternatives, driven by regulatory uncertainty, reduced performance of special‑edition chips, and a desire for more stable supply chains.

AI chips · China · Domestic alternatives

0 likes · 11 min read

Architects' Tech Alliance

Jan 21, 2024 · Industry Insights

What Nvidia GH200 and AMD MI300 Reveal About the Future of AI Compute

The article examines Nvidia's GH200 superchip and AMD's Instinct MI300, compares CPU, GPU, FPGA, and ASIC architectures, analyzes market share trends, and discusses opportunities for domestic chip makers in the rapidly evolving AI compute landscape.

AI chips · AMD · ASIC

0 likes · 13 min read

Architects' Tech Alliance

Jan 11, 2024 · Industry Insights

What Makes Nvidia’s RTX 5880 Ada Stand Out? Specs, Performance, and Market Position

Nvidia's RTX 5880 Ada, a China‑specific GPU built on a trimmed AD102 chip, offers 14,080 CUDA cores, 48 GB ECC GDDR6 memory, and an estimated 69.3 TFLOPS performance, positioning it between the RTX 6000 Ada and RTX 5000 Ada while complying with U.S. export limits.

CUDA cores · GPU · NVIDIA

0 likes · 8 min read

IT Services Circle

Jan 2, 2024 · Fundamentals

NVIDIA Introduces RTX 4090 D: China‑Specific GPU with Reduced CUDA and Tensor Cores

Due to U.S. export restrictions, NVIDIA released the China‑specific RTX 4090 D, which stays under the Total Processing Performance (TPP) limit by reducing the number of CUDA and Tensor cores while keeping most other specifications unchanged; it is priced the same as the standard RTX 4090.

GPU · Hardware Specs · NVIDIA

0 likes · 4 min read

Architects' Tech Alliance

Dec 27, 2023 · Industry Insights

Nvidia H100 vs Huawei Ascend 910B: In‑Depth GPU Performance and Bandwidth Comparison

This article compiles official specifications and benchmark data to compare Nvidia’s mainstream GPUs (L2, T4, A10, A10G, V100, A100, A800, H100, and the China‑market H20/L20) with Huawei’s Ascend 910B, highlighting performance differences, inter‑GPU bandwidth via NVLink versus HCCS, and key takeaways for AI workloads.

AI hardware · GPU · Huawei

0 likes · 5 min read

DataFunTalk

Dec 23, 2023 · Artificial Intelligence

NVIDIA Merlin: Product Overview, Models, Distributed Embeddings, Hierarchical KV and Parameter Server

This article introduces NVIDIA's Merlin recommendation system suite, detailing its product overview, model and system libraries, TensorFlow Distributed Embedding plugin, hierarchical key‑value store, and hierarchical parameter server, while highlighting integration with NVTabular, Triton, and performance gains on GPU‑accelerated training and inference.

Distributed Embedding · Hierarchical KV · Merlin

0 likes · 13 min read

Open Source Linux

Nov 28, 2023 · Information Security

How a Screen‑Sharing Slip Exposed Nvidia to a Multi‑Million Dollar IP Lawsuit

A careless screen‑sharing mistake by Nvidia engineer Mohammad Moniruzzaman revealed Valeo's confidential automotive source code, leading to a German court conviction, a hefty fine, and a lawsuit accusing Nvidia of profiting from stolen intellectual property.

IP theft · NVIDIA · Screen Sharing

0 likes · 7 min read

php Courses

Oct 24, 2023 · Fundamentals

Nvidia and AMD Plan Arm-Based CPUs for Windows PCs, Potential 2025 Release

According to Reuters, Nvidia and AMD are designing Arm‑based processors for Windows PCs that could be ready by 2025, while Microsoft continues its partnership with Qualcomm on Arm Windows 11 and rumors suggest Microsoft may also develop its own Arm server chips.

AMD · Arm · CPU

0 likes · 4 min read

21CTO

Oct 20, 2023 · Artificial Intelligence

How New US AI Chip Export Ban Could Reshape China's AI Landscape

New U.S. export restrictions targeting high‑end AI GPUs such as Nvidia’s H800 and A800 aim to curb China’s access to advanced compute, potentially slowing its AI model development, affecting major chip makers and prompting Chinese firms to stockpile hardware or accelerate domestic chip efforts.

AI chips · AMD · China AI

0 likes · 10 min read

Baidu Geek Talk

Aug 22, 2023 · Industry Insights

What Baidu’s First Commercial AI Competition Reveals About AIGC Trends

The article reviews Baidu's 2023 generative AI initiatives, details the inaugural Baidu Commercial AI Technology Innovation Competition co‑hosted with the China AI Society and NVIDIA, highlights winning teams' technical approaches in conversion prediction and inference optimization, and shares insights from industry leaders on future AI talent and innovation.

AI · AIGC · Baidu

0 likes · 8 min read

DataFunSummit

May 4, 2023 · Artificial Intelligence

An Overview of NVIDIA NeMo for Speech AI: ASR Training, Chinese Support, and Related Applications

This article provides a comprehensive introduction to NVIDIA's NeMo toolkit for conversational AI, detailing its ASR capabilities, model architectures, training workflow, Chinese language support, deployment options, and additional speech AI features such as VAD and speaker diarization.

ASR · Chinese Speech · Conformer

0 likes · 15 min read

Baidu Intelligent Cloud Tech Hub

Apr 17, 2023 · Artificial Intelligence

How NVIDIA’s GPU‑Powered AI is Revolutionizing Drug Discovery and Genomics

The article outlines NVIDIA’s Clara platform, BioNeMo framework, and GPU‑accelerated tools such as Clara Parabricks and RAPIDS, demonstrating how AI and high‑performance computing dramatically speed up drug‑target identification, molecular generation, protein structure prediction, and high‑throughput DNA/RNA sequencing, with benchmarks showing up to 80‑fold acceleration.

AI drug discovery · BioNeMo · CLARA

0 likes · 11 min read

Full-Stack DevOps & Kubernetes

Apr 5, 2023 · Cloud Native

Enable GPU Acceleration in Kubernetes with NVIDIA Device Plugin

This guide explains how to set up NVIDIA drivers, install the NVIDIA device plugin, and create a Kubernetes pod that requests GPU resources, providing step‑by‑step commands and a sample YAML manifest for GPU‑enabled workloads.

Cloud Native · Container Orchestration · Device Plugin

0 likes · 4 min read
