Tagged articles

NVIDIA

235 articles · Page 2 of 3
Instant Consumer Technology Team
Instant Consumer Technology Team
Aug 20, 2025 · Artificial Intelligence

Nvidia Unveils Nemotron‑Nano‑9B‑v2: Tiny Open‑Source LLM with Switchable Reasoning

Nvidia’s newly released Nemotron‑Nano‑9B‑v2, a 9‑billion‑parameter open‑source LLM optimized for a single Nvidia A10 GPU, introduces a toggleable reasoning mode and budget controls, delivering up to six‑fold speed gains, multilingual support, and strong benchmark results across various tasks.

AI inferenceMambaNVIDIA
0 likes · 5 min read
Nvidia Unveils Nemotron‑Nano‑9B‑v2: Tiny Open‑Source LLM with Switchable Reasoning
Refining Core Development Skills
Refining Core Development Skills
Aug 7, 2025 · Fundamentals

Why NVIDIA’s First Data‑Center GPU Revolutionized Computing: Inside the Tesla G80 Architecture

This article explains how NVIDIA transitioned from gaming graphics cards to general‑purpose GPUs with the first data‑center Tesla GPU, detailing the unified shader architecture, the internal components of TPCs and SMs, CUDA 1.0 programming basics, and performance calculations that illustrate the massive computational advantage over contemporary CPUs.

CUDAGPGPUGPU architecture
0 likes · 23 min read
Why NVIDIA’s First Data‑Center GPU Revolutionized Computing: Inside the Tesla G80 Architecture
AI Cyberspace
AI Cyberspace
Aug 4, 2025 · Artificial Intelligence

From Tesla to Hopper: How NVIDIA GPU Architectures Powered the AI Revolution

This article traces the evolution of NVIDIA GPU architectures—from the early Tesla series through Fermi, Kepler, Maxwell, Pascal, Volta, Turing, Ampere, Hopper, and up to the upcoming Blackwell—explaining their hardware innovations, CUDA programming model, and how each generation enabled breakthroughs in high‑performance computing, deep learning, and AI applications.

AICUDAGPU
0 likes · 67 min read
From Tesla to Hopper: How NVIDIA GPU Architectures Powered the AI Revolution
Architects' Tech Alliance
Architects' Tech Alliance
Jul 29, 2025 · Artificial Intelligence

Why NVIDIA Spectrum‑X and Quantum InfiniBand Are Redefining AI Data Center Networks

The article explains how AI‑driven data center networks must handle massive distributed workloads, why traditional Ethernet falls short, and how NVIDIA’s Spectrum‑X Ethernet and Quantum InfiniBand use loss‑less RDMA, dynamic routing, advanced congestion control, and hardware‑accelerated collective communication to deliver the bandwidth, latency, and scalability required for generative AI and large‑scale model training.

AIInfiniBandNVIDIA
0 likes · 8 min read
Why NVIDIA Spectrum‑X and Quantum InfiniBand Are Redefining AI Data Center Networks
Open Source Linux
Open Source Linux
Jul 16, 2025 · Artificial Intelligence

How Huawei’s New AI Chip Aims to Rival Nvidia and AMD GPUs

Huawei is developing a new AI‑focused GPU‑style chip that mirrors Nvidia and AMD architectures, aiming to ease Chinese developers’ shift from Nvidia hardware, but still faces software compatibility hurdles due to reliance on CUDA and ongoing U.S. export restrictions.

AI chipCUDAGPU
0 likes · 3 min read
How Huawei’s New AI Chip Aims to Rival Nvidia and AMD GPUs
AIWalker
AIWalker
Jun 18, 2025 · Artificial Intelligence

SeNaTra: Nvidia’s Spatial Grouping Layer Pushes Semantic Segmentation Past Swin Transformer

Nvidia introduces SeNaTra, a native‑segmentation vision transformer that replaces uniform down‑sampling with a content‑aware spatial grouping layer, delivering superior zero‑shot and supervised segmentation performance while cutting parameters and FLOPs compared with Swin Transformer and other backbones.

NVIDIASemantic SegmentationVision Transformer
0 likes · 29 min read
SeNaTra: Nvidia’s Spatial Grouping Layer Pushes Semantic Segmentation Past Swin Transformer
Ops Development Stories
Ops Development Stories
Jun 12, 2025 · Cloud Native

One-Click GPU-Enabled Kind Cluster Setup for Running Large AI Models

This tutorial walks you through using a one‑click script to create a GPU‑enabled Kind Kubernetes cluster, evenly distribute GPU resources across nodes with nvkind, install necessary drivers and toolkits, deploy a vLLM‑served large language model, and verify its operation, all on a local or cloud environment.

AI model deploymentDockerGPU
0 likes · 23 min read
One-Click GPU-Enabled Kind Cluster Setup for Running Large AI Models
Architects' Tech Alliance
Architects' Tech Alliance
Jun 9, 2025 · Artificial Intelligence

What Makes Nvidia’s Blackwell GPUs a Game-Changer for AI Performance?

In March 2024 Nvidia unveiled the Blackwell GPU family and the GB200 NVL72 architecture, featuring 3‑4 nm processes, redesigned CUDA cores, next‑gen ray‑tracing, upgraded DLSS, massive FP16/FP8 compute gains, 8 TB/s memory bandwidth, and NVLink Gen5, while also presenting complex power, cooling, and packaging challenges for large‑scale AI deployments.

AI accelerationBlackwellGPU
0 likes · 6 min read
What Makes Nvidia’s Blackwell GPUs a Game-Changer for AI Performance?
Architects' Tech Alliance
Architects' Tech Alliance
Jun 6, 2025 · Artificial Intelligence

B30 vs H20: Which NVIDIA GPU Wins for AI Workloads and Budgets?

This article compares NVIDIA’s China‑specific B30 and high‑end H20 GPUs, detailing their CPU/CPU architecture updates, memory technologies, architectural differences, performance metrics, power and cooling characteristics, and price positioning, to help enterprises and developers choose the most suitable accelerator for AI and deep‑learning tasks.

AI accelerationB30GPU
0 likes · 13 min read
B30 vs H20: Which NVIDIA GPU Wins for AI Workloads and Budgets?
Python Programming Learning Circle
Python Programming Learning Circle
Jun 2, 2025 · Artificial Intelligence

NVIDIA Adds Native Python Support to CUDA – What It Means for Developers

NVIDIA announced at GTC 2025 that CUDA will now natively support Python, allowing developers to write GPU‑accelerated code directly in Python without C/C++ knowledge, introducing new APIs, libraries, JIT compilation, performance tools, and a tile‑based programming model that aligns with Python’s array‑centric workflow.

AICUDAGPU
0 likes · 7 min read
NVIDIA Adds Native Python Support to CUDA – What It Means for Developers
ShiZhen AI
ShiZhen AI
May 26, 2025 · Industry Insights

Nvidia Plans Cheaper Blackwell AI Chip for China Amid Export Restrictions

Nvidia is reportedly preparing a lower‑cost Blackwell GPU for the Chinese market, priced at $6,500‑$8,000 and featuring 1.7 TB/s GDDR7 memory, while OpenAI’s o3 model uncovered a Linux kernel zero‑day (CVE‑2025‑37899), a study showed AI models can sabotage shutdown commands, and a tutorial demonstrates creating animated 3D icons with ChatGPT and Freepik tools.

3D icon creationAI hardwareAI safety
0 likes · 8 min read
Nvidia Plans Cheaper Blackwell AI Chip for China Amid Export Restrictions
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
May 22, 2025 · Artificial Intelligence

Deploy NVIDIA Cosmos Reason-1: Zero‑Code Physical AI on Alibaba Cloud PAI

Cosmos Reason-1, a customizable multimodal physical AI model from NVIDIA, can be quickly deployed on Alibaba Cloud’s PAI‑Model Gallery with zero‑code, offering automatic cloud resource adaptation, ready‑to‑use APIs, enterprise‑grade security, and demonstrated superior reasoning on video tasks, while the upcoming tools enable fine‑tuning via SFT and RL.

Alibaba CloudNVIDIAZero‑Code Deployment
0 likes · 8 min read
Deploy NVIDIA Cosmos Reason-1: Zero‑Code Physical AI on Alibaba Cloud PAI
AI Product Manager Community
AI Product Manager Community
May 20, 2025 · Industry Insights

How Nvidia Is Shaping the Future of AI Infrastructure and Physical AI

At the 2025 Taipei International Computer Expo, Nvidia CEO Jensen Huang outlined the company's shift from a chipmaker to an AI infrastructure leader, introduced the concept of physical AI, and detailed upcoming hardware, software, and strategic initiatives that could reshape data centers, robotics, and autonomous driving.

AI InfrastructureNVIDIAartificial-intelligence
0 likes · 7 min read
How Nvidia Is Shaping the Future of AI Infrastructure and Physical AI
DataFunTalk
DataFunTalk
May 8, 2025 · Artificial Intelligence

Anthropic’s Report on Lobsters, Pregnant Women, and Banned Chips: A Critical Look at US‑China AI Chip Policy

The article reviews Anthropic’s controversial report that links lobsters, pregnant women, and banned chips to illustrate absurd claims about China’s AI capabilities, arguing that US export restrictions on high‑performance GPUs are essential to maintain America’s lead in artificial intelligence.

AnthropicChip PolicyNVIDIA
0 likes · 8 min read
Anthropic’s Report on Lobsters, Pregnant Women, and Banned Chips: A Critical Look at US‑China AI Chip Policy
Architects' Tech Alliance
Architects' Tech Alliance
May 6, 2025 · Artificial Intelligence

Evolution of NVIDIA GPU Architectures for AI from Volta to Blackwell

The article reviews NVIDIA's GPU architecture progression—from Volta's pioneering Tensor Cores through Turing, Ampere, Hopper, and the latest Blackwell and Rubin designs—highlighting key innovations, performance gains for deep learning, and related resource updates for AI practitioners.

GPU architectureHigh‑Performance ComputingNVIDIA
0 likes · 9 min read
Evolution of NVIDIA GPU Architectures for AI from Volta to Blackwell
Fighter's World
Fighter's World
May 2, 2025 · Industry Insights

Token Economics Reveals Nvidia’s New AI Factory Narrative

The article analyses Nvidia’s shift from a chip supplier to a full‑stack AI infrastructure provider called AI Factory, explains the token‑economics framework that measures intelligent output, details the hardware‑software stack and network fabric, quantifies token consumption of advanced agents, and evaluates the strategic opportunities and risks for Nvidia.

AI InfrastructureAI factoryAgentic AI
0 likes · 29 min read
Token Economics Reveals Nvidia’s New AI Factory Narrative
Architects' Tech Alliance
Architects' Tech Alliance
Apr 28, 2025 · Artificial Intelligence

NVLink High‑Speed Interconnect: Architecture, Evolution, and Performance

NVLink, NVIDIA's high‑bandwidth interconnect introduced with the P100 GPU, replaces PCIe by offering significantly higher data rates and lower latency for GPU‑GPU and GPU‑CPU communication, and has evolved through multiple generations to support modern AI and high‑performance computing workloads.

AI accelerationGPU interconnectNVIDIA
0 likes · 9 min read
NVLink High‑Speed Interconnect: Architecture, Evolution, and Performance
Architects' Tech Alliance
Architects' Tech Alliance
Apr 13, 2025 · Industry Insights

Which NVIDIA GPU Wins for AI? Deep Dive into RTX & A‑Series Performance and Power

This article presents a detailed comparison of major NVIDIA GPUs—including RTX 4090, RTX 4090 D, RTX 3090, A10, A40, A100, and H100—covering memory size, bandwidth, Tensor BF16/FP16/FP32 throughput, FP16/FP32 performance, power draw and release dates, and explains how these specs affect AI workload efficiency.

AI workloadsGPUIndustry Analysis
0 likes · 9 min read
Which NVIDIA GPU Wins for AI? Deep Dive into RTX & A‑Series Performance and Power
AI Frontier Lectures
AI Frontier Lectures
Apr 8, 2025 · Industry Insights

Nvidia’s GPU Names Explained: Ampere, Hopper, Blackwell, Rubin, Feynman

At the recent GTC conference Nvidia unveiled its roadmap of AI‑focused GPUs—Ampere, Hopper, Blackwell, Rubin and the upcoming Feynman—each named after a pioneering scientist, and this article explores the historical contributions of André‑Marie Ampère, Grace Hopper, David Blackwell, Vera Rubin and Richard Feynman, linking their legacies to the architectures’ innovations.

AIGPUNVIDIA
0 likes · 10 min read
Nvidia’s GPU Names Explained: Ampere, Hopper, Blackwell, Rubin, Feynman
Architects' Tech Alliance
Architects' Tech Alliance
Mar 28, 2025 · Artificial Intelligence

Evolution of NVIDIA GPU Architectures for Deep Learning: From Volta to Blackwell and Rubin

The article traces NVIDIA’s GPU architecture evolution from the Volta era’s pioneering Tensor Cores through Turing, Ampere, Hopper, and the latest Blackwell and Rubin designs, highlighting key innovations such as mixed‑precision support, sparsity, NVLink, and their impact on deep‑learning performance.

AI hardwareGPUNVIDIA
0 likes · 10 min read
Evolution of NVIDIA GPU Architectures for Deep Learning: From Volta to Blackwell and Rubin
Infra Learning Club
Infra Learning Club
Mar 23, 2025 · Artificial Intelligence

Getting Started with cuda‑python and an Introduction to cuTicle

This article explains the cuda‑python ecosystem—including its core packages, installation via pip or conda, the experimental cuda.core API, a full Python‑to‑CUDA workflow with NVRTC compilation, performance comparison to C++, the covered APIs, and an overview of NVIDIA's new cuTicle programming model.

CUDAGPUNVIDIA
0 likes · 11 min read
Getting Started with cuda‑python and an Introduction to cuTicle
Code Mala Tang
Code Mala Tang
Mar 21, 2025 · Artificial Intelligence

What Are the Four Waves of AI and How NVIDIA Is Shaping the Future?

NVIDIA’s GTC 2025 keynote outlines the four AI waves—from perception to physical AI—while highlighting the company’s latest Blackwell chips, DGX Spark/Station computers, Dynamo inference accelerator, robotics collaborations, GM autonomous‑driving partnership, and AI‑native 6G efforts, underscoring massive data‑center investment and future challenges.

AI hardwareData CenterNVIDIA
0 likes · 11 min read
What Are the Four Waves of AI and How NVIDIA Is Shaping the Future?
Software Engineering 3.0 Era
Software Engineering 3.0 Era
Mar 21, 2025 · Artificial Intelligence

NVIDIA GTC 2025: How AI Is Shifting From a Niche Toy to a Mass‑Market Tool and Ushering in the Robot Era

At NVIDIA GTC 2025, the company unveiled the Isaac GR00T N1 foundation model and the Mega Omniverse Blueprint platform, showing how AI‑driven robotics are moving from specialist prototypes to everyday tools through dual‑system architecture, synthetic‑data generation, cloud‑based development, and broad industry collaborations.

AICloudIsaac GR00T
0 likes · 10 min read
NVIDIA GTC 2025: How AI Is Shifting From a Niche Toy to a Mass‑Market Tool and Ushering in the Robot Era
Infra Learning Club
Infra Learning Club
Mar 20, 2025 · Artificial Intelligence

How GPU Frequency, Power Consumption, and FLOPS Interrelate

The article explains the theoretical and practical relationships between GPU clock frequencies, power consumption, and FLOPS, describes key hardware metrics such as SM, memory, and video clocks, shows how to query and set these values with nvidia‑smi, and presents experiments on a Tesla P4 that reveal the non‑linear trade‑offs between performance, power, and temperature.

Clock SpeedDVFSFLOPS
0 likes · 15 min read
How GPU Frequency, Power Consumption, and FLOPS Interrelate
Model Perspective
Model Perspective
Feb 27, 2025 · Artificial Intelligence

Why AI Model Cost Cuts Trigger a New Wave of Nvidia Demand

The article explains how DeepSeek’s low‑cost large‑language‑model training reduces GPU price pressure, yet paradoxically fuels greater demand for Nvidia hardware by lowering entry barriers, illustrating the modern Jevons paradox and its broader economic and societal implications.

AI hardwareDeepSeekGPU demand
0 likes · 8 min read
Why AI Model Cost Cuts Trigger a New Wave of Nvidia Demand
Infra Learning Club
Infra Learning Club
Feb 12, 2025 · Fundamentals

Why Does Nvidia Report Less GPU Memory Than Specified?

The article investigates why Nvidia L40S and RTX A6000 GPUs show less memory via nvidia‑smi than their advertised 48 GB, revealing that enabled ECC memory reserves a few gigabytes, and demonstrates the effect by toggling ECC on a Tesla‑T4 card.

ECCGPU memoryL40S
0 likes · 4 min read
Why Does Nvidia Report Less GPU Memory Than Specified?
Software Engineering 3.0 Era
Software Engineering 3.0 Era
Jan 27, 2025 · Industry Insights

What Capital Currents Hide Behind DeepSeek’s R1 Model Surge?

The article analyzes how DeepSeek’s R1 model, touted as a low‑cost AI breakthrough, sparked Wall Street speculation, prompted a sharp Nvidia stock decline, and may be part of a broader quant‑driven strategy to manipulate market sentiment and capture short‑term capital gains.

AI hardwareDeepSeekNVIDIA
0 likes · 8 min read
What Capital Currents Hide Behind DeepSeek’s R1 Model Surge?
DataFunSummit
DataFunSummit
Jan 24, 2025 · Artificial Intelligence

Challenges and Debugging Strategies for FP8 Training of Large Models

The article explains the performance benefits of using FP8 for large‑model training, outlines three main categories of FP8‑related issues such as loss spikes, divergence, and downstream metric gaps, and introduces a dedicated FP8 debug tool with metrics like MSE, cosine similarity, underflow, and overflow to help diagnose and resolve these problems.

AIFP8NVIDIA
0 likes · 9 min read
Challenges and Debugging Strategies for FP8 Training of Large Models
Software Engineering 3.0 Era
Software Engineering 3.0 Era
Jan 18, 2025 · Industry Insights

Is AI Self‑Programming and Recursive Self‑Improvement Signaling the Endgame?

The article examines Nvidia’s claim that AI can now write software and build an “AI factory,” analyzes OpenAI’s emerging o‑series models that purportedly achieve recursive self‑improvement, and surveys community reactions ranging from excitement to safety concerns about a potential AI “game over.”

AI safetyIndustry AnalysisNVIDIA
0 likes · 8 min read
Is AI Self‑Programming and Recursive Self‑Improvement Signaling the Endgame?
Java Tech Enthusiast
Java Tech Enthusiast
Jan 9, 2025 · Cloud Native

Configuring NVIDIA Docker Plugin and GPU Access in Kubernetes

This guide walks through installing the NVIDIA container toolkit, configuring Docker to use the NVIDIA runtime, verifying GPU access, deploying the NVIDIA device plugin in Kubernetes, labeling GPU nodes, and running a GPU‑accelerated FFmpeg pod to confirm successful GPU integration.

Container ToolkitDockerGPU
0 likes · 12 min read
Configuring NVIDIA Docker Plugin and GPU Access in Kubernetes
Liangxu Linux
Liangxu Linux
Jan 8, 2025 · Cloud Native

Enable NVIDIA GPU Access in Docker and Kubernetes with the NVIDIA Container Toolkit

This guide walks through checking system and software environments, installing and configuring the NVIDIA Docker plugin, verifying GPU access in Docker containers, deploying the NVIDIA device plugin on a Kubernetes cluster, creating GPU‑enabled pods, and troubleshooting common issues, all with concrete commands and configuration examples.

Container ToolkitFFmpegGPU
0 likes · 12 min read
Enable NVIDIA GPU Access in Docker and Kubernetes with the NVIDIA Container Toolkit
21CTO
21CTO
Jan 7, 2025 · Artificial Intelligence

Nvidia Reveals RTX 50 GPUs, Thor Auto Chip, and AI Supercomputer at CES 2025

At CES 2025, Nvidia CEO Jensen Huang announced the RTX 50 series GPUs built on the Blackwell architecture, the Thor automotive processor, the Project Digits personal AI supercomputer, new AI agents and robotics initiatives, detailing pricing, performance specs, and partnerships across automotive and AI ecosystems.

AutomotiveCES 2025GPU
0 likes · 10 min read
Nvidia Reveals RTX 50 GPUs, Thor Auto Chip, and AI Supercomputer at CES 2025
Architects' Tech Alliance
Architects' Tech Alliance
Jan 6, 2025 · Industry Insights

How Nvidia’s GB300 GPU Is Shaping AI Inference and Cloud Supply Chains

The article provides a detailed technical analysis of Nvidia’s new GB300 and B300 GPUs, comparing their performance, memory architecture, and power consumption to previous generations, and examines how these changes affect AI inference workloads, NVL72 accelerator systems, and the supply‑chain strategies of major cloud providers.

AI inferenceCloud ComputingGPU
0 likes · 12 min read
How Nvidia’s GB300 GPU Is Shaping AI Inference and Cloud Supply Chains
Architects' Tech Alliance
Architects' Tech Alliance
Dec 10, 2024 · Industry Insights

Could Nvidia Face Up to $50 Billion in Chinese Antitrust Fines?

China’s market regulator has opened an antitrust investigation into Nvidia over alleged breaches of its 2020 Mellanox acquisition commitments, and analysts estimate that, based on the country’s Anti‑Monopoly Law, the company could be fined anywhere from $1 billion to as much as $50 billion, depending on the severity of the violation.

AntitrustChinaMarket Analysis
0 likes · 6 min read
Could Nvidia Face Up to $50 Billion in Chinese Antitrust Fines?
DataFunSummit
DataFunSummit
Oct 2, 2024 · Artificial Intelligence

NVIDIA’s Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation

This article explains NVIDIA’s end‑to‑end stack for large language models, covering the NeMo Framework for data processing, training, and deployment, the open‑source TensorRT‑LLM inference accelerator, and the Retrieval‑Augmented Generation (RAG) technique that enriches model outputs with external knowledge.

NVIDIANeMoRAG
0 likes · 17 min read
NVIDIA’s Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation
Architects' Tech Alliance
Architects' Tech Alliance
Sep 25, 2024 · Fundamentals

NVIDIA Quantum‑2 InfiniBand Platform: Technical Overview, Q&A, and Deployment Guidance

This article explains the growing demand for high‑performance computing, introduces NVIDIA's Quantum‑2 InfiniBand platform with its high‑speed, low‑latency capabilities, provides a curated list of related technical articles, and offers an extensive Q&A covering compatibility, cabling, UFM, PCIe limits, and best‑practice deployment for AI and HPC workloads.

AIGPUInfiniBand
0 likes · 11 min read
NVIDIA Quantum‑2 InfiniBand Platform: Technical Overview, Q&A, and Deployment Guidance
21CTO
21CTO
Sep 11, 2024 · Artificial Intelligence

How Volvo’s Nvidia‑Powered Software Stack Will Redefine EV Costs

Volvo announced that its upcoming EX90 electric SUV will run on a unified software platform powered by Nvidia's Drive Orin AI chip, using megacasting manufacturing to cut costs while avoiding subscription‑based revenue models.

AI chipsMegacastingNVIDIA
0 likes · 3 min read
How Volvo’s Nvidia‑Powered Software Stack Will Redefine EV Costs
DataFunSummit
DataFunSummit
Sep 5, 2024 · Artificial Intelligence

NVIDIA’s End‑to‑End Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation

This article introduces NVIDIA’s comprehensive solutions for large language models, covering the NeMo Framework’s full‑stack development pipeline, the open‑source TensorRT‑LLM inference accelerator, and Retrieval‑Augmented Generation techniques, while detailing data preprocessing, distributed training, model fine‑tuning, deployment, and performance optimizations.

NVIDIANeMo FrameworkRetrieval-Augmented Generation
0 likes · 16 min read
NVIDIA’s End‑to‑End Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation
Architects' Tech Alliance
Architects' Tech Alliance
Sep 3, 2024 · Industry Insights

How NVIDIA Grace Hopper Superchip Redefines HPC and AI Performance

The article provides an in‑depth technical overview of NVIDIA's Grace Hopper superchip, detailing its heterogeneous CPU‑GPU architecture, high‑bandwidth NVLink‑C2C interconnect, unified memory model, programming support, and system‑level scaling features that together deliver unprecedented performance for high‑performance computing and large‑scale AI workloads.

AIGrace HopperHPC
0 likes · 20 min read
How NVIDIA Grace Hopper Superchip Redefines HPC and AI Performance
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Aug 30, 2024 · Industry Insights

How GPU Virtualization Powers AI and Cloud Computing: Techniques, Challenges, and Future Directions

This article examines the rapid rise of GPU virtualization as a solution for efficient GPU resource utilization in AI, big data, and high‑performance computing, detailing its concepts, implementation methods across user, kernel, and hardware layers, Kubernetes integration, real‑world use cases, challenges, and emerging research trends.

Cloud ComputingDevice PluginGPU virtualization
0 likes · 25 min read
How GPU Virtualization Powers AI and Cloud Computing: Techniques, Challenges, and Future Directions
Architects' Tech Alliance
Architects' Tech Alliance
Aug 29, 2024 · Industry Insights

How NVIDIA Builds 256‑GPU and 576‑GPU SuperPods with H100, GH200, and GB200 Interconnects

The article analyzes NVIDIA's DGX SuperPOD architectures across three GPU generations—H100, GH200, and GB200—detailing their NVLink/NVSwitch topologies, bandwidth calculations, scalability limits, and the practical challenges of constructing 256‑GPU and 576‑GPU supercomputing clusters.

Data CenterGPUHigh-performance computing
0 likes · 11 min read
How NVIDIA Builds 256‑GPU and 576‑GPU SuperPods with H100, GH200, and GB200 Interconnects
Architects' Tech Alliance
Architects' Tech Alliance
Jul 25, 2024 · Artificial Intelligence

NVIDIA H20 AI Chip Launch and the Rapid Growth of China's AI Chip Market

The article reviews NVIDIA's newly released H20 AI accelerator for China, compares its performance and pricing with domestic chips, outlines the expanding Chinese AI chip ecosystem—including Huawei, Cambricon, HaiGuang, Alibaba, ByteDance, and Baidu—while highlighting market size growth, multi‑chip heterogeneity strategies, and the strong demand forecast through 2024.

AI chipsAI computeChina
0 likes · 8 min read
NVIDIA H20 AI Chip Launch and the Rapid Growth of China's AI Chip Market
Open Source Linux
Open Source Linux
Jul 19, 2024 · Artificial Intelligence

How Much Is the PCB Inside an NVIDIA DGX A100 Worth? A Deep Dive

This article dissects the PCB composition of NVIDIA's DGX A100 AI server, detailing the GPU board, CPU motherboard, and auxiliary components to reveal their material area, cost breakdown, and overall value contribution in high‑performance computing systems.

AI serverDGX A100NVIDIA
0 likes · 11 min read
How Much Is the PCB Inside an NVIDIA DGX A100 Worth? A Deep Dive
Architects' Tech Alliance
Architects' Tech Alliance
Jul 9, 2024 · Industry Insights

How Nvidia’s Accelerated GPU Roadmap Is Shaping AI‑Scale Networking

Nvidia plans to shorten its GPU generation cycle to one year, launching Blackwell Ultra in 2025, Rubin in 2026, and Rubin Ultra in 2027, while boosting token‑generation efficiency and introducing AI‑optimized Ethernet solutions like Spectrum‑X800, aiming to dominate large‑scale AI clusters and reshape the high‑performance networking market.

AIEthernetGPU
0 likes · 6 min read
How Nvidia’s Accelerated GPU Roadmap Is Shaping AI‑Scale Networking
Architects' Tech Alliance
Architects' Tech Alliance
Jun 16, 2024 · Industry Insights

How Nvidia’s Blackwell GPUs Aim to Slash AI Training Costs and Power

The article analyzes Nvidia’s historic advantage, the massive performance and energy efficiency gains from Pascal to Blackwell GPUs, the economics of training large language models like GPT‑4, and the detailed roadmap of upcoming GPU, memory, and interconnect technologies shaping the future of data‑center AI.

AIGPUNVIDIA
0 likes · 14 min read
How Nvidia’s Blackwell GPUs Aim to Slash AI Training Costs and Power
DevOps
DevOps
Jun 13, 2024 · R&D Management

Jensen Huang on Management Philosophy, Team Structure, and Innovation at NVIDIA

In this interview, NVIDIA founder Jensen Huang shares his management philosophy, emphasizing the value of tackling difficult tasks, maintaining a small yet empowered team, avoiding layoffs, fostering a zero‑market mindset, navigating the early challenges of CUDA, and leveraging AI to drive future innovation.

AICUDALeadership
0 likes · 12 min read
Jensen Huang on Management Philosophy, Team Structure, and Innovation at NVIDIA
21CTO
21CTO
Jun 7, 2024 · Artificial Intelligence

Nvidia Beats Apple in Market Value: AI Chip Wars, New AMD Processors & More

This roundup highlights Nvidia surpassing Apple in market cap, AMD's next‑gen AI processors, Elon Musk shifting Nvidia chips to X, Microsoft’s latest layoffs and AI spending, Google’s new developer program, GitHub Actions Arm64 support, Ubuntu Core 24 for IoT, and the release of Zabbix 7.0.

AI hardwareCloud ComputingDeveloper Tools
0 likes · 12 min read
Nvidia Beats Apple in Market Value: AI Chip Wars, New AMD Processors & More
IT Services Circle
IT Services Circle
Jun 6, 2024 · Artificial Intelligence

Nvidia Unveils Blackwell GPU and AI Supercomputing Roadmap

Nvidia’s latest Blackwell GPU, presented by Jensen Huang, promises unprecedented performance and energy efficiency for large‑scale AI models, while the company also showcases accelerated computing, NVLink interconnects, AI‑optimized DGX servers, the NIM platform for rapid LLM deployment, and ambitious projects such as Earth‑2 digital twins and next‑generation embodied AI robots.

AIBlackwellGPU
0 likes · 18 min read
Nvidia Unveils Blackwell GPU and AI Supercomputing Roadmap
Architects' Tech Alliance
Architects' Tech Alliance
May 1, 2024 · Industry Insights

How NVIDIA’s Blackwell Platform Redefines AI Supercomputing Networks

The article examines NVIDIA’s Blackwell platform network architecture, detailing the fifth‑generation NVLink, sixth‑generation PCIe, 800 Gb/s InfiniBand and Ethernet adapters, the DGX B200 and GB200 configurations, new IB and Ethernet switches, and the implications of increased optical module demands for large‑scale AI clusters.

AI supercomputingBlackwellDGX
0 likes · 10 min read
How NVIDIA’s Blackwell Platform Redefines AI Supercomputing Networks
DataFunSummit
DataFunSummit
Apr 14, 2024 · Artificial Intelligence

TensorRT-LLM: NVIDIA’s Scalable LLM Inference Framework – Overview, Features, Workflow, Performance, and Future Directions

This article presents a comprehensive overview of NVIDIA’s TensorRT-LLM, detailing its product positioning as a scalable LLM inference solution, key features such as model support, low-precision and quantization techniques, parallelism strategies, the end-to-end usage workflow, performance highlights, future roadmap, and answers to common technical questions.

LLM InferenceNVIDIAQuantization
0 likes · 13 min read
TensorRT-LLM: NVIDIA’s Scalable LLM Inference Framework – Overview, Features, Workflow, Performance, and Future Directions
Architects' Tech Alliance
Architects' Tech Alliance
Apr 2, 2024 · Artificial Intelligence

Evolution and Forecast of Nvidia NVLink, NVLink C2C, and B100/X100 GPU Architectures

The article analyses the historical evolution of Nvidia's NVLink and NVLink C2C interconnect technologies, compares them with PCIe, Ethernet and InfiniBand, and uses these trends to predict future AI‑chip architectures such as the B100 and X100 GPUs, highlighting design trade‑offs and packaging challenges.

AI chipB100GPU architecture
0 likes · 15 min read
Evolution and Forecast of Nvidia NVLink, NVLink C2C, and B100/X100 GPU Architectures
Architects' Tech Alliance
Architects' Tech Alliance
Mar 30, 2024 · Industry Insights

How NVIDIA’s B200 GPU Redefines AI Compute and What It Means for the Chip Market

The article analyzes the latest AI‑compute announcements from NVIDIA, AMD and Intel—including NVIDIA’s B200 GPU with 20 petaFLOPS FP4 performance, AMD’s MI300/MI400 roadmap, and Intel’s Gaudi 3 and Falcon Shores—while examining pricing, launch timelines, supply‑chain capacity, and the shifting market share among major cloud providers.

AI computeAMDGPU
0 likes · 10 min read
How NVIDIA’s B200 GPU Redefines AI Compute and What It Means for the Chip Market
Sohu Tech Products
Sohu Tech Products
Mar 27, 2024 · Artificial Intelligence

NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions

NVIDIA’s comprehensive LLM ecosystem combines the full‑stack NeMo Framework for data curation, distributed training, fine‑tuning, inference acceleration with TensorRT‑LLM and Triton, plus Retrieval‑Augmented Generation and Guardrails, enabling efficient, low‑latency, knowledge‑grounded model deployment across clusters.

AI accelerationModel TrainingNVIDIA
0 likes · 16 min read
NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions
Architects' Tech Alliance
Architects' Tech Alliance
Mar 26, 2024 · Artificial Intelligence

Analysis and Forecast of Nvidia AI Chip Roadmap: From H100 to X100

The article analyzes Nvidia's AI chip evolution, assumes consistent storage‑compute‑interconnect ratios and predictable process scaling, and projects the architectures of H200, B100 and X100, highlighting the limits of chiplet packaging and the critical role of low‑latency, high‑reliability interconnect technologies for future AI compute scaling.

AI chipsChipletFuture Predictions
0 likes · 12 min read
Analysis and Forecast of Nvidia AI Chip Roadmap: From H100 to X100
Architects' Tech Alliance
Architects' Tech Alliance
Mar 22, 2024 · Industry Insights

Can Groq’s LPU Outsmart Nvidia GPUs in AI Inference?

The article examines Groq’s new LPU AI chip, comparing its inference speed and architecture to Nvidia GPUs, discusses the company’s market positioning, recent CEO statements, and the broader AI‑hardware race, while questioning whether Groq can become the go‑to accelerator for startups by the end of 2024.

AI chipsAI hardwareGroq
0 likes · 9 min read
Can Groq’s LPU Outsmart Nvidia GPUs in AI Inference?
Architects' Tech Alliance
Architects' Tech Alliance
Mar 20, 2024 · Industry Insights

What Nvidia’s B100 and GB200 Reveal About the Future of AI GPUs

The GTC 2024 recap highlights Nvidia’s upcoming B100 and GB200 GPUs, their BlackWell architecture, performance breakthroughs, embodied‑intelligence initiatives, and the expanding AI application ecosystem across industries, offering a clear view of the next wave in accelerated computing.

AIB100Embodied Intelligence
0 likes · 7 min read
What Nvidia’s B100 and GB200 Reveal About the Future of AI GPUs
21CTO
21CTO
Mar 20, 2024 · Artificial Intelligence

Nvidia Unveils Blackwell GPU: A Quantum Leap for Generative AI

Nvidia introduced the Blackwell GPU architecture at GTC, highlighting six breakthrough technologies, a 4nm process, massive performance gains, and its integration into DGX SuperPOD systems that promise to accelerate generative AI, data processing, and high‑performance computing across industries.

AIBlackwellGPU
0 likes · 14 min read
Nvidia Unveils Blackwell GPU: A Quantum Leap for Generative AI
DataFunTalk
DataFunTalk
Mar 15, 2024 · Artificial Intelligence

NVIDIA’s NeMo Framework and TensorRT‑LLM: Full‑Stack Solutions for Large Language Models and Retrieval‑Augmented Generation

This article explains NVIDIA’s end‑to‑end ecosystem for large language models, covering the NeMo Framework’s data processing, distributed training, model fine‑tuning, inference acceleration with TensorRT‑LLM, deployment via Triton, and Retrieval‑Augmented Generation (RAG) techniques that enhance model reliability and performance.

AINVIDIANeMo
0 likes · 16 min read
NVIDIA’s NeMo Framework and TensorRT‑LLM: Full‑Stack Solutions for Large Language Models and Retrieval‑Augmented Generation
Architects' Tech Alliance
Architects' Tech Alliance
Mar 12, 2024 · Industry Insights

What’s Nvidia’s 2024‑2025 AI Chip Roadmap? A Deep Dive into GPUs, CPUs, and Interconnects

The article analyzes Nvidia’s 2023 investor‑meeting roadmap, revealing an annual GPU release cadence with H200, B100 and X100 chips, a unified "One Architecture" strategy spanning x86 and ARM, accelerated interconnects like NVLink‑C2C, and competitive pressures shaping its AI ecosystem.

AI hardwareGPU roadmapIndustry Analysis
0 likes · 20 min read
What’s Nvidia’s 2024‑2025 AI Chip Roadmap? A Deep Dive into GPUs, CPUs, and Interconnects
21CTO
21CTO
Mar 9, 2024 · Artificial Intelligence

Can AI Really Replace Programmers? A Critical Look at Jensen Huang’s Predictions

The article examines Jensen Huang’s claim that AI will make programming obsolete, discusses existing AI coding tools, highlights their limitations, and argues that human expertise in design, reasoning, and error‑checking remains essential for software development.

AINVIDIAcode generation
0 likes · 10 min read
Can AI Really Replace Programmers? A Critical Look at Jensen Huang’s Predictions
DataFunTalk
DataFunTalk
Jan 31, 2024 · Artificial Intelligence

Introduction to NVIDIA TensorRT-LLM Inference Framework

TensorRT-LLM is NVIDIA's scalable inference framework for large language models that combines TensorRT compilation, fast kernels, multi‑GPU parallelism, low‑precision quantization, and a PyTorch‑like API to deliver high‑performance LLM serving with extensive customization and future‑focused enhancements.

GPU AccelerationLLM InferenceNVIDIA
0 likes · 12 min read
Introduction to NVIDIA TensorRT-LLM Inference Framework
Architects' Tech Alliance
Architects' Tech Alliance
Jan 25, 2024 · Industry Insights

Why Chinese Tech Giants Are Dropping Nvidia GPUs for Domestic Chips

Amid tightening U.S. export controls, Chinese cloud providers like Tencent, Alibaba, Baidu and ByteDance are cutting orders for Nvidia's downgraded AI GPUs and turning to domestic alternatives, driven by regulatory uncertainty, reduced performance of special‑edition chips, and a desire for more stable supply chains.

AI chipsChinaDomestic alternatives
0 likes · 11 min read
Why Chinese Tech Giants Are Dropping Nvidia GPUs for Domestic Chips
DataFunTalk
DataFunTalk
Dec 23, 2023 · Artificial Intelligence

NVIDIA Merlin: Product Overview, Models, Distributed Embeddings, Hierarchical KV and Parameter Server

This article introduces NVIDIA's Merlin recommendation system suite, detailing its product overview, model and system libraries, TensorFlow Distributed Embedding plugin, hierarchical key‑value store, and hierarchical parameter server, while highlighting integration with NVTABULAR, Triton, and performance gains on GPU‑accelerated training and inference.

Distributed EmbeddingHierarchical KVMerlin
0 likes · 13 min read
NVIDIA Merlin: Product Overview, Models, Distributed Embeddings, Hierarchical KV and Parameter Server
21CTO
21CTO
Oct 20, 2023 · Artificial Intelligence

How New US AI Chip Export Ban Could Reshape China's AI Landscape

New U.S. export restrictions targeting high‑end AI GPUs such as Nvidia’s H800 and A800 aim to curb China’s access to advanced compute, potentially slowing its AI model development, affecting major chip makers and prompting Chinese firms to stockpile hardware or accelerate domestic chip efforts.

AI chipsAMDChina AI
0 likes · 10 min read
How New US AI Chip Export Ban Could Reshape China's AI Landscape
Baidu Geek Talk
Baidu Geek Talk
Aug 22, 2023 · Industry Insights

What Baidu’s First Commercial AI Competition Reveals About AIGC Trends

The article reviews Baidu's 2023 generative AI initiatives, details the inaugural Baidu Commercial AI Technology Innovation Competition co‑hosted with the China AI Society and NVIDIA, highlights winning teams' technical approaches in conversion prediction and inference optimization, and shares insights from industry leaders on future AI talent and innovation.

AIAIGCBaidu
0 likes · 8 min read
What Baidu’s First Commercial AI Competition Reveals About AIGC Trends
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Apr 17, 2023 · Artificial Intelligence

How NVIDIA’s GPU‑Powered AI is Revolutionizing Drug Discovery and Genomics

The article outlines NVIDIA’s CLARA platform, BioNeMo framework, and GPU‑accelerated tools such as CLARA Parabricks and RAPIDS, demonstrating how AI and high‑performance computing dramatically speed up drug‑target identification, molecular generation, protein structure prediction, and high‑throughput DNA/RNA sequencing, with benchmarks showing up to 80‑fold acceleration.

AI drug discoveryBioNeMoCLARA
0 likes · 11 min read
How NVIDIA’s GPU‑Powered AI is Revolutionizing Drug Discovery and Genomics