Tagged articles
204 articles
Page 2 of 3
Infra Learning Club
Infra Learning Club
Mar 20, 2025 · Artificial Intelligence

How GPU Frequency, Power Consumption, and FLOPS Interrelate

The article explains the theoretical and practical relationships between GPU clock frequencies, power consumption, and FLOPS, describes key hardware metrics such as SM, memory, and video clocks, shows how to query and set these values with nvidia‑smi, and presents experiments on a Tesla P4 that reveal the non‑linear trade‑offs between performance, power, and temperature.

Clock SpeedDVFSFLOPS
0 likes · 15 min read
How GPU Frequency, Power Consumption, and FLOPS Interrelate
Model Perspective
Model Perspective
Feb 27, 2025 · Artificial Intelligence

Why AI Model Cost Cuts Trigger a New Wave of Nvidia Demand

The article explains how DeepSeek’s low‑cost large‑language‑model training reduces GPU price pressure, yet paradoxically fuels greater demand for Nvidia hardware by lowering entry barriers, illustrating the modern Jevons paradox and its broader economic and societal implications.

AI hardwareDeepSeekGPU demand
0 likes · 8 min read
Why AI Model Cost Cuts Trigger a New Wave of Nvidia Demand
Infra Learning Club
Infra Learning Club
Feb 12, 2025 · Fundamentals

Why Does Nvidia Report Less GPU Memory Than Specified?

The article investigates why Nvidia L40S and RTX A6000 GPUs show less memory via nvidia‑smi than their advertised 48 GB, revealing that enabled ECC memory reserves a few gigabytes, and demonstrates the effect by toggling ECC on a Tesla‑T4 card.

ECCGPU MemoryL40S
0 likes · 4 min read
Why Does Nvidia Report Less GPU Memory Than Specified?
DataFunSummit
DataFunSummit
Jan 24, 2025 · Artificial Intelligence

Challenges and Debugging Strategies for FP8 Training of Large Models

The article explains the performance benefits of using FP8 for large‑model training, outlines three main categories of FP8‑related issues such as loss spikes, divergence, and downstream metric gaps, and introduces a dedicated FP8 debug tool with metrics like MSE, cosine similarity, underflow, and overflow to help diagnose and resolve these problems.

AIDebuggingFP8
0 likes · 9 min read
Challenges and Debugging Strategies for FP8 Training of Large Models
Java Tech Enthusiast
Java Tech Enthusiast
Jan 9, 2025 · Cloud Native

Configuring NVIDIA Docker Plugin and GPU Access in Kubernetes

This guide walks through installing the NVIDIA container toolkit, configuring Docker to use the NVIDIA runtime, verifying GPU access, deploying the NVIDIA device plugin in Kubernetes, labeling GPU nodes, and running a GPU‑accelerated FFmpeg pod to confirm successful GPU integration.

Container ToolkitDockerGPU
0 likes · 12 min read
Configuring NVIDIA Docker Plugin and GPU Access in Kubernetes
Liangxu Linux
Liangxu Linux
Jan 8, 2025 · Cloud Native

Enable NVIDIA GPU Access in Docker and Kubernetes with the NVIDIA Container Toolkit

This guide walks through checking system and software environments, installing and configuring the NVIDIA Docker plugin, verifying GPU access in Docker containers, deploying the NVIDIA device plugin on a Kubernetes cluster, creating GPU‑enabled pods, and troubleshooting common issues, all with concrete commands and configuration examples.

Container ToolkitGPUKubernetes
0 likes · 12 min read
Enable NVIDIA GPU Access in Docker and Kubernetes with the NVIDIA Container Toolkit
21CTO
21CTO
Jan 7, 2025 · Artificial Intelligence

Nvidia Reveals RTX 50 GPUs, Thor Auto Chip, and AI Supercomputer at CES 2025

At CES 2025, Nvidia CEO Jensen Huang announced the RTX 50 series GPUs built on the Blackwell architecture, the Thor automotive processor, the Project Digits personal AI supercomputer, new AI agents and robotics initiatives, detailing pricing, performance specs, and partnerships across automotive and AI ecosystems.

CES 2025GPUNvidia
0 likes · 10 min read
Nvidia Reveals RTX 50 GPUs, Thor Auto Chip, and AI Supercomputer at CES 2025
Architects' Tech Alliance
Architects' Tech Alliance
Jan 6, 2025 · Industry Insights

How Nvidia’s GB300 GPU Is Shaping AI Inference and Cloud Supply Chains

The article provides a detailed technical analysis of Nvidia’s new GB300 and B300 GPUs, comparing their performance, memory architecture, and power consumption to previous generations, and examines how these changes affect AI inference workloads, NVL72 accelerator systems, and the supply‑chain strategies of major cloud providers.

AI inferenceGPUNvidia
0 likes · 12 min read
How Nvidia’s GB300 GPU Is Shaping AI Inference and Cloud Supply Chains
Architects' Tech Alliance
Architects' Tech Alliance
Dec 10, 2024 · Industry Insights

Could Nvidia Face Up to $50 Billion in Chinese Antitrust Fines?

China’s market regulator has opened an antitrust investigation into Nvidia over alleged breaches of its 2020 Mellanox acquisition commitments, and analysts estimate that, based on the country’s Anti‑Monopoly Law, the company could be fined anywhere from $1 billion to as much as $50 billion, depending on the severity of the violation.

AntitrustChinaMarket analysis
0 likes · 6 min read
Could Nvidia Face Up to $50 Billion in Chinese Antitrust Fines?
DataFunSummit
DataFunSummit
Oct 2, 2024 · Artificial Intelligence

NVIDIA’s Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation

This article explains NVIDIA’s end‑to‑end stack for large language models, covering the NeMo Framework for data processing, training, and deployment, the open‑source TensorRT‑LLM inference accelerator, and the Retrieval‑Augmented Generation (RAG) technique that enriches model outputs with external knowledge.

NeMoNvidiaRAG
0 likes · 17 min read
NVIDIA’s Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation
Architects' Tech Alliance
Architects' Tech Alliance
Sep 25, 2024 · Fundamentals

NVIDIA Quantum‑2 InfiniBand Platform: Technical Overview, Q&A, and Deployment Guidance

This article explains the growing demand for high‑performance computing, introduces NVIDIA's Quantum‑2 InfiniBand platform with its high‑speed, low‑latency capabilities, provides a curated list of related technical articles, and offers an extensive Q&A covering compatibility, cabling, UFM, PCIe limits, and best‑practice deployment for AI and HPC workloads.

AIGPUInfiniBand
0 likes · 11 min read
NVIDIA Quantum‑2 InfiniBand Platform: Technical Overview, Q&A, and Deployment Guidance
21CTO
21CTO
Sep 11, 2024 · Artificial Intelligence

How Volvo’s Nvidia‑Powered Software Stack Will Redefine EV Costs

Volvo announced that its upcoming EX90 electric SUV will run on a unified software platform powered by Nvidia's Drive Orin AI chip, using megacasting manufacturing to cut costs while avoiding subscription‑based revenue models.

AI chipsMegacastingNvidia
0 likes · 3 min read
How Volvo’s Nvidia‑Powered Software Stack Will Redefine EV Costs
DataFunSummit
DataFunSummit
Sep 5, 2024 · Artificial Intelligence

NVIDIA’s End‑to‑End Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation

This article introduces NVIDIA’s comprehensive solutions for large language models, covering the NeMo Framework’s full‑stack development pipeline, the open‑source TensorRT‑LLM inference accelerator, and Retrieval‑Augmented Generation techniques, while detailing data preprocessing, distributed training, model fine‑tuning, deployment, and performance optimizations.

NeMo FrameworkNvidiaRetrieval Augmented Generation
0 likes · 16 min read
NVIDIA’s End‑to‑End Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation
Architects' Tech Alliance
Architects' Tech Alliance
Sep 3, 2024 · Industry Insights

How NVIDIA Grace Hopper Superchip Redefines HPC and AI Performance

The article provides an in‑depth technical overview of NVIDIA's Grace Hopper superchip, detailing its heterogeneous CPU‑GPU architecture, high‑bandwidth NVLink‑C2C interconnect, unified memory model, programming support, and system‑level scaling features that together deliver unprecedented performance for high‑performance computing and large‑scale AI workloads.

AIGrace HopperHPC
0 likes · 20 min read
How NVIDIA Grace Hopper Superchip Redefines HPC and AI Performance
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Aug 30, 2024 · Industry Insights

How GPU Virtualization Powers AI and Cloud Computing: Techniques, Challenges, and Future Directions

This article examines the rapid rise of GPU virtualization as a solution for efficient GPU resource utilization in AI, big data, and high‑performance computing, detailing its concepts, implementation methods across user, kernel, and hardware layers, Kubernetes integration, real‑world use cases, challenges, and emerging research trends.

Device PluginGPU virtualizationKubernetes
0 likes · 25 min read
How GPU Virtualization Powers AI and Cloud Computing: Techniques, Challenges, and Future Directions
Architects' Tech Alliance
Architects' Tech Alliance
Aug 29, 2024 · Industry Insights

How NVIDIA Builds 256‑GPU and 576‑GPU SuperPods with H100, GH200, and GB200 Interconnects

The article analyzes NVIDIA's DGX SuperPOD architectures across three GPU generations—H100, GH200, and GB200—detailing their NVLink/NVSwitch topologies, bandwidth calculations, scalability limits, and the practical challenges of constructing 256‑GPU and 576‑GPU supercomputing clusters.

Data centerGPUHigh‑performance computing
0 likes · 11 min read
How NVIDIA Builds 256‑GPU and 576‑GPU SuperPods with H100, GH200, and GB200 Interconnects
Architects' Tech Alliance
Architects' Tech Alliance
Jul 25, 2024 · Artificial Intelligence

NVIDIA H20 AI Chip Launch and the Rapid Growth of China's AI Chip Market

The article reviews NVIDIA's newly released H20 AI accelerator for China, compares its performance and pricing with domestic chips, outlines the expanding Chinese AI chip ecosystem—including Huawei, Cambricon, HaiGuang, Alibaba, ByteDance, and Baidu—while highlighting market size growth, multi‑chip heterogeneity strategies, and the strong demand forecast through 2024.

AI chipsAI computeChina
0 likes · 8 min read
NVIDIA H20 AI Chip Launch and the Rapid Growth of China's AI Chip Market
Open Source Linux
Open Source Linux
Jul 19, 2024 · Artificial Intelligence

How Much Is the PCB Inside an NVIDIA DGX A100 Worth? A Deep Dive

This article dissects the PCB composition of NVIDIA's DGX A100 AI server, detailing the GPU board, CPU motherboard, and auxiliary components to reveal their material area, cost breakdown, and overall value contribution in high‑performance computing systems.

AI serverDGX A100Nvidia
0 likes · 11 min read
How Much Is the PCB Inside an NVIDIA DGX A100 Worth? A Deep Dive
Architects' Tech Alliance
Architects' Tech Alliance
Jul 9, 2024 · Industry Insights

How Nvidia’s Accelerated GPU Roadmap Is Shaping AI‑Scale Networking

Nvidia plans to shorten its GPU generation cycle to one year, launching Blackwell Ultra in 2025, Rubin in 2026, and Rubin Ultra in 2027, while boosting token‑generation efficiency and introducing AI‑optimized Ethernet solutions like Spectrum‑X800, aiming to dominate large‑scale AI clusters and reshape the high‑performance networking market.

AIGPUNvidia
0 likes · 6 min read
How Nvidia’s Accelerated GPU Roadmap Is Shaping AI‑Scale Networking
Architects' Tech Alliance
Architects' Tech Alliance
Jun 16, 2024 · Industry Insights

How Nvidia’s Blackwell GPUs Aim to Slash AI Training Costs and Power

The article analyzes Nvidia’s historic advantage, the massive performance and energy efficiency gains from Pascal to Blackwell GPUs, the economics of training large language models like GPT‑4, and the detailed roadmap of upcoming GPU, memory, and interconnect technologies shaping the future of data‑center AI.

AIGPUNvidia
0 likes · 14 min read
How Nvidia’s Blackwell GPUs Aim to Slash AI Training Costs and Power
DevOps
DevOps
Jun 13, 2024 · R&D Management

Jensen Huang on Management Philosophy, Team Structure, and Innovation at NVIDIA

In this interview, NVIDIA founder Jensen Huang shares his management philosophy, emphasizing the value of tackling difficult tasks, maintaining a small yet empowered team, avoiding layoffs, fostering a zero‑market mindset, navigating the early challenges of CUDA, and leveraging AI to drive future innovation.

AICUDAInnovation
0 likes · 12 min read
Jensen Huang on Management Philosophy, Team Structure, and Innovation at NVIDIA
21CTO
21CTO
Jun 7, 2024 · Artificial Intelligence

Nvidia Beats Apple in Market Value: AI Chip Wars, New AMD Processors & More

This roundup highlights Nvidia surpassing Apple in market cap, AMD's next‑gen AI processors, Elon Musk shifting Nvidia chips to X, Microsoft’s latest layoffs and AI spending, Google’s new developer program, GitHub Actions Arm64 support, Ubuntu Core 24 for IoT, and the release of Zabbix 7.0.

AI hardwareNvidiacloud computing
0 likes · 12 min read
Nvidia Beats Apple in Market Value: AI Chip Wars, New AMD Processors & More
IT Services Circle
IT Services Circle
Jun 6, 2024 · Artificial Intelligence

Nvidia Unveils Blackwell GPU and AI Supercomputing Roadmap

Nvidia’s latest Blackwell GPU, presented by Jensen Huang, promises unprecedented performance and energy efficiency for large‑scale AI models, while the company also showcases accelerated computing, NVLink interconnects, AI‑optimized DGX servers, the NIM platform for rapid LLM deployment, and ambitious projects such as Earth‑2 digital twins and next‑generation embodied AI robots.

AIBlackwellGPU
0 likes · 18 min read
Nvidia Unveils Blackwell GPU and AI Supercomputing Roadmap
Architects' Tech Alliance
Architects' Tech Alliance
May 1, 2024 · Industry Insights

How NVIDIA’s Blackwell Platform Redefines AI Supercomputing Networks

The article examines NVIDIA’s Blackwell platform network architecture, detailing the fifth‑generation NVLink, sixth‑generation PCIe, 800 Gb/s InfiniBand and Ethernet adapters, the DGX B200 and GB200 configurations, new IB and Ethernet switches, and the implications of increased optical module demands for large‑scale AI clusters.

AI supercomputingBlackwellDGX
0 likes · 10 min read
How NVIDIA’s Blackwell Platform Redefines AI Supercomputing Networks
DataFunSummit
DataFunSummit
Apr 14, 2024 · Artificial Intelligence

TensorRT-LLM: NVIDIA’s Scalable LLM Inference Framework – Overview, Features, Workflow, Performance, and Future Directions

This article presents a comprehensive overview of NVIDIA’s TensorRT-LLM, detailing its product positioning as a scalable LLM inference solution, key features such as model support, low-precision and quantization techniques, parallelism strategies, the end-to-end usage workflow, performance highlights, future roadmap, and answers to common technical questions.

LLM inferenceNvidiaParallelism
0 likes · 13 min read
TensorRT-LLM: NVIDIA’s Scalable LLM Inference Framework – Overview, Features, Workflow, Performance, and Future Directions
Architects' Tech Alliance
Architects' Tech Alliance
Apr 2, 2024 · Artificial Intelligence

Evolution and Forecast of Nvidia NVLink, NVLink C2C, and B100/X100 GPU Architectures

The article analyses the historical evolution of Nvidia's NVLink and NVLink C2C interconnect technologies, compares them with PCIe, Ethernet and InfiniBand, and uses these trends to predict future AI‑chip architectures such as the B100 and X100 GPUs, highlighting design trade‑offs and packaging challenges.

AI ChipB100GPU architecture
0 likes · 15 min read
Evolution and Forecast of Nvidia NVLink, NVLink C2C, and B100/X100 GPU Architectures
Architects' Tech Alliance
Architects' Tech Alliance
Mar 30, 2024 · Industry Insights

How NVIDIA’s B200 GPU Redefines AI Compute and What It Means for the Chip Market

The article analyzes the latest AI‑compute announcements from NVIDIA, AMD and Intel—including NVIDIA’s B200 GPU with 20 petaFLOPS FP4 performance, AMD’s MI300/MI400 roadmap, and Intel’s Gaudi 3 and Falcon Shores—while examining pricing, launch timelines, supply‑chain capacity, and the shifting market share among major cloud providers.

AI computeAMDGPU
0 likes · 10 min read
How NVIDIA’s B200 GPU Redefines AI Compute and What It Means for the Chip Market
Sohu Tech Products
Sohu Tech Products
Mar 27, 2024 · Artificial Intelligence

NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions

NVIDIA’s comprehensive LLM ecosystem combines the full‑stack NeMo Framework for data curation, distributed training, fine‑tuning, inference acceleration with TensorRT‑LLM and Triton, plus Retrieval‑Augmented Generation and Guardrails, enabling efficient, low‑latency, knowledge‑grounded model deployment across clusters.

AI accelerationModel TrainingNeMo Framework
0 likes · 16 min read
NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions
Architects' Tech Alliance
Architects' Tech Alliance
Mar 26, 2024 · Artificial Intelligence

Analysis and Forecast of Nvidia AI Chip Roadmap: From H100 to X100

The article analyzes Nvidia's AI chip evolution, assumes consistent storage‑compute‑interconnect ratios and predictable process scaling, and projects the architectures of H200, B100 and X100, highlighting the limits of chiplet packaging and the critical role of low‑latency, high‑reliability interconnect technologies for future AI compute scaling.

AI chipsChipletFuture Predictions
0 likes · 12 min read
Analysis and Forecast of Nvidia AI Chip Roadmap: From H100 to X100
Architects' Tech Alliance
Architects' Tech Alliance
Mar 22, 2024 · Industry Insights

Can Groq’s LPU Outsmart Nvidia GPUs in AI Inference?

The article examines Groq’s new LPU AI chip, comparing its inference speed and architecture to Nvidia GPUs, discusses the company’s market positioning, recent CEO statements, and the broader AI‑hardware race, while questioning whether Groq can become the go‑to accelerator for startups by the end of 2024.

AI chipsAI hardwareGroq
0 likes · 9 min read
Can Groq’s LPU Outsmart Nvidia GPUs in AI Inference?
Architects' Tech Alliance
Architects' Tech Alliance
Mar 20, 2024 · Industry Insights

What Nvidia’s B100 and GB200 Reveal About the Future of AI GPUs

The GTC 2024 recap highlights Nvidia’s upcoming B100 and GB200 GPUs, their BlackWell architecture, performance breakthroughs, embodied‑intelligence initiatives, and the expanding AI application ecosystem across industries, offering a clear view of the next wave in accelerated computing.

AIB100Embodied Intelligence
0 likes · 7 min read
What Nvidia’s B100 and GB200 Reveal About the Future of AI GPUs
21CTO
21CTO
Mar 20, 2024 · Artificial Intelligence

Nvidia Unveils Blackwell GPU: A Quantum Leap for Generative AI

Nvidia introduced the Blackwell GPU architecture at GTC, highlighting six breakthrough technologies, a 4nm process, massive performance gains, and its integration into DGX SuperPOD systems that promise to accelerate generative AI, data processing, and high‑performance computing across industries.

AIBlackwellGPU
0 likes · 14 min read
Nvidia Unveils Blackwell GPU: A Quantum Leap for Generative AI
DataFunTalk
DataFunTalk
Mar 15, 2024 · Artificial Intelligence

NVIDIA’s NeMo Framework and TensorRT‑LLM: Full‑Stack Solutions for Large Language Models and Retrieval‑Augmented Generation

This article explains NVIDIA’s end‑to‑end ecosystem for large language models, covering the NeMo Framework’s data processing, distributed training, model fine‑tuning, inference acceleration with TensorRT‑LLM, deployment via Triton, and Retrieval‑Augmented Generation (RAG) techniques that enhance model reliability and performance.

AINeMoNvidia
0 likes · 16 min read
NVIDIA’s NeMo Framework and TensorRT‑LLM: Full‑Stack Solutions for Large Language Models and Retrieval‑Augmented Generation
Architects' Tech Alliance
Architects' Tech Alliance
Mar 12, 2024 · Industry Insights

What’s Nvidia’s 2024‑2025 AI Chip Roadmap? A Deep Dive into GPUs, CPUs, and Interconnects

The article analyzes Nvidia’s 2023 investor‑meeting roadmap, revealing an annual GPU release cadence with H200, B100 and X100 chips, a unified "One Architecture" strategy spanning x86 and ARM, accelerated interconnects like NVLink‑C2C, and competitive pressures shaping its AI ecosystem.

AI hardwareGPU roadmapIndustry analysis
0 likes · 20 min read
What’s Nvidia’s 2024‑2025 AI Chip Roadmap? A Deep Dive into GPUs, CPUs, and Interconnects
21CTO
21CTO
Mar 9, 2024 · Artificial Intelligence

Can AI Really Replace Programmers? A Critical Look at Jensen Huang’s Predictions

The article examines Jensen Huang’s claim that AI will make programming obsolete, discusses existing AI coding tools, highlights their limitations, and argues that human expertise in design, reasoning, and error‑checking remains essential for software development.

AICode GenerationNvidia
0 likes · 10 min read
Can AI Really Replace Programmers? A Critical Look at Jensen Huang’s Predictions
DataFunTalk
DataFunTalk
Jan 31, 2024 · Artificial Intelligence

Introduction to NVIDIA TensorRT-LLM Inference Framework

TensorRT-LLM is NVIDIA's scalable inference framework for large language models that combines TensorRT compilation, fast kernels, multi‑GPU parallelism, low‑precision quantization, and a PyTorch‑like API to deliver high‑performance LLM serving with extensive customization and future‑focused enhancements.

GPU AccelerationLLM inferenceNvidia
0 likes · 12 min read
Introduction to NVIDIA TensorRT-LLM Inference Framework
Architects' Tech Alliance
Architects' Tech Alliance
Jan 25, 2024 · Industry Insights

Why Chinese Tech Giants Are Dropping Nvidia GPUs for Domestic Chips

Amid tightening U.S. export controls, Chinese cloud providers like Tencent, Alibaba, Baidu and ByteDance are cutting orders for Nvidia's downgraded AI GPUs and turning to domestic alternatives, driven by regulatory uncertainty, reduced performance of special‑edition chips, and a desire for more stable supply chains.

AI chipsChinaDomestic alternatives
0 likes · 11 min read
Why Chinese Tech Giants Are Dropping Nvidia GPUs for Domestic Chips
DataFunTalk
DataFunTalk
Dec 23, 2023 · Artificial Intelligence

NVIDIA Merlin: Product Overview, Models, Distributed Embeddings, Hierarchical KV and Parameter Server

This article introduces NVIDIA's Merlin recommendation system suite, detailing its product overview, model and system libraries, TensorFlow Distributed Embedding plugin, hierarchical key‑value store, and hierarchical parameter server, while highlighting integration with NVTABULAR, Triton, and performance gains on GPU‑accelerated training and inference.

Distributed EmbeddingHierarchical KVMerlin
0 likes · 13 min read
NVIDIA Merlin: Product Overview, Models, Distributed Embeddings, Hierarchical KV and Parameter Server
21CTO
21CTO
Oct 20, 2023 · Artificial Intelligence

How New US AI Chip Export Ban Could Reshape China's AI Landscape

New U.S. export restrictions targeting high‑end AI GPUs such as Nvidia’s H800 and A800 aim to curb China’s access to advanced compute, potentially slowing its AI model development, affecting major chip makers and prompting Chinese firms to stockpile hardware or accelerate domestic chip efforts.

AI chipsAMDChina AI
0 likes · 10 min read
How New US AI Chip Export Ban Could Reshape China's AI Landscape
Baidu Geek Talk
Baidu Geek Talk
Aug 22, 2023 · Industry Insights

What Baidu’s First Commercial AI Competition Reveals About AIGC Trends

The article reviews Baidu's 2023 generative AI initiatives, details the inaugural Baidu Commercial AI Technology Innovation Competition co‑hosted with the China AI Society and NVIDIA, highlights winning teams' technical approaches in conversion prediction and inference optimization, and shares insights from industry leaders on future AI talent and innovation.

AIAIGCBaidu
0 likes · 8 min read
What Baidu’s First Commercial AI Competition Reveals About AIGC Trends
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Apr 17, 2023 · Artificial Intelligence

How NVIDIA’s GPU‑Powered AI is Revolutionizing Drug Discovery and Genomics

The article outlines NVIDIA’s CLARA platform, BioNeMo framework, and GPU‑accelerated tools such as CLARA Parabricks and RAPIDS, demonstrating how AI and high‑performance computing dramatically speed up drug‑target identification, molecular generation, protein structure prediction, and high‑throughput DNA/RNA sequencing, with benchmarks showing up to 80‑fold acceleration.

AI drug discoveryBioNeMoCLARA
0 likes · 11 min read
How NVIDIA’s GPU‑Powered AI is Revolutionizing Drug Discovery and Genomics
DataFunTalk
DataFunTalk
Apr 1, 2023 · Artificial Intelligence

Nvidia Meets OpenAI: Highlights from the GTC Fireside Chat on GPT‑4, Deep Learning History, and the Future of AI

In a GTC fireside chat, Nvidia CEO Jensen Huang and OpenAI co‑founder Ilya Sutskever discuss GPT‑4's multimodal advances, the evolution of deep learning from early neural networks to large‑scale models, the pivotal role of GPUs and datasets like ImageNet, and their vision for more reliable, scalable artificial intelligence.

Deep LearningGPT-4Neural Networks
0 likes · 10 min read
Nvidia Meets OpenAI: Highlights from the GTC Fireside Chat on GPT‑4, Deep Learning History, and the Future of AI
DataFunSummit
DataFunSummit
Feb 15, 2023 · Artificial Intelligence

ChatGPT Boom Fuels Surge in AI Chip Demand, Boosting Nvidia, Samsung, and SK Hynix

The explosive growth of ChatGPT and other AI chatbots is driving unprecedented demand for high‑performance AI chips and high‑bandwidth memory, positioning Nvidia as the primary beneficiary while also creating significant market opportunities for Samsung, SK Hynix, and other semiconductor manufacturers.

AI chipsAI hardwareChatGPT
0 likes · 11 min read
ChatGPT Boom Fuels Surge in AI Chip Demand, Boosting Nvidia, Samsung, and SK Hynix
Open Source Linux
Open Source Linux
Dec 1, 2022 · Fundamentals

How NVIDIA Boosted Software Safety by Switching from C to SPARK

NVIDIA’s security team adopted the formally verified SPARK language, replacing C in safety‑critical components, and after a successful proof‑of‑concept demonstrated improved security, verification efficiency, and unchanged performance, leading to widespread internal adoption across many products.

AdaCoreC to SPARK migrationNvidia
0 likes · 4 min read
How NVIDIA Boosted Software Safety by Switching from C to SPARK
DataFunTalk
DataFunTalk
Oct 31, 2022 · Artificial Intelligence

NVIDIA Merlin HugeCTR: System Overview, Architecture, and Performance

This article introduces NVIDIA Merlin's HugeCTR recommendation system framework, covering its three main modules—NV Tabular, HugeCTR, and Triton—detailing model‑parallel embedding handling, CUDA kernel fusion, mixed‑precision training, hierarchical parameter server inference, Sparse Operation Kit for TensorFlow, performance benchmarks, and practical deployment considerations.

EmbeddingGPU AccelerationHugeCTR
0 likes · 19 min read
NVIDIA Merlin HugeCTR: System Overview, Architecture, and Performance
Architects' Tech Alliance
Architects' Tech Alliance
Aug 18, 2022 · Cloud Computing

NVIDIA BlueField DPU Series: Architecture, Features, and Ecosystem Overview

The article provides a comprehensive overview of NVIDIA's BlueField DPU series—including BlueField‑2,‑3, and‑4—detailing their high‑performance architecture, network, security, and storage capabilities, as well as the DOCA development ecosystem that enables programmable acceleration for modern cloud data‑center workloads.

BlueFieldDPUData center
0 likes · 12 min read
NVIDIA BlueField DPU Series: Architecture, Features, and Ecosystem Overview
Architects' Tech Alliance
Architects' Tech Alliance
May 23, 2022 · Industry Insights

GPU Wars in the Data Center: How Nvidia, AMD, and Intel Compete for AI and HPC Dominance

The article examines how GPUs have evolved from gaming accelerators to essential data‑center processors for AI, HPC, and scientific workloads, and compares the latest server‑grade offerings from Nvidia, AMD, and Intel—including performance specs, memory technologies, interconnects, and software ecosystems—highlighting the fierce competition shaping the future of compute.

AIAMDData center
0 likes · 12 min read
GPU Wars in the Data Center: How Nvidia, AMD, and Intel Compete for AI and HPC Dominance
21CTO
21CTO
May 13, 2022 · Fundamentals

Why Nvidia’s Open‑Source GPU Driver Could Transform Linux and AI Development

Nvidia’s release of the open‑source R515 GPU driver for Linux, supporting data‑center and consumer GPUs under a dual GPL/MIT license, marks a pivotal shift that eases integration for AI/ML developers, gamers, and cloud users while fostering community‑driven improvements to driver quality and security.

AI/MLAmpereGPU Driver
0 likes · 7 min read
Why Nvidia’s Open‑Source GPU Driver Could Transform Linux and AI Development
Architects' Tech Alliance
Architects' Tech Alliance
May 4, 2022 · Industry Insights

What the Next‑Gen Nvidia and AMD GPUs Could Mean for the 2022‑2023 Market

Based on recent leaks from 3DCenter.org and Twitter insiders Kopite7kimi and 暴龙兽55, the article forecasts Nvidia's Lovelace RTX 4000 series and AMD's RDNA 3 Navi 33/32 GPUs to launch between September 2022 and early 2023, analyzes their expected specifications, pricing dynamics, and potential market impact, and notes Intel's upcoming Arc cards as a wildcard.

AMDGPULovelace
0 likes · 7 min read
What the Next‑Gen Nvidia and AMD GPUs Could Mean for the 2022‑2023 Market
IT Services Circle
IT Services Circle
Mar 24, 2022 · Artificial Intelligence

NVIDIA Unveils H100 GPU with Hopper Architecture: Massive Performance Gains for AI

At the recent GTC event, NVIDIA introduced the H100 GPU built on the Hopper architecture using TSMC 4nm process, featuring 800 billion transistors, 16,896 CUDA cores, up to 700 W power, 3 TB/s memory bandwidth, and a specialized Transformer engine that accelerates large‑model training up to six times faster, alongside the Grace CPU Superchip and new AI supercomputing systems.

AIGPUGrace CPU
0 likes · 11 min read
NVIDIA Unveils H100 GPU with Hopper Architecture: Massive Performance Gains for AI
Architects' Tech Alliance
Architects' Tech Alliance
Dec 14, 2021 · Industry Insights

Why NVIDIA’s BlueField DPU Is Redefining Data‑Center Architecture

The article provides an in‑depth analysis of NVIDIA’s BlueField DPU series—detailing the roadmap from BlueField‑2 to BlueField‑4, the technical capabilities of BlueField‑3 across networking, security, and storage, and the DOCA ecosystem that enables programmable, hardware‑accelerated data‑center services, positioning DPUs as a core pillar of modern cloud infrastructure.

BlueFieldDPUData center
0 likes · 14 min read
Why NVIDIA’s BlueField DPU Is Redefining Data‑Center Architecture
Liangxu Linux
Liangxu Linux
Aug 17, 2021 · Cloud Native

How to Enable GPU Acceleration in Docker on Linux

This guide walks you through installing NVIDIA drivers, CUDA, and nvidia-docker2 on a Linux host, configuring Docker to access the GPU, and verifying the setup with commands and sample TensorFlow/PyTorch code, enabling deep‑learning workloads inside containers.

CUDADeep LearningDocker
0 likes · 7 min read
How to Enable GPU Acceleration in Docker on Linux
DataFunTalk
DataFunTalk
Jun 13, 2021 · Artificial Intelligence

GPU Virtual Sharing for AI Inference Services on Kubernetes

The article presents a GPU virtual‑sharing solution for AI inference workloads that isolates memory and compute resources via CUDA API interception, integrates with Kubernetes using the open‑source aliyun‑gpushare scheduler, and demonstrates doubled GPU utilization and minimal performance loss across multiple tests.

CUDAGPU virtualizationKubernetes
0 likes · 16 min read
GPU Virtual Sharing for AI Inference Services on Kubernetes
Architects' Tech Alliance
Architects' Tech Alliance
Mar 20, 2021 · Fundamentals

Evolution of NVIDIA GPU Architectures from Fermi to Ampere

This article outlines the progression of NVIDIA GPU architectures—from the early Fermi and Kepler designs through Maxwell, Pascal, Volta, Turing, and the latest Ampere—detailing compute capabilities, SM structures, FP64/FP32 ratios, Tensor Core introductions, and their impact on AI and high‑performance computing.

AICUDAGPU architecture
0 likes · 19 min read
Evolution of NVIDIA GPU Architectures from Fermi to Ampere
Architects' Tech Alliance
Architects' Tech Alliance
Mar 15, 2021 · Artificial Intelligence

Evolution of NVIDIA GPU Architectures from Fermi to Ampere

This article provides a comprehensive overview of NVIDIA's GPU architecture evolution—covering Fermi, Kepler, Maxwell, Pascal, Volta, Turing, and Ampere—detailing compute capabilities, SM structures, specialized units such as Tensor Cores, and their impact on AI and high‑performance computing workloads.

AICUDAGPU
0 likes · 19 min read
Evolution of NVIDIA GPU Architectures from Fermi to Ampere
JD Cloud Developers
JD Cloud Developers
Feb 8, 2021 · Artificial Intelligence

This Week’s Must‑Read Tech & AI Highlights: From Digital Currency to Cutting‑Edge Research

The developer community weekly roundup covers a digital RMB lottery on JD, AI data‑annotation market trends, Google Drive’s enterprise‑personal merge, Windows 10 cloud configuration, NVIDIA’s breakthrough in real‑time SDF rendering and A100 performance, plus new research on intent discovery and EEG‑based emotion recognition.

AINvidiacloud computing
0 likes · 8 min read
This Week’s Must‑Read Tech & AI Highlights: From Digital Currency to Cutting‑Edge Research
Architects' Tech Alliance
Architects' Tech Alliance
Dec 30, 2020 · Artificial Intelligence

Understanding GPUs, AI Accelerators, and Market Trends

The article explains GPU evolution, its integration with CPUs, interconnect technologies like PCIe and NVLink, market shares of NVIDIA, AMD and Intel, AI accelerator types (GPU, FPGA, ASIC), and the roles of training and inference in cloud AI, while also promoting a paid 182‑page PPT resource.

AI acceleratorGPUHPC
0 likes · 7 min read
Understanding GPUs, AI Accelerators, and Market Trends
Programmer DD
Programmer DD
Dec 17, 2020 · Artificial Intelligence

Can Huang’s Law Double AI Performance Every Two Years? NVIDIA GTC 2020 Insights

At NVIDIA’s GTC China 2020, chief scientist Bill Dally highlighted the “Huang’s Law” predicting GPU-driven AI performance to double biennially, introduced projects like MAGNet, optical interconnects, and the Legate programming model, and discussed the broader implications for AI ecosystem development and industry adoption.

AI PerformanceGPUHuang's Law
0 likes · 8 min read
Can Huang’s Law Double AI Performance Every Two Years? NVIDIA GTC 2020 Insights
Programmer DD
Programmer DD
Dec 6, 2020 · Cloud Native

Enable GPU Support in Kubernetes with Containerd and NVIDIA Runtime

This guide walks through installing NVIDIA drivers, CUDA toolkit, nvidia-container-runtime, configuring Containerd, deploying the NVIDIA device plugin, and testing GPU access inside Kubernetes pods, providing a complete solution for GPU workloads on containerd‑based clusters.

CUDADevice PluginsGPU
0 likes · 11 min read
Enable GPU Support in Kubernetes with Containerd and NVIDIA Runtime
Architects' Tech Alliance
Architects' Tech Alliance
Oct 28, 2020 · Artificial Intelligence

Understanding NVIDIA NVLink: Architecture, Features, and Applications

The article introduces NVIDIA’s third‑generation NVLink technology, detailing its high‑bandwidth GPU‑GPU and GPU‑CPU interconnect, key architectural breakthroughs such as the Ampere‑based A100 GPU, multi‑instance GPU, and NVSwitch, and discusses its impact on AI, HPC, and graphics workloads.

GPU interconnectHigh-performance computingNVLink
0 likes · 7 min read
Understanding NVIDIA NVLink: Architecture, Features, and Applications
Programmer DD
Programmer DD
Sep 17, 2020 · Artificial Intelligence

Why Nvidia’s $40B Arm Acquisition Could Redefine AI Computing

Nvidia’s $40 billion purchase of Arm from SoftBank merges its AI chip expertise with Arm’s vast processor ecosystem, promising new AI research centers, open licensing, and a strategic edge against rivals while reshaping the future of computing hardware.

AIARMNvidia
0 likes · 8 min read
Why Nvidia’s $40B Arm Acquisition Could Redefine AI Computing
21CTO
21CTO
May 16, 2020 · R&D Management

How Nvidia’s Chief Scientist Built a $400 Open‑Source Ventilator to Fight COVID‑19

Bill Dally, Nvidia’s chief scientist, designed a low‑cost, open‑source mechanical ventilator using off‑the‑shelf components that can be assembled in minutes for about $400, aiming to alleviate COVID‑19 ventilator shortages, while collaborating with experts across AI, robotics, and medical fields.

Bill DallyCOVID-19Nvidia
0 likes · 6 min read
How Nvidia’s Chief Scientist Built a $400 Open‑Source Ventilator to Fight COVID‑19
Architects' Tech Alliance
Architects' Tech Alliance
Dec 28, 2019 · Artificial Intelligence

Understanding CPU vs GPU, GPU Parameters, and NVIDIA Architectures for AI and High‑Performance Computing

The article explains how CPUs and GPUs differ in architecture and workload handling, details key GPU specifications such as CUDA cores, memory bandwidth and floating‑point precision, reviews NVIDIA's product families and architectural evolution, and highlights the role of GPUs in deep learning training and inference while also mentioning a related technical ebook promotion.

AICPUCUDA
0 likes · 13 min read
Understanding CPU vs GPU, GPU Parameters, and NVIDIA Architectures for AI and High‑Performance Computing