Tagged articles

warp scheduling

2 articles · Page 1 of 1
Linux Kernel Journey
Linux Kernel Journey
Dec 7, 2025 · Fundamentals

CUDA Optimization Basics: Understanding GPU Architecture and Warp Scheduling

This article explains the fundamentals of CUDA performance tuning, covering GPU architectures from Kepler to Volta, the role of SMX, warp schedulers, registers and memory hierarchies, and provides practical guidance on launch configuration, latency hiding, and thread‑block sizing to maximize throughput.

CUDAGPU architecturememory latency
0 likes · 21 min read
CUDA Optimization Basics: Understanding GPU Architecture and Warp Scheduling