March 2026
Intermediate
534 pages
12h 51m
English
This part introduces the core principles of GPU acceleration and how parallelization drives performance, as well as exploring its practical limits through theory and profiling. We will guide you through setting up a CUDA development environment locally or in the cloud, then dive into CUDA kernel development using Numba-CUDA. Finally, we will cover how to profile and debug CUDA code. By the end, you'll have the foundational knowledge to write, execute, profile, and debug CUDA code in Python, which you can use to solve practical problems.
This part of the book includes the following chapters:
Read now
Unlock full access