AbstractIn this chapter you will6.1 GPU programming6.2 CUDA’S programming model: Threads, blocks, and grids6.3 CUDA’S execution model: Streaming multiprocessors and warps6.4 CUDA compilation process6.5 Putting together a CUDA project6.6 Memory hierarchy6.7 Optimization techniques6.8 Dynamic parallelism6.9 Debugging CUDA programs6.10 Profiling CUDA programs6.11 CUDA and MPI