It is known that increasing the number of cores in a multicore central processing unit (CPU) to thousands is not possible due to architectural and power limitations, while a graphics processing unit (GPU) can easily exploit thousands of simple and efficient cores each executing a single thread of instructions.

The enormous appetite for both computation and bandwidth of graphic applications has led to the emergence of GPUs as the dominant massively parallel architecture. Today, general-purpose GPUs (GPGPU) with floating-point computation capability are widely used for general purpose computations in most of the top machines with hybrid CPU-GPU architectures. A recent product of such GPGPUs, NVIDIA Pascal, has 3840 cores.

The main problem ...

Get Advances in GPU Research and Practice now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.