Skip to Content
The CUDA Handbook: A Comprehensive Guide to GPU Programming
book

The CUDA Handbook: A Comprehensive Guide to GPU Programming

by Nicholas Wilt
June 2013
Intermediate to advanced
528 pages
13h 11m
English
Addison-Wesley Professional
Content preview from The CUDA Handbook: A Comprehensive Guide to GPU Programming

Chapter 9. Multiple GPUs

This chapter describes CUDA’s facilities for multi-GPU programming, including threading models, peer-to-peer, and inter-GPU synchronization. As an example, we’ll first explore inter-GPU synchronization using CUDA streams and events by implementing a peer-to-peer memcpy that stages through portable pinned memory. We then discuss how to implement the N-body problem (fully described in Chapter 14) with single- and multithreaded implementations that use multiple GPUs.

9.1. Overview

Systems with multiple GPUs generally contain multi-GPU boards with a PCI Express bridge chip (such as the GeForce GTX 690) or multiple PCI Express slots, or both, as described in Section 2.3. Each GPU in such a system is separated by PCI Express ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

CUDA Programming

CUDA Programming

Shane Cook
Professional CUDA C Programming

Professional CUDA C Programming

John Cheng, Max Grossman, Ty McKercher

Publisher Resources

ISBN: 9780133261516Purchase book