Skip to Main Content
Designing Scientific Applications on GPUs
book

Designing Scientific Applications on GPUs

by Raphael Couturier
November 2013
Intermediate to advanced content levelIntermediate to advanced
498 pages
17h 6m
English
Chapman and Hall/CRC
Content preview from Designing Scientific Applications on GPUs
110 Designing Scientific Applications on GPUs
2 - Sync. CPU GPU
data transfer
2 - Async. GPU parallel
computation
1 - Synchronous
internode
CPU comms
2 - Sync. GPU CPU
data transfer
.......
waiting
MPI
comms
GPU
thread
exec.
CPU GPU
Multithreaded CPU program
CPU main thread
next instructions
1 - thread
comm
2 - thread
comput
CPU thread creation
CPU thread
synchonization barrier
FIGURE 7.2. Overlap of internode CPU communications with a sequence of
CPU/GPU data transfers and GPU computations.
insures an implicit synchronization of all operations involving the same
GPU stream, like the default stream in this example. The transfer of the
results has to wait un
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Introduction to Numerical Analysis and Scientific Computing

Introduction to Numerical Analysis and Scientific Computing

Nabil Nassif, Dolly Khuwayri Fayyad
Computational Electromagnetism

Computational Electromagnetism

Alain Bossavit, Isaak D. Mayergoyz

Publisher Resources

ISBN: 9781466571648