Skip to Main Content
Designing Scientific Applications on GPUs
book

Designing Scientific Applications on GPUs

by Raphael Couturier
November 2013
Intermediate to advanced content levelIntermediate to advanced
498 pages
17h 6m
English
Chapman and Hall/CRC
Content preview from Designing Scientific Applications on GPUs
Development methodologies for GPU and cluster of GPUs 115
tion on line 49. This synchronization is not mandatory, but it will make the
implementation more robust and will facilitate the debugging steps: all GPU
computations run by the OpenMP thread number 1 will be achieved before
this thread enters a new loop iteration, or before the computation loop has
ended.
If a partial result has to be transferred from GPU to CPU memory at
the end of each loop iteration (for example, the result of one reduction per
iteration), this transfer is achieved synchronously on the default stream (no
particular stream is specified) on lines 51–54. Availability of the result val-
ues is ensured by the synchronization implemented on line 49. However, if a
partial result
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Introduction to Numerical Analysis and Scientific Computing

Introduction to Numerical Analysis and Scientific Computing

Nabil Nassif, Dolly Khuwayri Fayyad
Computational Electromagnetism

Computational Electromagnetism

Alain Bossavit, Isaak D. Mayergoyz

Publisher Resources

ISBN: 9781466571648