Skip to Main Content
Programming Massively Parallel Processors, 3rd Edition
book

Programming Massively Parallel Processors, 3rd Edition

by David B. Kirk, Wen-mei W. Hwu
November 2016
Intermediate to advanced content levelIntermediate to advanced
576 pages
18h 22m
English
Morgan Kaufmann
Content preview from Programming Massively Parallel Processors, 3rd Edition
Chapter 18

Programming a heterogeneous computing cluster

Isaac Gelado and Javier Cabezas

Abstract

This chapter introduces joint MPI/CUDA programming. It presents a sufficient number of basic MPI concepts for the reader to understand a simple MPI/CUDA program. It then focuses on the practical use of pinned memory and asynchronous data transfers to enable overlapping computation with communication. The chapter ends with an overview of how CUDA-aware MPI systems help simplify the code and improve efficiency.

Keywords

Message passing interface; message passing; communication; overlapping communication with computation; asynchronous; domain partition; collective; point-to-point communication; pinned memory; CUDA streams; barrier

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Programming Massively Parallel Processors, 4th Edition

Programming Massively Parallel Processors, 4th Edition

Wen-mei W. Hwu, David B. Kirk, Izzat El Hajj
Engineering a Compiler, 2nd Edition

Engineering a Compiler, 2nd Edition

Keith D. Cooper, Linda Torczon
Algorithms, 4th Edition

Algorithms, 4th Edition

Robert Sedgewick, Kevin Wayne

Publisher Resources

ISBN: 9780128119877