Parallel Programming with OpenACC by Rob Farber

Chapter 7

Multidevice programming with OpenACC

Jiri Kraus, NVIDIA GmbH, Würselen, Germany

Abstract

This chapter explains how to program multiple OpenACC devices to work cooperatively on a single problem.

At the end of this chapter the reader will have a basic understanding of:

 How to program multidevice systems or accelerated clusters with OpenACC using a single host thread, OpenMP, or MPI

 How to coordinate the work of multiple devices using a domain decomposition strategy

 How to use the async clause to overlap computation and MPI communication

 How to use the NVIDIA® tools for MPI+OpenACC applications

Keywords

Multidevice programming; OpenACC; Domain decomposition; GPU; CUDA-aware MPI; Debugging; ...
