Tuning OpenACC loop execution
Saber Feki*; Malek Smaoui † * KAUST Supercomputing Laboratory, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia † Computer, Electrical, Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Abstract
The purpose of this chapter is to help OpenACC developer who is already familiar with the basic and essential directives to further improve his code performance by adding more descriptive clauses to OpenACC loop constructs.
At the end of this chapter the reader will:
• Have a better understanding of the purpose of the OpenACC loop construct and its associated clauses illustrated with use cases
•
Get Parallel Programming with OpenACC now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.