Portable Performance with OpenCL
Simon McIntosh-Smith*; Tim Mattson† * University of Bristol, United Kingdom† Intel, USA
Abstract
This is an Intel Xeon Phi coprocessor Gem because we show the potential for using the OpenCL standard parallel programming language to deliver portable performance on Intel Xeon Phi coprocessors, Xeon processors, and many-core devices such as GPUs from multiple vendors. This portable performance can be delivered from a single program without needing multiple versions of the code, an advantage of OpenCL over most other approaches available today. As proof of OpenCL’s ability to deliver performance portability, we describe results from the BUDE molecular docking code, which sustains over 30% of peak floating-point ...
Get High Performance Parallelism Pearls Volume One now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.