Chapter 8

Dissecting OpenCL on a heterogeneous system

Abstract

This chapter shows how OpenCL maps to a system with an x86-based FX-8350 central processing unit and a discrete R9 290X graphics processing unit. It also discusses memory performance considerations for global and local memory.

Keywords

FX-8350

R9 290X

heterogeneous system

local data share

bank conflicts

resource allocation

In Chapter 2, we discussed the trade-offs among different architectures, many of which support the execution of OpenCL programs. OpenCL is designed so that its models map well to a wide range of architectures, allowing for tuning and acceleration of kernel code. In this chapter, we discuss OpenCL’s mapping to a real system in the form of a high-end ...
