Skip to Content
C++ High Performance
book

C++ High Performance

by Viktor Sehr, Björn Andrist
January 2018
Intermediate to advanced
374 pages
9h 53m
English
Packt Publishing
Content preview from C++ High Performance

Parallelizing for two dimensions

Now, let's use the filters by applying them to the images. The arguments for the Boost Compute kernel are set using set_arg before execution, and when the execution is performed using enqueue_nd_range_kernel(), we apply the number of dimensions and the ranges of each dimension, which is the equivalent of how a double for-loop is used in the C++ code. The corresponding x and y variables in the kernel are then fetched using get_global_id() in OpenCL.

Take notice of the similarities between STL algorithms and the Boost Compute equivalents as shown in the table below:

Box filter on CPU

Box filter on GPU

auto box_filter_test_cpu( int w, int h, int r) { using array_t = std::array<size_t,2>; // Create ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

C++ High Performance - Second Edition

C++ High Performance - Second Edition

Björn Andrist, Viktor Sehr
Advanced C++

Advanced C++

Gazihan Alankus, Olena Lizina, Rakesh Mane, Vivek Nagarajan, Brian Price
C++ In a Nutshell

C++ In a Nutshell

Ray Lischner
C++ Cookbook

C++ Cookbook

D. Ryan Stephens, Christopher Diggins, Jonathan Turkanis, Jeff Cogswell

Publisher Resources

ISBN: 9781787120952Supplemental Content