C++ High Performance

by Viktor Sehr, Björn Andrist
January 2018
Intermediate to advanced
374 pages
9h 53m
English
Packt Publishing

Implementing the transform-reduction algorithm on the GPU

When implementing the actual transformation, we need to copy the data back and forth between the CPU and the GPU. Data structures stored on the GPU are prefixed with gpu_, and data structures stored on the CPU are prefixed with cpu_.

Note that Boost Compute provides a compute::plus<float> functor, the equivalent of std::plus, which we use when the areas are reduced:

namespace bc = boost::compute;
auto circle_areas_gpu(bc::context& context, bc::command_queue& q) {
  // Create a bunch of random circles and copy to the GPU
  const auto n = 1024;
  auto cpu_circles = make_circles(n);
  auto gpu_circles = bc::vector<Circle>(n, context);
  bc::copy(cpu_circles.begin(), cpu_circles.end(), gpu_circles.begin(), ...
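
For comparison, the same transform-then-reduce pattern can be sketched on the CPU with the standard library alone. This is a minimal sketch, not the book's GPU code: the Circle struct with fields x, y, and r, the circle_area helper, and the circle_areas_cpu name are all assumptions made for illustration, and std::transform_reduce requires C++17.

```cpp
#include <cassert>
#include <cmath>
#include <functional>
#include <numeric>
#include <vector>

// Hypothetical Circle type; the book's actual definition is not shown
// in this excerpt.
struct Circle { float x, y, r; };

// The "transform" step: map one circle to its area.
inline float circle_area(const Circle& c) {
  const float pi = 3.14159265f;
  return pi * c.r * c.r;
}

// The "reduce" step: sum the areas with std::plus, the CPU counterpart
// of the compute::plus<float> functor mentioned in the text.
inline float circle_areas_cpu(const std::vector<Circle>& circles) {
  return std::transform_reduce(circles.begin(), circles.end(), 0.0f,
                               std::plus<float>{}, circle_area);
}
```

Calling circle_areas_cpu on a vector of circles returns the sum of their areas, with no host-to-device copy involved. A CPU version like this could serve as a reference for checking the result of a GPU implementation.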
ISBN: 9781787120952