C++ High Performance

by Viktor Sehr, Björn Andrist

January 2018

Intermediate to advanced

374 pages

9h 53m

English

Packt Publishing

Implementing the transform-reduction algorithm on the GPU

When implementing the actual transformation, we need to copy the data back and forth between the CPU and the GPU. Data structures stored on the GPU are prefixed with gpu_, and data structures stored on the CPU are prefixed with cpu_.

Note that Boost Compute conveniently provides a compute::plus<float> functor, the equivalent of std::plus, which we use when reducing the areas:

namespace bc = boost::compute;
auto circle_areas_gpu(bc::context& context, bc::command_queue& q) {
  // Create a bunch of random circles and copy to the GPU
  const auto n = 1024;
  auto cpu_circles = make_circles(n);
  auto gpu_circles = bc::vector<Circle>(n, context);
  bc::copy(cpu_circles.begin(), cpu_circles.end(), gpu_circles.begin(), ...

ISBN: 9781787120952