We need to discuss our role as the concert master for our parallel programs. The proper orchestration of a parallel program is a thing of beauty—code running full speed without waiting for data, because we have arranged for all data to arrive and depart at the proper times. Code well-decomposed ...
8. Scheduling Kernels and Data Movement
Get Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.