Fast Matrix Computations on Heterogeneous Streams
Gaurav Bansal; Chris J. Newburn; Paul Besl Intel Corporation, USA
Abstract
This chapter examines the hStreams library which supports programming of a heterogeneous system by abstracting programming to be akin to feeding a system with streams of actions (computations, data transfers, and synchronizations). Use of the library is illustrated with some problems from the field of linear algebra, endeavoring to show the speed and flexibility of hStreams on both processors and coprocessors. Performance results highlight four key benefits of using hStreams: (1) concurrency of computes across nodes and within a node, (2) pipelined concurrency among data transfers, and between data transfers ...
Get High Performance Parallelism Pearls Volume Two now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.