November 2018
Intermediate to advanced
310 pages
7h 54m
English
We will now look at how to perform a general matrix-matrix multiplication (GEMM) with CuBLAS. We will actually try to make something a little more utilitarian than the last few examples we saw in cuBLAS—we will use this as a performance metric for our GPU to determine the number of Floating Point Operations Per Second (FLOPS) it can perform, which will be two separate values: the case of single precision, and that of double precision. Using GEMM is a standard technique for evaluating the performance of computing hardware in FLOPS, as it gives a much better understanding of sheer computational power than using pure clock speed in MHz or GHz.
Read now
Unlock full access