Rio Yokota; Mustafa AbdulJabbar, King Abdullah University of Science and Technology, Saudi Arabia
N-body methods have an arithmetic intensity that is even higher than DGEMM and can extract the full potential of architectures such as the Intel Xeon Phi coprocessor. The direct N-body kernel that I describe in my chapter is the key component of the fast multipole method (FMM). FMM has recently attracted a lot of attention because it combines an optimal arithmetic complexity of O(N) with high arithmetic intensity, and an optimal communication complexity of O(log P) with high asynchronicity. FMM can be used instead of fast Fourier transform or sparse matrix solvers in ...
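To make the idea of a direct N-body kernel concrete, here is a minimal sketch in plain Python. It is not the chapter's implementation, which targets vectorized code on the Xeon Phi. The function name `direct_potential`, the softening parameter `eps2`, and the choice of the 1/r (Laplace) kernel are assumptions made for illustration only.

```python
import math


def direct_potential(positions, masses, eps2=0.0):
    """Sum the 1/r potential at every point from all other points.

    Hypothetical illustration: O(N^2) direct summation over all pairs.
    `positions` is a list of (x, y, z) tuples, `masses` a list of floats,
    and `eps2` an optional softening term added to the squared distance.
    """
    n = len(positions)
    phi = [0.0] * n
    for i in range(n):
        xi, yi, zi = positions[i]
        acc = 0.0
        for j in range(n):
            if i == j:
                continue  # skip the self-interaction
            xj, yj, zj = positions[j]
            dx = xi - xj
            dy = yi - yj
            dz = zi - zj
            r2 = dx * dx + dy * dy + dz * dz + eps2
            acc += masses[j] / math.sqrt(r2)
        phi[i] = acc
    return phi


# Two unit masses separated by a distance of 2 along the x axis.
phi = direct_potential([(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)], [1.0, 1.0])
print(phi)
```

Each source point that is loaded is reused for every target point, so the number of floating-point operations grows as N² while the data volume grows only as N. That reuse is the source of the high arithmetic intensity mentioned above. Note that with `eps2=0.0`, two distinct points at the same location would cause a division by zero in this sketch.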
Get High Performance Parallelism Pearls Volume One now with O’Reilly online learning.