O'Reilly logo

High Performance Parallelism Pearls Volume One by James Jeffers, James Reinders

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 10

N-Body Methods

Rio Yokota; Mustafa AbdulJabbar    King Abdullah University of Science and Technology, Saudi Arabia

Abstract

N-Body methods have an arithmetic intensity that is even higher than DGEMM and can extract the full potential of architectures such as used on the Intel Xeon Phi coprocessor. The direct N-body kernel that I describe in my chapter is the key component of the fast multipole method (FMM). FMM is recently attracting a lot of attention because it has a unique combination of optimal arithmetic complexity of O(N), while having high arithmetic intensity, and an optimal communication complexity of O(log P), while having high asynchronicity. FMM can be used instead of fast Fourier transform or sparse matrix solvers in ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required