Skip to Main Content
GPU Computing Gems Emerald Edition
book

GPU Computing Gems Emerald Edition

by Wen-mei W. Hwu
January 2011
Intermediate to advanced content levelIntermediate to advanced
886 pages
28h 35m
English
Morgan Kaufmann
Content preview from GPU Computing Gems Emerald Edition
Chapter 39. Large-Scale Fast Fourier Transform
Yifeng Chen, Xiang Cui and Hong Mei
Bandwidth-intensive tasks such as large-scale fast Fourier transfers (FFTs) without data locality are hard to accelerate on GPU clusters because the bottleneck often lies with the PCI bus or the communication network. Optimizing FFT for a single-GPU device will not improve the overall performance. This chapter shows how to achieve substantial speedups for these tasks. Three GPU-related factors contribute to better performance: first, the use of GPU devices improves the sustained memory bandwidth for processing large-size data; second, GPU device memory allows larger subtasks to be processed in whole and hence reduces repeated data transfers between memory and processors; ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

GPU Gems 3

GPU Gems 3

Hubert Nguyen
OpenGL – Build high performance graphics

OpenGL – Build high performance graphics

Muhammad Mobeen Movania, David Wolff, Raymond C. H. Lo, William C. Y. Lo

Publisher Resources

ISBN: 9780123849885