Designing Scientific Applications on GPUs

by Raphael Couturier
November 2013
Content level: Intermediate to advanced
498 pages
17h 6m
English
Chapman and Hall/CRC
Content preview from Designing Scientific Applications on GPUs
Implementing an efficient convolution operation on GPU
FIGURE 5.3. Organization of the data prefetching stage for a 5 × 5 mask
and a thread block size of 8 × 4. The threads in the two top corners of the
block (top figure) are marked with either a circle or a star symbol. The image
tile loaded into shared memory comprises the pixels to be updated by the
threads of the block together with a 2-pixel-wide halo. In the image tile, the
circle and star symbols mark the pixels that the corresponding thread loads
into a single shared memory vector.
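
The tile geometry described in the caption can be checked with a small host-side sketch. It is written in Python rather than CUDA so the index arithmetic can be run directly. The names (`tile_pixels_for_thread`, `global_coords`), the one-pixel-per-thread assumption, the strided thread-to-pixel mapping, and the clamped border policy are illustrative assumptions. They are not taken from the book, whose actual mapping is the one drawn in Figure 5.3.

```python
# Hypothetical sketch: all names and the strided mapping are illustrative assumptions.
BLOCK_W, BLOCK_H = 8, 4        # thread block size given in the caption
MASK = 5                       # 5 x 5 convolution mask
HALO = MASK // 2               # 2-pixel halo on every side of the block

# Assumes each thread updates exactly one pixel.
TILE_W = BLOCK_W + 2 * HALO    # 12 pixels wide
TILE_H = BLOCK_H + 2 * HALO    # 8 pixels high


def tile_pixels_for_thread(tx, ty):
    """Tile coordinates (x, y) loaded by thread (tx, ty) under an assumed strided load."""
    n_threads = BLOCK_W * BLOCK_H
    tid = ty * BLOCK_W + tx    # linear thread index inside the block
    return [(i % TILE_W, i // TILE_W)
            for i in range(tid, TILE_W * TILE_H, n_threads)]


def global_coords(block_x, block_y, tile_xy, width, height):
    """Map a tile coordinate to an image coordinate, clamped at the borders (assumed policy)."""
    x, y = tile_xy
    gx = block_x * BLOCK_W + x - HALO
    gy = block_y * BLOCK_H + y - HALO
    return min(max(gx, 0), width - 1), min(max(gy, 0), height - 1)


if __name__ == "__main__":
    print("tile:", TILE_W, "x", TILE_H)
    print("top-left thread loads: ", tile_pixels_for_thread(0, 0))
    print("top-right thread loads:", tile_pixels_for_thread(BLOCK_W - 1, 0))
```

Under these assumptions the tile holds 12 × 8 = 96 pixels for 32 threads, so each thread fetches three pixels before the block synchronizes and the convolution proper begins.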
                 Mask size
Image size       3 × 3    5 × 5    7 × 7    9 × 9    11 × 11    13 × 13
512 × 512        1394     1176     907      670      567        477
1024 × 1024      ...

Publisher Resources

ISBN: 9781466571648