Chapter 6

Finite Difference Method


This case study discusses how a finite-difference code using a large one-dimensional stencil is optimized on a three-dimensional mesh. Achieving good performance requires maximizing data reuse through use of shared memory. Attention is paid to coalescing global memory accesses for derivatives in all directions and using constant memory effectively.


Constant memory; Coalescing; Nonuniform mesh; 1D stencil; Instruction-level parallelism

In many fields of science and engineering, the governing system of equations takes the form of either ordinary or partial differential equations. One method of solving these equations is to use finite differences, where the continuous analytical derivatives are approximated ...

Get CUDA Fortran for Scientists and Engineers now with the O’Reilly learning platform.

O’Reilly members experience live online training, plus books, videos, and digital content from nearly 200 publishers.