
386 Designing Scientific Applications on GPUs
[Figure: I/O time and sector evaluation time, in seconds (0 to 2.5), plotted against sector number (0 to 200).]
FIGURE 16.10. Compute and I/O times for the GPU V3 on one C1060.
However, this evaluation time grows linearly with the sector number and rapidly
exceeds the I/O time.
It is therefore worthwhile to use a double-buffering technique to overlap the
I/O time with the evaluation time: the evaluation of sector n is performed (on
the GPU) while the data for sector n + 1 are being read (on the CPU). This
requires duplicating, in CPU memory, all the data structures used for storing
the data read from the I/O files ...
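
The double-buffering scheme described above can be sketched as follows. This is a minimal illustration and not the implementation used in this chapter: read_sector and evaluate_sector are hypothetical stand-ins for the file reading done on the CPU and for the sector evaluation done on the GPU, and a single background thread plays the role of the I/O side. The two buffers are the objects named current (sector n, being evaluated) and pending (sector n + 1, being read).

```python
from concurrent.futures import ThreadPoolExecutor


def read_sector(n):
    """Hypothetical stand-in for reading the data of sector n from the I/O files."""
    return [n * 10 + k for k in range(4)]


def evaluate_sector(data):
    """Hypothetical stand-in for the evaluation of one sector on the GPU."""
    return sum(data)


def process_sectors(num_sectors, read=read_sector, evaluate=evaluate_sector):
    """Evaluate sector n while sector n + 1 is read into a second buffer."""
    results = []
    if num_sectors <= 0:
        return results
    with ThreadPoolExecutor(max_workers=1) as io_worker:
        # Fill the first buffer before entering the loop.
        pending = io_worker.submit(read, 0)
        for n in range(num_sectors):
            # Wait for the buffer holding sector n to be complete.
            current = pending.result()
            if n + 1 < num_sectors:
                # Start reading sector n + 1 into the other buffer.
                pending = io_worker.submit(read, n + 1)
            # The evaluation of sector n overlaps with the read started above.
            results.append(evaluate(current))
    return results


print(process_sectors(3))  # prints [6, 46, 86]
```

With this structure, the cost of each iteration tends toward the larger of the two times (reading or evaluation) instead of their sum, provided that the read and the evaluation can really proceed concurrently. In a CUDA code, the evaluation would typically be an asynchronous kernel launch, and the two buffers would be the duplicated CPU-side data structures.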