
178 Designing Scientific Applications on GPUs
By carefully analyzing each of the scenarios of data placement on the
memory hierarchies of the GPU, the recommendation is to put in the shared
memory the Johnson’s and the processing time matrices (JM and PTM) if
they fit in together. Otherwise, the whole or a part of the Johnson’s matrix
has to be given in priority in the shared memory. The other data structures
are mapped to the global memory.
8.10 Conclusion and future work
In this chapter, we have revisited the design of parallel B&B algorithms
on GPU accelerators to allow highly efficient solving of permutation-based
COPs. To do so, our contributions consisted ...