May 2016
Intermediate to advanced
662 pages
20h 17m
English
Presents performance optimization methods for N-body simulations on Knights Landing, covering the optimization of arithmetic expressions, data structures, thread parallelism, and memory traffic. Results demonstrate that an N-body simulation previously optimized for parallelism on Knights Corner achieves most of the performance improvement available on Knights Landing without any code adaptation: only a recompilation is necessary to take advantage of AVX-512.
N-body; Strength reduction; Alignment; Vectorization; SoA; AoS; Unit-stride; OpenMP; Threads; Affinity; Tiling; MCDRAM