James P. Briggs*; Simon J. Pennycook†; James R. Fergusson*; Juha Jäykkä*; Edward P. Shellard** University of Cambridge, UK† Intel Corporation, UK
This chapter discusses the steps taken to optimize and modernize Modal, a cosmological statistical analysis code for studying the formation of the early universe developed by theoretical physicists at the University of Cambridge. In order to achieve higher levels of performance and to reduce the memory footprint, the optimization work included introducing nested parallelism. The chapter explored the different nested parallelism approaches available in OpenMP, discussing the strengths and weaknesses of each ...
Get High Performance Parallelism Pearls Volume Two now with O’Reilly online learning.
O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.