Scalable Out-of-Core Solvers on a Cluster
Eduardo D’Azevedo*; Ki Sing Chan†; Shi-Quan Su‡; Kwai Wong‡ * Oak Ridge National Laboratory, United States† Chinese University of Hong Kong, Hong Kong‡ University of Tennessee, United States
Abstract
This chapters documents the implementation of a parallel distributed memory out-of-core (OOC) solver for performing LU and Cholesky factorizations of a large dense matrix on clusters equipped with Intel® Xeon Phi™ coprocessors. The OOC solver takes advantage of NVIDIA graphics processing units (GPU) or Intel Xeon Phi coprocessor (MIC) and allows problems larger than device memory to be solved. The OOC solver is built to be compatible with the format of the ScaLAPACK software library, making ...
Get High Performance Parallelism Pearls Volume One now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.