16 File operations for a parallel world

This chapter covers

  • Modifying a parallel application for standard file operations
  • Writing out data using parallel file operations with MPI-IO and HDF5
  • Tuning parallel file operations for different parallel filesystems

Filesystems create a streamlined workflow of retrieving, storing, and updating data. For any computing work, the product is the output, whether it be data, graphics, or statistics. This includes final results but also intermediate output for graphics, checkpointing, and analysis. Checkpointing is a special need on large HPC systems with long-running calculations that might span days, weeks, or months.

Definition Checkpointing is the practice of periodically storing the state of a calculation ...

Get Parallel and High Performance Computing now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.