Book description
This IBM® Redbooks® publication documents and addresses topics to provide step-by-step programming concepts to tune the applications to use IBM POWER8® hardware architecture with the technical computing software stack. This publication explores, tests, and documents how to implement an IBM high-performance computing (HPC) solution on POWER8 by using IBM technical innovations to help solve challenging scientific, technical, and business problems.
This book demonstrates and documents that the combination of IBM HPC hardware and software solutions delivers significant value to technical computing clients in need of cost-effective, highly scalable, and robust solutions.
This book targets technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for delivering cost-effective HPC solutions that help uncover insights among clients' data so that they can act to optimize business results, product development, and scientific discoveries.
Table of contents
- Front cover
- Notices
- IBM Redbooks promotions
- Preface
- Chapter 1. Introduction
- Chapter 2. Planning for your high-performance computing environment
-
Chapter 3. Software deployment and configuration
- 3.1 Hardware, firmware, and the software stack
- 3.2 OPAL firmware and the ASM interface
- 3.3 Intelligent Platform Management Interface (IPMI)
-
3.4 xCAT overview
- 3.4.1 xCAT cluster: Nodes and networks
- 3.4.2 xCAT database: Tables and objects
- 3.4.3 xCAT node booting
- 3.4.4 xCAT node discovery
- 3.4.5 xCAT FSP discovery
- 3.4.6 xCAT operating system installation types: Disks and state
- 3.4.7 xCAT network adapters: Primary and secondary or additional
- 3.4.8 xCAT Software Kits
- 3.4.9 xCAT version
- 3.4.10 xCAT cluster scenario: Networks, IP addresses, and hostnames
- 3.5 xCAT management node
- 3.6 xCAT node discovery
-
3.7 xCAT compute nodes: Operating system
- 3.7.1 Set node attributes for operating system network installation
- 3.7.2 Set the root password for nodes
- 3.7.3 Create the operating system image object definitions
- 3.7.4 Download the netboot files
- 3.7.5 Create a package repository for installer modules that are compatible with the netboot files
- 3.7.6 Set the operating system image of the nodes
- 3.7.7 Start the operating system provisioning
-
3.8 xCAT compute nodes: Software stack
- 3.8.1 Configure nodes for access to the Internet
- 3.8.2 Check for errors in package updates
- 3.8.3 Mellanox InfiniBand
- 3.8.4 XL C/C++ compiler
- 3.8.5 XL Fortran compiler
- 3.8.6 Parallel Environment Runtime Edition (PE RTE)
- 3.8.7 Parallel Environment Developer Edition (PE DE)
- 3.8.8 Engineering and Scientific Subroutine Library (ESSL)
- 3.8.9 Parallel Engineering and Scientific Subroutine Library (PESSL)
- 3.8.10 SDK, Java Technology Edition (IBM Java)
- 3.8.11 IBM Platform Load Sharing Facility (LSF)
- 3.8.12 IBM Spectrum Scale (formerly GPFS)
-
3.9 Software and firmware updates
- 3.9.1 Software updates: Operating system packages
- 3.9.2 Software updates: Other packages
- 3.9.3 Software updates: Package updates in Ubuntu server
- 3.9.4 Software updates: Problems in package updates
- 3.9.5 Software updates: Kernel package updates
- 3.9.6 Software updates: Early package updates
- 3.9.7 Firmware updates
- 3.10 System tuning
- Chapter 4. Cluster monitoring
- Chapter 5. Application development
- Chapter 6. Running applications
- Chapter 7. Tuning and debugging applications
- Chapter 8. NVIDIA CUDA on IBM POWER8
- Appendix A. Problem determination
- Appendix B. Useful commands
- Appendix C. IBM Tivoli Workload Scheduler LoadLeveler to IBM Platform Load Sharing Facility migration
- Appendix D. Applications and performance
- Related publications
- Back cover
Product information
- Title: Implementing an IBM High-Performance Computing Solution on IBM POWER8
- Author(s):
- Release date: September 2015
- Publisher(s): IBM Redbooks
- ISBN: 9780738440934
You might also like
book
Highly Efficient Data Access with RoCE on IBM Elastic Storage Systems and IBM Spectrum Scale
With Remote Direct Memory Access (RDMA), you can make a subset of a host's memory directly …
book
IBM High-Performance Computing Insights with IBM Power System AC922 Clustered Solution
This IBM® Redbooks® publication documents and addresses topics to set up a complete infrastructure environment and …
article
Reinventing the Organization for GenAI and LLMs
Previous technology breakthroughs did not upend organizational structure, but generative AI and LLMs will. We now …
book
Solaris™ 7 Reference
2004H-5 Thoroughly cross-referenced and packed with comprehensive examples for administrators and programmers Easy-to-understand explanations of UNIX …