
High Performance Linux Clusters with OSCAR, Rocks, OpenMosix, and MPI

Author: Joseph D Sloan
ISBN: 9780596005702

by Joseph D Sloan

November 2004

Intermediate to advanced

368 pages

10h 24m

English

O'Reilly Media, Inc.


A Note Regarding Supplemental Files
Preface
Audience
Organization
Conventions
How to Contact Us
Using Code Examples
Acknowledgments
I. An Introduction to Clusters
1. Cluster Architecture
1.1. Modern Computing and the Role of Clusters
1.1.1. Uniprocessor Computers
1.1.2. Multiple Processors
1.1.2.1. Centralized multiprocessors
1.1.2.2. Multicomputers
1.1.2.3. Cluster structure
1.2. Types of Clusters
1.3. Distributed Computing and Clusters
1.4. Limitations
1.4.1. Amdahl’s Law
1.5. My Biases
2. Cluster Planning
2.1. Design Steps
2.2. Determining Your Cluster’s Mission
2.2.1. What Is Your User Base?
2.2.2. How Heavily Will the Cluster Be Used?
2.2.3. What Kinds of Software Will You Run on the Cluster?
2.2.4. How Much Control Do You Need?
2.2.5. Will This Be a Dedicated or Shared Cluster?
2.2.6. What Resources Do You Have?
2.2.7. How Will Cluster Access Be Managed?
2.2.8. What Is the Extent of Your Cluster?
2.2.9. What Security Concerns Do You Have?
2.3. Architecture and Cluster Software
2.3.1. System Software
2.3.2. Programming Software
2.3.3. Control and Management
2.4. Cluster Kits
2.5. CD-ROM-Based Clusters
2.5.1. BCCD
2.6. Benchmarks
3. Cluster Hardware
3.1. Design Decisions
3.1.1. Node Hardware
3.1.1.1. CPUs and motherboards
3.1.1.2. Memory and disks
3.1.1.3. Monitors, keyboards, and mice
3.1.1.4. Adapters, power supplies, and cases
3.1.2. Cluster Head and Servers
3.1.3. Cluster Network
3.2. Environment
3.2.1. Cluster Layout
3.2.2. Power and Air Conditioning
3.2.2.1. Power
3.2.2.2. HVAC
3.2.3. Physical Security
4. Linux for Clusters
4.1. Installing Linux
4.1.1. Selecting a Distribution
4.1.2. Downloading Linux
4.1.3. What to Install?
4.2. Configuring Services
4.2.1. DHCP
4.2.2. NFS
4.2.2.1. Running NFS
4.2.2.2. Automount
4.2.3. Other Cluster File System
4.2.4. SSH
4.2.4.1. Using SSH
4.2.5. Other Services and Configuration Tasks
4.2.5.1. Apache
4.2.5.2. Network Time Protocol (NTP)
4.2.5.3. Virtual Network Computing (VNC)
4.2.5.4. Multicasting
4.2.5.5. Hosts file and name services
4.3. Cluster Security
II. Getting Started Quickly
5. openMosix
5.1. What Is openMosix?
5.2. How openMosix Works
5.3. Selecting an Installation Approach
5.4. Installing a Precompiled Kernel
5.4.1. Downloading
5.4.2. Installing
5.4.3. Configuration Changes
5.5. Using openMosix
5.5.1. User Tools
5.5.1.1. mps and mtop
5.5.1.2. migrate
5.5.1.3. mosctl
5.5.1.4. mosmon
5.5.1.5. mosrun
5.5.1.6. setpe
5.5.2. openMosixView
5.5.3. Testing openMosix
5.6. Recompiling the Kernel
5.7. Is openMosix Right for You?
6. OSCAR
6.1. Why OSCAR?
6.2. What’s in OSCAR
6.3. Installing OSCAR
6.3.1. Prerequisites
6.3.2. Network Configuration
6.3.3. Loading Software on Your Server
6.3.4. A Basic OSCAR Installation
6.3.4.1. Step 0: Downloading additional packages
6.3.4.2. Step 1: Package selection
6.3.4.3. Step 2: Configuring packages
6.3.4.4. Step 3: Installing server software
6.3.4.5. Step 4: Building a client image
6.3.4.6. Step 5: Defining clients
6.3.4.7. Step 6: Setting up the network
6.3.4.8. Step 7: Completing the setup
6.3.4.9. Step 8: Testing
6.3.5. Custom Installations
6.3.6. Changes OSCAR Makes
6.3.7. Making Changes
6.4. Security and OSCAR
6.4.1. pfilter
6.4.2. SSH and OPIUM
6.5. Using switcher
6.6. Using LAM/MPI with OSCAR

7. Rocks
7.1. Installing Rocks
7.1.1. Prerequisites
7.1.2. Downloading Rocks
7.1.3. Installing the Frontend
7.1.4. Install Compute Nodes
7.1.5. Customizing the Frontend
7.1.5.1. User management with 411
7.1.5.2. X Window System
7.1.6. Customizing Compute Nodes
7.1.6.1. Adding packages
7.1.6.2. Changing disk partitions
7.1.6.3. Other changes
7.2. Managing Rocks
7.3. Using MPICH with Rocks
III. Building Custom Clusters
8. Cloning Systems
8.1. Configuring Systems
8.1.1. Distributing Files
8.1.1.1. Pushing files with rsync
8.2. Automating Installations
8.2.1. Kickstart
8.2.1.1. Configuration file
8.2.1.2. Using Kickstart
8.2.2. g4u
8.2.3. SystemImager
8.2.3.1. Image server setup
8.2.3.2. Golden client setup
8.2.3.3. Retrieving the image
8.2.3.4. Cloning the systems
8.2.3.5. Other tasks
8.3. Notes for OSCAR and Rocks Users
9. Programming Software
9.1. Programming Languages
9.2. Selecting a Library
9.3. LAM/MPI
9.3.1. Installing LAM/MPI
9.3.2. User Configuration
9.3.3. Using LAM/MPI
9.3.4. Testing the Installation
9.4. MPICH
9.4.1. Installing
9.4.2. User Configuration
9.4.3. Using MPICH
9.4.4. Testing the Installation
9.4.5. MPE
9.5. Other Programming Software
9.5.1. Debuggers
9.5.2. HDF5
9.5.3. SPRNG
9.6. Notes for OSCAR Users
9.6.1. Adding MPE
9.7. Notes for Rocks Users
10. Management Software
10.1. C3
10.1.1. Installing C3
10.1.2. Using C3 Commands
10.1.2.1. cexec
10.1.2.2. cget
10.1.2.3. ckill
10.1.2.4. cpush
10.1.2.5. crm
10.1.2.6. cshutdown
10.1.2.7. clist, cname, and cnum
10.1.2.8. Further examples and comments
10.2. Ganglia
10.2.1. Installing and Using Ganglia
10.2.1.1. RRDTool
10.2.1.2. Apache and PHP
10.2.1.3. Ganglia monitor core
10.2.1.4. Web frontend
10.3. Notes for OSCAR and Rocks Users
11. Scheduling Software
11.1. OpenPBS
11.1.1. Architecture
11.1.2. Installing OpenPBS
11.1.3. Configuring PBS
11.1.4. Managing PBS
11.1.5. Using PBS
11.1.6. PBS’s GUI
11.1.7. Maui Scheduler
11.2. Notes for OSCAR and Rocks Users
12. Parallel Filesystems
12.1. PVFS
12.1.1. Installing PVFS on the Head Node
12.1.2. Configuring the Metadata Server
12.1.3. I/O Server Setup
12.1.4. Client Setup
12.1.5. Running PVFS
12.1.5.1. Troubleshooting
12.2. Using PVFS
12.3. Notes for OSCAR and Rocks Users
IV. Cluster Programming
13. Getting Started with MPI
13.1. MPI
13.1.1. Core MPI
13.1.1.1. MPI_Init
13.1.1.2. MPI_Finalize
13.1.1.3. MPI_Comm_size
13.1.1.4. MPI_Comm_rank
13.1.1.5. MPI_Get_processor_name
13.2. A Simple Problem
13.2.1. Background
13.2.2. Single-Processor Program
13.3. An MPI Solution
13.3.1. A C Solution
13.3.2. Transferring Data
13.3.2.1. MPI_Send
13.3.2.2. MPI_Recv
13.3.3. MPI Using FORTRAN
13.3.4. MPI Using C++
13.4. I/O with MPI
13.5. Broadcast Communications
13.5.1. Broadcast Functions
13.5.1.1. MPI_Bcast
13.5.1.2. MPI_Reduce
14. Additional MPI Features
14.1. More on Point-to-Point Communication
14.1.1. Non-Blocking Communication
14.1.1.1. MPI_Isend and MPI_Irecv
14.1.1.2. MPI_Wait
14.1.1.3. MPI_Test
14.1.1.4. MPI_Iprobe
14.1.1.5. MPI_Cancel
14.1.1.6. MPI_Sendrecv and MPI_Sendrecv_replace
14.2. More on Collective Communication
14.2.1. Gather and Scatter
14.2.1.1. MPI_Gather
14.2.1.2. MPI_Scatter
14.3. Managing Communicators
14.3.1. Communicator Commands
14.3.1.1. MPI_Comm_group
14.3.1.2. MPI_Group_incl and MPI_Group_excl
14.3.1.3. MPI_Comm_create
14.3.1.4. MPI_Comm_free and MPI_Group_free
14.3.1.5. MPI_Comm_split
14.4. Packaging Data
14.4.1. User-Defined Types
14.4.1.1. MPI_Type_struct
14.4.1.2. MPI_Type_commit
14.4.2. Packing Data
14.4.2.1. MPI_Pack
14.4.2.2. MPI_Unpack
15. Designing Parallel Programs
15.1. Overview
15.2. Problem Decomposition
15.2.1. Decomposition Strategies
15.2.1.1. Data decomposition
15.2.1.2. Control decomposition
15.3. Mapping Tasks to Processors
15.3.1. Communication Overhead
15.3.2. Load Balancing
15.4. Other Considerations
15.4.1. Parallel I/O
15.4.2. MPI-IO Functions
15.4.2.1. MPI_File_open
15.4.2.2. MPI_File_seek
15.4.2.3. MPI_File_read
15.4.2.4. MPI_File_close
15.4.3. Random Numbers
16. Debugging Parallel Programs
16.1. Debugging and Parallel Programs
16.2. Avoiding Problems
16.3. Programming Tools
16.4. Rereading Code
16.5. Tracing with printf
16.6. Symbolic Debuggers
16.6.1. gdb
16.6.2. ddd
16.7. Using gdb and ddd with MPI
16.8. Notes for OSCAR and Rocks Users
17. Profiling Parallel Programs
17.1. Why Profile?
17.2. Writing and Optimizing Code
17.3. Timing Complete Programs
17.4. Timing C Code Segments
17.4.1. Manual Timing with MPI
17.4.2. MPI Functions
17.4.2.1. MPI_Wtime
17.4.2.2. MPI_Wtick
17.4.2.3. MPI_Barrier
17.4.3. PMPI
17.5. Profilers
17.5.1. gprof
17.5.2. gcov
17.5.3. Profiling Parallel Programs with gprof and gcov
17.6. MPE
17.6.1. Using MPE
17.7. Customized MPE Logging
17.8. Notes for OSCAR and Rocks Users
V. Appendix
A. References
A.1. Books
A.2. URLs
A.2.1. General Cluster Information
A.2.2. Linux
A.2.3. Cluster Software
A.2.4. Grid Computing and Tools
A.2.5. Cloning and Management Software
A.2.6. Filesystems
A.2.7. Parallel Benchmarks
A.2.8. Programming Software
A.2.9. Scheduling Software
A.2.10. System Software and Utilities
About the Author
Colophon
Copyright

Content preview from High Performance Linux Clusters with OSCAR, Rocks, OpenMosix, and MPI

Chapter 12. Parallel Filesystems

If you are certain that your cluster will only be used for computationally intensive tasks that involve very little interaction with the filesystem, you can safely skip this chapter. But increasingly, tasks that are computationally expensive also involve a large amount of I/O, frequently accessing either large data sets or large databases. If this is true for at least some of your cluster’s applications, you need to ensure that the I/O subsystem you are using can keep up. For these applications to perform well, you will need a high-performance filesystem.

Selecting a filesystem for a cluster is a balancing act. There are a number of different characteristics that can be used to compare filesystems, including robustness, failure recovery, journaling, enhanced security, and reduced latency. With clusters, however, it often comes down to a trade-off between convenience and performance. From the perspective of convenience, the filesystem should be transparent to users, with files readily available across the cluster. From the perspective of performance, data should be available to the processor that needs it as quickly as possible. Getting the most from a high-performance filesystem often means programming with the filesystem in mind—typically a very “inconvenient” task. The good news is that you are not limited to a single filesystem.

The Network File System (NFS) was introduced in Chapter 4. NFS is strong on convenience. With NFS, you will recall, ...


Publisher Resources

ISBN: 0596005709
