Chapter 17. Profiling Parallel Programs

Since the raison d'être for a cluster is higher performance, it stands to reason that if you really need a cluster, writing efficient code should be important to you. The key to improving the efficiency of your code is knowing where your code spends its time. Thus, the astute cluster user will want to master code profiling. This chapter provides an introduction to profiling in general, to the problems you’ll face with parallel programs, and to some of the tools you can use.

We’ll begin by looking briefly at issues that impact program efficiency. Next, we’ll look at ways you can time programs (and parts of programs) using readily available tools and the special features of MPI. Finally, we’ll look at the MPE library, a library that extends MPI and is particularly useful for profiling program performance. Where appropriate, we’ll look first at techniques typically used with serial programs to put the techniques in context, and then at extending them to parallel programs.

Why Profile?

You have probably heard it before: the typical program spends over 90% of its execution time in less than 10% of the actual code. This is just a rule of thumb or heuristic, and as such, it will be wildly inaccurate or totally irrelevant for some programs. But for many, if not most, programs it is a reasonable observation. The actual numbers don't matter, since they will change from program to program. It is the idea that is important: for most programs, most of the execution time is concentrated in a small fraction of the code.
