Chapter 10. Cluster Planning

In this chapter, we will cover the following recipes:

  • Disk space calculations
  • Nodes needed in the cluster
  • Memory requirements
  • Sizing the cluster as per SLA
  • Network design
  • Estimating the cost of the Hadoop cluster
  • Hardware and software options

Introduction

In this chapter, we will look at cluster planning and some of the important aspects of cluster utilization.

Although this is a recipe book, it is good to have an understanding on the Hadoop cluster layout, network components, operating system, disk arrangements, and memory. We will try to cover some of the fundamental concepts on cluster planning and a few formulas to estimate the cluster size.

Let's say we are ready with our big data initiative and want to take the plunge into ...

Get Hadoop 2.x Administration Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.