Configuring CapacityScheduler

Hadoop CapacityScheduler is a pluggable MapReduce job scheduler. Its goal is to maximize cluster utilization by sharing the cluster among multiple users. CapacityScheduler uses queues to guarantee each user a minimum share of the cluster. It is secure, elastic, and operable, and it supports job priorities. This recipe outlines the steps to configure CapacityScheduler for a Hadoop cluster.

Getting ready

We assume that our Hadoop cluster has been properly configured and all the daemons are running without any issues.

Log in to the master node from the cluster administrator machine using the following command:

ssh hduser@master
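
Once logged in, the assumption that the daemons are running can be checked before changing any scheduler settings. The following is a minimal sketch, not part of the original recipe: the helper name missing_daemons is hypothetical, and the daemon list (NameNode, SecondaryNameNode, and JobTracker) assumes a Hadoop 1.x master node that runs all three. It reads the output of the JDK's jps tool and prints any expected daemon that is absent:

```shell
#!/bin/sh
# missing_daemons (hypothetical helper): reads jps-style output on stdin,
# one "<pid> <ClassName>" pair per line, and prints each expected daemon
# that does not appear. The daemon list below is an assumption for a
# Hadoop 1.x master node; adjust it to match your cluster layout.
missing_daemons() {
    awk -v want="NameNode SecondaryNameNode JobTracker" '
        # Record every process name seen in the second column.
        { seen[$2] = 1 }
        END {
            n = split(want, names, " ")
            for (i = 1; i <= n; i++)
                if (!(names[i] in seen)) print names[i]
        }'
}

# Usage on the master node (requires the JDK jps tool on the PATH):
#   jps | missing_daemons
```

Process names are compared exactly against the second column, so a running SecondaryNameNode is not mistaken for NameNode. If the command prints nothing, all the expected daemons were found.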

How to do it...

Configure CapacityScheduler with the following steps:

  1. Configure ...
