O'Reilly logo

Hadoop Operations and Cluster Management Cookbook by Shumin Guo

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 4. Managing a Hadoop Cluster

In this chapter, we will cover:

  • Managing the HDFS cluster
  • Configuring SecondaryNameNode
  • Managing the MapReduce cluster
  • Managing TaskTracker
  • Decommissioning DataNode
  • Replacing a slave node
  • Managing MapReduce jobs
  • Checking job history from the web UI
  • Importing data to HDFS
  • Manipulating files on HDFS
  • Configuring the HDFS quota
  • Configuring CapacityScheduler
  • Configuring Fair Scheduler
  • Configuring Hadoop daemon logging
  • Configuring Hadoop audit logging
  • Upgrading Hadoop

Introduction

From the perspective of functionality, a Hadoop cluster is composed of an HDFS cluster and a MapReduce cluster. The HDFS cluster consists of the default filesystem for Hadoop. It has one or more NameNodes to keep track of the filesystem metadata, while ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required