O'Reilly logo

Hadoop 2.x Administration Cookbook by Gurmukh Singh

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 3. Maintaining Hadoop Cluster – YARN and MapReduce

In this chapter, we will cover the following recipes:

  • Running a simple MapReduce program
  • Hadoop streaming
  • Configuring YARN history server
  • Job history web interface and metrics
  • Configuring ResourceManager components
  • YARN containers resource allocations
  • ResourceManager Web UI and JMX metrics
  • Preserving ResourceManager states

Introduction

In the previous chapters, we learned about the storage layer HDFS, how to configure it, and what are its different components. We mainly talked about Namenode, Datanode, and its concepts.

In this chapter, we will take a look at the processing layer which is MapReduce and the resource management framework YARN. Prior to Hadoop 2.x, MapReduce was the only processing ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required