Hadoop 2.x Administration Cookbook by Gurmukh Singh

Chapter 4. High Availability

In this chapter, we will cover the following recipes:

  • Namenode HA using shared storage
  • ZooKeeper configuration
  • Namenode HA using Journal node
  • Resourcemanager HA using ZooKeeper
  • Rolling upgrade in HA
  • Configuring shared cache manager
  • Configuring HDFS cache
  • HDFS snapshots
  • Configuring storage-based policies
  • Configuring HA for Edge nodes

Introduction

In this chapter, we will configure high availability for the Namenode and the Resourcemanager, since each of them is a single point of failure. We will also walk through the various options for configuring high availability, along with cache configuration, storage policies, and snapshots for backup.

Note

In this chapter, HA is considered from the Hadoop perspective only; there is no HA at the Linux level. The Hadoop cluster ...
