Chapter 4. High Availability

In this chapter, we will cover the following recipes:

  • Namenode HA using shared storage
  • ZooKeeper configuration
  • Namenode HA using Journal node
  • Resourcemanager HA using ZooKeeper
  • Rolling upgrade in HA
  • Configuring shared cache manager
  • Configuring HDFS cache
  • HDFS snapshots
  • Configuring storage-based policies
  • Configuring HA for Edge nodes

Introduction

In this chapter, we will configure high availability for the Namenode and the Resourcemanager, because each of them is a single point of failure. We will also walk through the available options for configuring high availability, along with cache configuration, storage policies, and snapshots for backup.

Note

In this chapter, HA is considered from the Hadoop perspective only; there is no HA at the Linux level. The Hadoop cluster ...
