Chapter 11. Troubleshooting, Diagnostics, and Best Practices

In this chapter, we will cover the following recipes:

  • NameNode troubleshooting
  • DataNode troubleshooting
  • ResourceManager troubleshooting
  • Diagnose communication issues
  • Parse logs for errors
  • Hive troubleshooting
  • HBase troubleshooting
  • Hadoop best practices

Introduction

In this chapter, we will look at best practices and troubleshooting techniques for various components of Hadoop. The same techniques can be applied to troubleshoot other services or applications.

Because Hadoop is a distributed system that operates at large scale, troubleshooting it can become cumbersome. In production, most administrators use log management and parsing tools such as Splunk, combined with Ganglia, Nagios, or other tools for monitoring ...
