Chapter 17
Administering Your Hadoop Cluster
Seeing why having a well-running Hadoop cluster is good for you
Exploring administration commands
Improving performance and setting benchmarks
Planning for when things go wrong
Working with Apache Hadoop’s Capacity Scheduler
Dealing with security issues
Adding resources to your administrator toolset
You’ll want to keep your Hadoop cluster running smoothly and at a high level of performance. For that to happen, you need to master the mysteries of Hadoop administration. Part of this process involves careful planning to ensure that you deploy and configure appropriate hardware for your Hadoop cluster, the use of judicious benchmarking to evaluate performance, and a good understanding of the anticipated workloads.
Complicating matters a bit ...
Get Hadoop For Dummies now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.