O'Reilly logo

Virtualizing Hadoop: How to Install, Deploy, and Optimize Hadoop in a Virtualized Architecture by George J. Trujillo Jr., Justin Murray, Rommel Garcia, Steven Jones, Charles Kim

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 12. Virtualizing the Hadoop Worker Nodes

Two roads diverged in a yellow wood. I took the one less traveled by, and that has made all the difference.

—Robert Frost

In this chapter we look at different ways of virtualizing the worker nodes in a Hadoop cluster—those nodes that contain the NodeManager, Application Master, Containers, and DataNode processes as their Hadoop roles. The standard native configuration that you might have seen up to now has been to combine compute processing (NodeManager, Application Master, Container) and storage (DataNode) on each Hadoop worker node. We will initially use that approach here when we virtualize the worker nodes and then show how you can go beyond it for flexibility reasons.

The design approaches ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required