Chapter 7. Multitenancy in Hadoop

In the long history of humankind (and animal kind, too) those who learned to collaborate and improvise most effectively have prevailed.

—William Arthur Ward

Successful communities and organizations must be able to collaborate and share common sources in a way that protects the individuals (tenants) and the shared sources. A data lake has the same requirements. A data lake must provide a level of multitenancy that can logically isolate the tenants to protect the physical shared sources (data). This chapter introduces you to the concepts of isolating data, resources, and processes in a Hadoop cluster that is meant to enforce desired, acceptable service-level requirements across different types of applications and ...

Get Virtualizing Hadoop: How to Install, Deploy, and Optimize Hadoop in a Virtualized Architecture now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.