O'Reilly logo

Hadoop Blueprints by Tanmay Deshpande, Anurag Shrivastava

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Hadoop on Cloud

Hadoop is a distributed system and it is capable of running over thousands of distributed nodes. Hadoop mega clusters with thousands of nodes are already in production. In this book, we developed solutions on a single-node cluster. Such a setup is good for learning but not sufficient for a production environment. Setting up even a modest three- or five-node Hadoop cluster may not be very feasible at home due to the cost of hardware involved. Arranging the budgets for a five-node Hadoop cluster in a company will require you to go through a budgetary approval process and then order the hardware, which can be a time-consuming process.

Hadoop on Cloud offers a good alternative to having a multinode Hadoop setup in your own data center ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required