Hadoop Operations and Cluster Management Cookbook by Shumin Guo

Chapter 8. Building a Hadoop Cluster with Amazon EC2 and S3

In this chapter, we will cover:

  • Registering with Amazon Web Services (AWS)
  • Managing AWS security credentials
  • Preparing a local machine for EC2 connection
  • Creating an Amazon Machine Image (AMI)
  • Using S3 to host data
  • Configuring a Hadoop cluster with the new AMI

Introduction

Amazon Elastic Compute Cloud (EC2) and Simple Storage Service (S3) are cloud computing web services provided by Amazon Web Services (AWS). EC2 offers infrastructure as a service (IaaS), with which we can, in theory, start up an unlimited number of servers in the cloud. S3 offers storage services in the cloud. More information about AWS, EC2, and S3 can be obtained from aws.amazon.com.

From the previous chapters of this book, we ...