Chapter 11
Running Hadoop Applications on AWS
WHAT’S IN THIS CHAPTER?
- Understanding options for running Hadoop on AWS
- Understanding Elastic MapReduce and its capabilities
- Understanding Simple Storage Service and its capabilities
- Programming for S3
- Programmatically configuring EMR
- Understanding the options for orchestrating job execution in EMR
WROX.COM CODE DOWNLOADS FOR THIS CHAPTER
The wrox.com code downloads for this chapter are found at www.wiley.com/go/prohadoopsolutions on the Download Code tab. The code is in the Chapter 11 download and individually named according to the names throughout the chapter.
Amazon Web Services (AWS) is becoming a more and more popular alternative to an on-premises hardware deployment. The advantages of AWS include (but are not limited to) the following:
- Fast and flexible on-demand deployment options enable you to provision only what is required when you need it. It also enables you to ramp up your hardware and software as your needs grow.
- Depending on your requirements, AWS provides a wide variety of flexible pricing options.
- AWS provides metered billing, which means that you know exactly how many resources you are using at any time, and how much you have to pay for them.
With the growing popularity of both Hadoop and AWS, the use of AWS for running Hadoop applications is rapidly becoming a hot topic.
Now that you know about the main Hadoop architecture and functionality, in this chapter you learn different approaches to running Hadoop on ...
Get Professional Hadoop Solutions now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.