O'Reilly logo

Mastering AWS Development by Uchit Vyas

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Introducing Amazon Elastic MapReduce

Amazon AWS provides Hadoop as a PaaS. Establishments and people can access Hadoop clusters on the fly, run their workloads, and download outcomes. Provisioning a Hadoop cluster using Elastic MapReduce (EMR) takes a few minutes and a few steps.

The common steps to form and run workloads on EMR are as follows:

  1. The application is developed locally in Java using Hadoop's MapReduce APIs, Hive (Hive is a data warehouse product that facilitates querying and managing huge datasets residing in distributed storage) or a language of the user's choice. Languages not based on Java can be executed in a Hadoop cluster using Hadoop Streaming.
  2. The application and the relevant data are stored in Amazon S3. Numbers of clients for ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required