How do I configure Apache Spark on an Amazon Elastic MapReduce (EMR) cluster?

Learn how to manage Apache Spark configuration overrides for an AWS Elastic MapReduce cluster to save time and money.

By Frank Kane
June 9, 2017
Screenshot from "How do I configure Apache Spark on an Amazon Elastic MapReduce (EMR) cluster?" Screenshot from "How do I configure Apache Spark on an Amazon Elastic MapReduce (EMR) cluster?" (source: O'Reilly)

Specifying default configuration overrides for Spark ensures your clusters are consistent and large jobs will not fail. Amazon Web Services pro Frank Kane shows you how to set these overrides for your Apache Spark/Elastic MapReduce (EMR) cluster.


Learn more about running Spark for big data analysis on the Amazon Elastic MapReduce service (EMR) with video training from Frank Kane.

Learn faster. Dig deeper. See farther.

Join the O'Reilly online learning platform. Get a free trial today and find answers on the fly, or master something new and useful.

Learn more
Post topics: Data
Post tags: Questions
Share: