July 2017
Intermediate to advanced
796 pages
18h 55m
English
Up to the Spark 1.6.3 release, the Spark distribution ships a shell script called spark-ec2 (in the ec2 directory under SPARK_HOME) for launching a Spark cluster on EC2 instances from your local machine. The script helps you launch, manage, and shut down the Spark cluster that you will be using on AWS. Starting with Spark 2.x, however, the script was moved out of the Spark distribution to AMPLab so that bugs could be fixed and the script maintained separately.
The script can be accessed and used from the GitHub repo at https://github.com/amplab/spark-ec2.
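To make the launch, manage, and shut down cycle more concrete, the following is a minimal Python sketch that only assembles spark-ec2 command lines; it does not run them or contact AWS. The helper spark_ec2_command is hypothetical, and the cluster name, key pair name, and identity file are placeholder assumptions. It assumes the script is invoked as ./spark-ec2 from a checkout of the repository, with the --key-pair, --identity-file, and --slaves options and the launch, login, and destroy actions.

```python
import shlex


def spark_ec2_command(action, cluster_name, key_pair, identity_file,
                      slaves=None, script="./spark-ec2"):
    """Build the argument list for one spark-ec2 invocation (hypothetical helper).

    Nothing is executed here; the caller decides whether to run the command.
    """
    args = [
        script,
        f"--key-pair={key_pair}",            # name of the EC2 key pair
        f"--identity-file={identity_file}",  # private key file for that key pair
    ]
    if slaves is not None:
        args.append(f"--slaves={slaves}")    # number of worker instances
    # Options come first, then the action and the cluster name.
    args += [action, cluster_name]
    return args


if __name__ == "__main__":
    # Placeholder values, assumed for illustration only.
    common = dict(
        cluster_name="my-spark-cluster",
        key_pair="my-keypair",
        identity_file="my-keypair.pem",
    )
    # Launch a cluster, log in to its master, then tear it down.
    for action, extra in [("launch", {"slaves": 2}), ("login", {}), ("destroy", {})]:
        print(shlex.join(spark_ec2_command(action, **common, **extra)))
```

Printing the commands rather than executing them keeps the sketch safe to run anywhere. To act on a real cluster, you would pass the returned list to subprocess.run from the directory that contains the script, with your AWS credentials available in the environment.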