Paco Nathan

Getting Started Running Apache Spark on Apache Mesos

Date: This event took place live on January 24 2014

Presented by: Paco Nathan

Duration: Approximately 60 minutes.

Cost: Free

Apache Spark is a fast and general-purpose cluster computing system which makes parallel jobs easy to write. Apache Mesos is a cluster manager that provides efficient resource isolation and sharing across distributed frameworks. Both open source projects are foundational for the emerging "Berkeley Stack" for analytics at scale. This tutorial shows a simple way to launch a Mesos cluster in the cloud, how to configure run Spark on Mesos, then how to run jobs in Spark.

This webcast tutorial will show you:

  • A simple way to launch a Mesos cluster in the cloud
  • How to configure and run Spark on Mesos
  • How to run jobs in Spark

This webcast material is a preview for the Mesos tutorial at Strata SC 2014 led by Paco Nathan.

About Paco Nathan

Paco Nathan, Chief Scientist for Mesosphere in SF, is known as a "player/coach" data scientist who's led innovative Data teams building large-scale apps for 10+ years. As a recognized expert in distributed systems, machine learning, and Enterprise data workflows, Paco is an O'Reilly author "Enterprise Data Workflows with Cascading" and an evangelist for the Apache Mesos open source project. Paco received his BS Math Sci and MS Comp Sci degrees from Stanford University, and has 25+ years technology industry experience ranging from Bell Labs to early-stage start-ups. Newsletter and "official" web site: http://liber118.com/pxn/

You may also be interested in:

Strata Conference

Questions? Please send email to