Chapter 8. Mesos Big Data Frameworks

This chapter is a guide to deploying important big data processing frameworks, such as Hadoop, Spark, Storm, and Samza, on top of Mesos.

Hadoop on Mesos

This section introduces Hadoop, explains how to set up the Hadoop stack on Mesos, and discusses the problems commonly encountered during setup.

Introduction to Hadoop

Hadoop was developed by Mike Cafarella and Doug Cutting in 2006 to support distributed processing for the Nutch search engine project. The project was named after a toy elephant that belonged to Doug Cutting's son.

The following modules make up the Apache Hadoop framework:

  • Hadoop Common: This contains the common libraries and utilities required by the other modules
  • Hadoop Distributed File System (HDFS): This is a distributed, scalable filesystem ...
