Chapter 8

Developing Big Data Applications

This chapter examines the MapReduce programming model in greater detail. It questions the expectation that migrating to a big data platform will automatically provide scalable performance, and gives an overview of parallel application development using MapReduce. We walk through the execution model and consider how data distribution and task parallelism affect performance. The chapter also discusses another open source framework for big data application development. Lastly, we review the execution model for parallel applications.

Keywords

Big data applications; parallelism; data distribution; scalability; MapReduce; application development framework; programming model; application ...
