Big Data

by Kuan-Ching Li, Hai Jiang, Laurence T. Yang, Alfredo Cuzzocrea
February 2015
Beginner to intermediate
498 pages
16h 57m
English
Chapman and Hall/CRC
Chapter 2

Scalability and Cost Evaluation of Incremental Data Processing Using Amazon’s Hadoop Service

Xing Wu, Yan Liu, and Ian Gorton

Abstract

Based on the MapReduce model and Hadoop Distributed File System (HDFS), Hadoop enables the distributed processing of large data sets across clusters with scalability and fault tolerance. Many data-intensive applications involve continuous and incremental updates of data. Understanding the scalability and cost of a Hadoop platform to handle small and independent updates of data sets sheds light on the design of scalable and cost-effective data-intensive applications. In this chapter, we introduce a motivating movie recommendation application implemented in the MapReduce model and deployed on Amazon Elastic ...
Publisher Resources

ISBN: 9781482240559