
127
4
Incremental MapReduce
Computations
Pramod Bhatotia, Alexander Wieder,
Umut A. Acar, and Rodrigo Rodrigues
4.1 INTRODUCTION
Distributed processing of large data sets has become an important task in the life of
various companies and organizations, for whom data analysis is an important vehicle
to improve the way they operate. This area has attracted a lot of attention from both
researchers and practitioners over the last few years, particularly after the introduc-
tion of the MapReduce paradigm for large-scale parallel data processing [19].
A usual characteristic of the data sets that are provided as inputs to large-scale
data-processing jobs ...