Chapter 3

Models and Techniques for Cloud-Based Data Analysis

Abstract

This chapter discusses the main models and techniques used to design Cloud-based data analysis applications. The models presented here are based on MapReduce, workflows, and NoSQL database management systems. The following sections explain how each of these three approaches offers scalability for mining Big Data repositories on Clouds. Section 3.1 introduces the MapReduce model and shows how it can be used to implement scalable data analysis algorithms and applications. Section 3.2 discusses workflow systems, presents some workflow management systems implemented on Cloud architectures, and describes their main features for implementing data analysis applications. ...
