Hadoop MapReduce v2 Cookbook - Second Edition by Thilina Gunarathne

Chapter 5. Analytics

In this chapter, we will cover the following recipes:

  • Simple analytics using MapReduce
  • Performing GROUP BY using MapReduce
  • Calculating frequency distributions and sorting using MapReduce
  • Plotting the Hadoop MapReduce results using gnuplot
  • Calculating histograms using MapReduce
  • Calculating scatter plots using MapReduce
  • Parsing a complex dataset with Hadoop
  • Joining two datasets using MapReduce

Introduction

In this chapter, we discuss how to use Hadoop to process a dataset and understand its basic characteristics. More complex methods, such as data mining, classification, and clustering, are covered in later chapters.

This chapter will show how you can calculate basic analytics using a given dataset. For the recipes in ...
