O'Reilly logo

Big Data Analytics with R and Hadoop by Vignesh Prajapati

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Understanding how to run Hadoop streaming with R

Now, we understood what Hadoop streaming is and how it can be called with Hadoop generic as well as streaming options. Next, it's time to know how an R script can be developed and run with R. For this, we can consider a better example than a simple word count program.

The four different stages of MapReduce operations are explained here as follows:

  • Understanding a MapReduce application
  • Understanding how to code a MapReduce application
  • Understanding how to run a MapReduce application
  • Understanding how to explore the output of a MapReduce application

Understanding a MapReduce application

Problem definition: The problem is to segment a page visit by the geolocation. In this problem, we are going to consider ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required