This tutorial is aimed at R users who want to use Hadoop to work on big data and Hadoop users who want to do sophisticated analytics. The presenters introduce you to R, Hadoop, and the RHadoop project. They cover three R packages for Hadoop and the mapreduce model and present numerous examples of incremental complexity including the combination of rmr and RevoscaleR to solve modeling problems. This tutorial was filmed at the O'Reilly Strata Conference + Hadoop World NY in October of 2013.
- Title: Using R and Hadoop for Statistical Computation at Scale
- Release date: July 2014
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491908761