Chapter 8. Integrating R and Hadoop for statistics and more
- Integrating your R scripts with MapReduce and Streaming
- Understanding Rhipe, RHadoop, and R + Streaming
R is a statistical programming language for performing data analysis and graphing the results. The capabilities of R let you perform statistical and predictive analytics, data mining, and visualization functions on your data. Its breadth of coverage and applicability across a wide range of sectors (such as finance, life sciences, manufacturing, retail, and more) make it a popular tool.