11Big Data
In this chapter we introduce some basic concepts of big data and revisit some analyses discussed in earlier chapters using big data specific software. In particular, big data implementation for our usual linear regression model, introduced in Chapter 4, is considered. We begin with a discussion of a rank-based estimation algorithm intended to be used for big data. Following that, we go over some general R packages that are suited for big data. Next, we cover some of the computational aspects for implementing rank-regression model fitting in a big data setting and provide example usage of our R package bigRfit. We close by discussing what is one of the main software tools for big data analytics, Spark, as well as an R interface to ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access