O'Reilly logo

Mastering Predictive Analytics with R - Second Edition by Rui Miguel Forte, James D. Miller

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 13. Scaling Up

Up until now, we have reviewed a steady stream of pertinent topics concerning statistics and specifically, predictive analytics. In this chapter, we look to provide a tutorial dedicated to applying those concepts and practices to very large datasets. First, we'll begin by defining the phrase very large – at least as it is used to describe data defined (that we want to train our predictive models on or run our statistical algorithms against). Next, we will review the list of the challenges imposed by using bigger data sources, and finally, we will offer some ideas for meeting these challenges.

Our chapter is broken down into the following sections:

  • Getting started
  • The phases of an analytics project
  • Experience and data of scale ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required