June 2017
Beginner to intermediate
576 pages
15h 22m
English
It is a good idea to periodically save your data frames, so that you can pick up your analysis from various checkpoints.
In this example, I will first sort both data sets by InvoiceNo and then save the test and train data sets to disk, so that I can load them back into memory whenever needed:
setwd("C:/PracticalPredictiveAnalytics/Data")
OnlineRetail <- OnlineRetail[order(OnlineRetail$InvoiceNo), ]
OnlineRetail.test <- OnlineRetail.test[order(OnlineRetail.test$InvoiceNo), ]
save(OnlineRetail, file='OnlineRetail.full.Rda')
save(OnlineRetail.test, file='OnlineRetail.test.Rda')
load('OnlineRetail.full.Rda')
load('OnlineRetail.test.Rda')
nrow(OnlineRetail)
> [1] 268034
nrow(OnlineRetail.test)
> [1] 268034
nrow(OnlineRetail)
> [1] ...
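To check that a checkpoint really restores what was saved, the save and load round trip can be tried on a small stand-in data frame. The following is only a minimal sketch: the `retail` data frame, its values, and the `retail.checkpoint.Rda` file name are made up for illustration, and the file is written to a temporary directory rather than to the working directory used above.

```r
# Toy stand-in for the real data; the values are invented for illustration.
retail <- data.frame(
  InvoiceNo = c("536370", "536365", "536367"),
  Quantity  = c(24L, 6L, 32L),
  stringsAsFactors = FALSE
)

# Sort by InvoiceNo, as in the text.
retail <- retail[order(retail$InvoiceNo), ]

# Write the checkpoint to a temporary directory (hypothetical file name).
checkpoint <- file.path(tempdir(), "retail.checkpoint.Rda")
save(retail, file = checkpoint)

# Keep a copy for comparison, then drop the original from the workspace.
expected <- retail
rm(retail)

# load() recreates the object under its original name and
# returns the names of the objects it restored.
restored_names <- load(checkpoint)

# TRUE when the restored data frame matches the one that was saved.
round_trip_ok <- identical(retail, expected)
```

Because `load()` restores objects under the names they were saved with, it silently overwrites any object of the same name already in the workspace, so it is worth checking `restored_names` when loading a checkpoint into a session that is already in use.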