If you are still in a session in which OnlineRetail is still in memory, you are OK! However, if you are picking up where we left off, you will need to load the data that we saved in the last session. Start by setting the working directory and then loading the OnlineRetail training dataset:
rm(list = ls())setwd("C:/PracticalPredictiveAnalytics/Data")load("OnlineRetail.full.Rda")# works for small dataOnlineRetail <- OnlineRetail[1:10000,]cat(nrow(OnlineRetail), "rows loaded\n") > 10000 rows loaded
The cat function in the previous step should reflect the number of rows in the training data set, which is 268034.