Loading and preparing the data

To get the data into your working directory, you can find it on my GitHub at this link: https://github.com/datameister66/MMLR3rd.

The file we're using is ames.csv. This data is from the sales of homes sold in Ames, Iowa, which is the location of Iowa State University, and I believe has a population of around 70,000. I downloaded the data from Kaggle.com, and the response we're trying to predict is the final sales price. It's a nice size to practice machine learning methods with 1,460 observations of 84 features, and many of the features are categorical.

Before we load the data, if not already done, load the necessary packages, call the magrittr library, and, if you so choose, update the options. I prefer not ...

Get Mastering Machine Learning with R - Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.