June 2020
Intermediate to advanced
382 pages
11h 39m
English
Let's see how we can design a reusable processing pipeline to address the regressors challenge. As mentioned, we will prepare the data once and then use it in all the regression algorithms. Let's follow these steps:
We start by importing the dataset, as follows:
dataset = pd.read_csv('auto.csv')
Let's now preview the dataset:
dataset.head(5)
This is how the dataset will look:

Now, let's proceed on to feature selection. Let's drop the NAME column as it is only an identifier that is needed for cars. Columns that are used to identify the rows in our dataset are not relevant for training ...