October 2018
Beginner to intermediate
676 pages
18h 30m
English
This dataset has 11 attributes that influence the quality of the wine. The Quality rating varies from 3 to 8, and then we map 3 and 4 to Low, 5 and 6 to Med, and 7 and 8 to High to create a new variable: Quality.
The first five rows of the dataset looks as follows:
We will also compute the correlation matrix for the Wine Quality dataset, which we will use in some of the plots.
The following is the code block to read and add additional variables to the Wine Quality dataset:
import pandas as pd# Read the data from a csv file into pandas data framewine_quality = pd.read_csv('winequality.csv', delimiter=';')# Map numeric Quality ...Read now
Unlock full access