February 2019
Beginner to intermediate
308 pages
7h 42m
English
First, let's call the isnull() function to check whether there are any missing values in the dataset:
print(df.isnull().any())
We'll see the following output:

It seems like there are no missing values in the dataset, but are we sure? Let's get a statistical summary of the dataset to investigate further:
print(df.describe())
The output is as follows:

We can see that there are 768 rows of data, and the Pregnancies, Glucose, BloodPressure, SkinThickness, Insulin, and BMI columns have a minimum value of 0. This doesn't ...