To identify our missing values we will begin with an EDA of our dataset. We will be using some useful python packages, pandas and numpy, to store our data and make some simple calculations as well as some popular visualization tools to see what the distribution of our data looks like. Let's begin and dive into some code. First, we will do some imports:
# import packages we need for exploratory data analysis (EDA)import pandas as pd # to store tabular dataimport numpy as np # to do some mathimport matplotlib.pyplot as plt # a popular data visualization toolimport seaborn as sns # another popular data visualization tool%matplotlib inline plt.style.use('fivethirtyeight') # a popular data visualization theme ...