The exploratory data analysis (EDA)

To identify our missing values we will begin with an EDA of our dataset. We will be using some useful python packages, pandas and numpy, to store our data and make some simple calculations as well as some popular visualization tools to see what the distribution of our data looks like. Let's begin and dive into some code. First, we will do some imports:

# import packages we need for exploratory data analysis (EDA)import pandas as pd # to store tabular dataimport numpy as np # to do some mathimport matplotlib.pyplot as plt # a popular data visualization toolimport seaborn as sns  # another popular data visualization tool%matplotlib inline plt.style.use('fivethirtyeight') # a popular data visualization theme ...

Get Feature Engineering Made Easy now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.