June 2018
Intermediate to advanced
248 pages
5h 27m
English
Another important visual in exploratory data analysis is the box plot, also known as the box-and-whisker plot. It is built on the five-number summary: the minimum, first quartile, median, third quartile, and maximum. In a standard box plot, the box spans the first to the third quartile, a line inside the box marks the median, and the whiskers extend to the minimum and maximum.
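To make the five-number summary concrete, here is a minimal sketch of how it could be computed with NumPy. The helper name `five_number_summary` and the sample values are illustrative assumptions, not part of the original text, and the quartiles use NumPy's default linear interpolation (other quartile conventions give slightly different values).

```python
import numpy as np


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a 1-D sample.

    Hypothetical helper for illustration; quartiles use NumPy's
    default (linear interpolation) percentile method.
    """
    arr = np.asarray(values, dtype=float)
    q1, median, q3 = np.percentile(arr, [25, 50, 75])
    return tuple(float(v) for v in (arr.min(), q1, median, q3, arr.max()))


# Made-up sample, only to show the call.
sample = [1, 2, 3, 4, 5, 6, 7, 8, 9]
summary = five_number_summary(sample)
print(summary)
```

The box of a box plot is drawn between the second and fourth values of this tuple, and the median line sits at the third.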

It's a very convenient way of comparing several distributions. In general, the whiskers extend to the extreme points. Alternatively, you can limit them to 1.5 times the interquartile range beyond the first and third quartiles, in which case points outside the whiskers are drawn individually as outliers. Let's check our CRIM and RM features:
In [60]: %matplotlib notebook
         import matplotlib.pyplot as plt
         from scipy ...
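Since the cell above is cut off, the following is only a rough sketch of the two whisker conventions, not the book's own code. It assumes synthetic stand-ins for the CRIM and RM columns (the arrays `crim_like` and `rm_like` are invented here) and uses the `whis` argument of Matplotlib's `boxplot`: `whis=(0, 100)` stretches the whiskers to the extreme points, while `whis=1.5` limits them to 1.5 times the interquartile range.

```python
import matplotlib

matplotlib.use("Agg")  # headless backend so the sketch also runs outside a notebook
import matplotlib.pyplot as plt
import numpy as np

rng = np.random.default_rng(0)

# Assumption: synthetic data standing in for the real features.
crim_like = rng.lognormal(mean=0.0, sigma=1.5, size=500)  # skewed, like a rate
rm_like = rng.normal(loc=6.3, scale=0.7, size=500)        # roughly symmetric

fig, (ax_full, ax_iqr) = plt.subplots(1, 2, figsize=(8, 4))

# Whiskers reach the minimum and maximum (0th and 100th percentiles).
full = ax_full.boxplot([crim_like, rm_like], whis=(0, 100))
ax_full.set_xticklabels(["CRIM-like", "RM-like"])
ax_full.set_title("Whiskers at the extremes")

# Whiskers limited to 1.5 * IQR; points beyond them are drawn as fliers.
iqr = ax_iqr.boxplot([crim_like, rm_like], whis=1.5)
ax_iqr.set_xticklabels(["CRIM-like", "RM-like"])
ax_iqr.set_title("Whiskers at 1.5 * IQR")

fig.tight_layout()

# Count the points drawn as outliers for the skewed feature in each panel.
n_fliers_full = len(full["fliers"][0].get_ydata())
n_fliers_iqr = len(iqr["fliers"][0].get_ydata())
```

With a skewed feature, the 1.5 IQR variant usually shows a compact box with many individual outlier markers, whereas the full-range variant hides them inside one long whisker.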