April 2016
Beginner to intermediate
384 pages
8h 36m
English
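To get a solid picture of the distribution of a random variable, we look at its mean and standard deviation, minimum and maximum values, median, mode, first and third quartiles, skewness, and kurtosis. Together, these describe where the values are centered, how widely they spread, and how asymmetric or heavy-tailed the distribution is.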
Sometimes it is useful to run a statistical test to check whether our data follows a specific distribution. This, however, is beyond the scope of this book.
To execute this recipe, all you need is pandas. No other prerequisites are required.
Here is a piece of code that can quickly give you a basic understanding of your data. We assume that our data was read from a CSV file and stored in the csv_read variable (the data_describe.py file):
# calculate the descriptives: count, ...
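As the listing above is cut short here, the following is only a minimal sketch of how such descriptives might be computed with pandas. It is not necessarily the code from data_describe.py. The inline CSV text and the column names beds and price are made up for illustration, and csv_desc is a hypothetical variable name; only csv_read comes from the text above.

```python
import io
import pandas as pd

# Hypothetical stand-in for the CSV file; the columns are illustrative only
csv_text = """beds,price
2,100
3,150
3,200
4,250
5,900
"""
csv_read = pd.read_csv(io.StringIO(csv_text))

# describe() returns count, mean, std, min, the quartiles
# (25%, 50% = median, 75%), and max; transpose() puts one
# variable on each row
csv_desc = csv_read.describe().transpose()

# describe() does not report skewness, kurtosis, or mode,
# so add them as extra columns
csv_desc['skew'] = csv_read.skew()
csv_desc['kurt'] = csv_read.kurt()

# mode() can return several values per column; keep the first
# (smallest) one as a simplification
csv_desc['mode'] = csv_read.mode().iloc[0]

print(csv_desc)
```

Transposing the result keeps one row per variable, which tends to be easier to read when the dataset has many columns. Keeping only the first mode is a shortcut: a column whose values are all distinct, such as price here, has no meaningful single mode.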