June 2015
Beginner
348 pages
8h 44m
English
We can use some kind of threshold to weed out outliers, but there is a better way. It is called the median, and it basically picks the middle value of a sorted set of values (see https://www.khanacademy.org/math/probability/descriptive-statistics/central_tendency/e/mean_median_and_mode). One half of the data is below the median and the other half is above it. For example, if we have the values of 1, 2, 3, 4, and 5, then the median will be 3, since it is in the middle.
These are the steps to calculate the median:
simplestats.py. You already know how to load the data from a CSV file into an array. So, copy that line of code and make sure that it only gets the close ...Read now
Unlock full access