Measures of variation
It is useful to know how much the values in a dataset vary. Several statistical functions help measure this:
span(arr)
: span is used to calculate the total spread of the dataset, which is the range from minimum(arr) to maximum(arr).
variation(arr)
: Also called the coefficient of variation (CV). It is the ratio of the standard deviation to the mean of the dataset, so it expresses the extent of variability relative to the mean. Its advantage is that it is a dimensionless number, so it can be used to compare datasets that have different units or scales.
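As a rough illustration of the two measures above, the following sketch recomputes them by hand using only Julia's Statistics standard library. The functions span and variation described in the text are assumed here to come from the StatsBase.jl package; the names spread_bounds and coeff_variation are hypothetical helpers introduced only for this example, and the sample data is made up.

```julia
using Statistics  # standard library: provides mean and std

# Made-up sample data, for illustration only.
arr = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

# Hypothetical helper: the two ends of the spread, smallest and largest value.
# extrema finds both in a single pass over the data.
spread_bounds(x) = extrema(x)

# Hypothetical helper: coefficient of variation, the (sample) standard
# deviation divided by the mean.
coeff_variation(x) = std(x) / mean(x)

lo, hi = spread_bounds(arr)
println("spread: ", lo, " to ", hi, " (width ", hi - lo, ")")
println("CV: ", coeff_variation(arr))

# The CV is dimensionless: rescaling the data (say, metres to centimetres)
# changes the standard deviation and the mean by the same factor.
println("CV after rescaling: ", coeff_variation(arr .* 100))
```

The last two printed values should agree up to floating-point rounding, which is what makes the CV suitable for comparing datasets measured on different scales.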