July 2017
Intermediate to advanced
796 pages
18h 55m
English
Kurtosis is a way of quantifying differences in the shape of distributions, which may look very similar in terms of means and variances, yet are actually different. In such cases, kurtosis becomes a good measure of the weight of the distribution at the tail of the distribution, as compared to the middle of the distribution.
The kurtosis API has several implementations, as follows. The exact API used depends on the specific use case.
def kurtosis(columnName: String): ColumnAggregate function: returns the kurtosis of the values in a group.def kurtosis(e: Column): ColumnAggregate function: returns the kurtosis of the values in a group.
Let's look at an example of invoking kurtosis on the DataFrame on the Population column:
import org.apache.spark.sql.functions._ ...
Read now
Unlock full access