June 2017
Beginner to intermediate
576 pages
15h 22m
English
Correlation and covariance functions are available which operate directly on Spark dataframes. The following example shows that, for patients with diabetes, there is an 11% correlation between age and glucose level:
corr <- corr(out_sd1, "glucose", "age", method = "pearson") corr
