June 2017
Beginner to intermediate
576 pages
15h 22m
English
This example uses the much larger diabetes dataset. Since most of the variables in this dataset are numeric, OneR can bin all of them:
library(OneR) df = sql("SELECT outcome, age, mass, triceps, pregnant, glucose, pressure, insulin, pedigree FROM global_temp.df_view") local = collect(sample(df, F,.15)) data <- optbin(local,outcome~.) summary(data)