October 2017
Beginner to intermediate
236 pages
7h 38m
English
In the dplyr framework, the first input is the entire dataset which works like the attach() function in base R. The use of the pipe operator gives you the facility to call the variable names without any quote similar to accessing the variables after applying the attach() function. The select() function takes the first input as the entire data frame and then takes the names of variables separated by commas. In this example, the variables were QUARTER, MONTH, ORIGIN, DEST, DEP_DELAY, and ARR_DELAY.
To show the significance of using select(), the example has more code such as group_by(), do(), and summarize(). These additional three functions have been used to show the capability of processing larger datasets efficiently and ...