After importing the data into the R environment, the code works as follows:
- It takes the entire dataset as an input in the first place. It will work as the input for the function select().
- Within the select() function, it only keeps the variables of month and departure delay.
- The filter() function takes the output from the select() function and drops those rows that have negative values in the DEP_DELAY variable.
- The group_by function instructs the summarize() function that the calculation should be grouped by the variable specified into the group_by section.
- The summarize() function then calculates the mean departure delay for each month and supplies the output data to the plotting function qplot().
- The qplot() function ...