October 2017
Beginner to intermediate
236 pages
7h 38m
English
The dataset for this recipe is the 2016 USA domestic airline data, downloaded from the Bureau of Transportation Statistics (https://www.transtat.bts.gov). The objective is to calculate four summary statistics (minimum, mean, median, and maximum) of departure delay for each month and each flight origin. The variables of interest in this recipe are as follows:
The summary statistics will be calculated twice, once with base R and once with dplyr, so that the processing times of the two approaches can be compared.
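
As a rough illustration of that comparison, the following sketch computes the four statistics per month and origin with both approaches and times each one using system.time(). It runs on a small simulated data frame rather than the real airline file, and the column names MONTH, ORIGIN, and DEP_DELAY, as well as the helper four_stats(), are assumptions made for this example and may not match the actual dataset or the recipe's own code.

```r
# Simulated stand-in for the airline data; column names are assumed.
set.seed(1)
flights <- data.frame(
  MONTH     = sample(1:12, 1000, replace = TRUE),
  ORIGIN    = sample(c("JFK", "LAX", "ORD"), 1000, replace = TRUE),
  DEP_DELAY = round(rnorm(1000, mean = 10, sd = 30))
)

# Hypothetical helper: the four summary statistics, ignoring missing delays.
four_stats <- function(x) {
  c(min    = min(x, na.rm = TRUE),
    mean   = mean(x, na.rm = TRUE),
    median = median(x, na.rm = TRUE),
    max    = max(x, na.rm = TRUE))
}

# Base R: aggregate() applies four_stats() to each MONTH/ORIGIN group.
base_time <- system.time(
  base_res <- aggregate(DEP_DELAY ~ MONTH + ORIGIN,
                        data = flights, FUN = four_stats)
)

# dplyr: the same grouping expressed with group_by() and summarise().
if (requireNamespace("dplyr", quietly = TRUE)) {
  library(dplyr)
  dplyr_time <- system.time(
    dplyr_res <- flights %>%
      group_by(MONTH, ORIGIN) %>%
      summarise(min    = min(DEP_DELAY, na.rm = TRUE),
                mean   = mean(DEP_DELAY, na.rm = TRUE),
                median = median(DEP_DELAY, na.rm = TRUE),
                max    = max(DEP_DELAY, na.rm = TRUE)) %>%
      ungroup()
  )
  print(dplyr_time)
}
print(base_time)
```

On a data frame this small, both timings will be close to zero; any meaningful difference should only be expected on the full dataset.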