October 2017
Beginner to intermediate
236 pages
7h 38m
English
In R, there are many ways to do data summarization, but when the dataset gets bigger, some of the base functions take a longer time to complete the task. The dplyr library is designed to handle a larger dataset to perform this kind of task efficiently, and faster than the default base R functionality. In this recipe, you will explore the speed of the dplyr verb to calculate summary statistics from a large dataset and compare the elapsed time with that of the base R functionality.