Using data.table to manipulate data
In the first section, we reviewed some built-in functions used to manipulate data frames. Then, we introduced sqldf
, which makes simple data query and summary easier. However, both approaches have their limitations. Using built-in functions can be verbose and slow, and it is not easy to summarize data because SQL is not as powerful as the full spectrum of R functions.
The data.table
package provides a powerful enhanced version of data.frame
. It is blazing fast and has the ability to handle large data that fits into memory. It invents a natural syntax of data manipulation using []
. Run the following command to install the package from CRAN if you don't have it yet:
install.packages("data.table")
Once the package ...
Get Learning R Programming now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.