Using optimized packages
Many of the functionalities in base R have alternative implementations available in contributed packages. Quite often, these packages offer a faster or less memory-intensive substitute for the base R equivalent. For example, in addition to adding a ton of extra functionality, the glmnet
package performs regression far faster than glm
in my experience.
For faster data import, you might be able to use fread
from the data.table
package or the read_*
family of functions from the readr
package. It is not uncommon for data import tasks that used to take several hours to take only a few minutes with these read functions.
For common data manipulation tasks—like merging (joining), conditional selection, sorting, and so on—you will ...
Get R: Data Analysis and Visualization now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.