Book description
What you’ll learn—and how you can apply it
You’ll learn to perform efficient data carpentry—the process of taking rough, raw, and to some extent randomly arranged input data and creating neatly organized and tidy data. Working with clean data will be beneficial for every subsequent stage of your R project.
In this Lesson, readers will learn how to create user-friendly data frames with tibble, reshape data with tidyr operations such as gather and separate, process data efficiently with dplyr’s functions, and connect R to a range of database types.
This lesson is for you because
You are working on a project in R and have reached the data processing stage. You want to clean, manipulate, and tidy your dataset to get it ready for the next stage (typically modeling and visualization).
Prerequisites
- Some knowledge of R
Materials or downloads needed in advance
- Installed RStudio
This Lesson relies on a number of packages for data cleaning and processing. Check that they are installed on your computer and load them with:
- library("tibble")
- library("tidyr")
- library("stringr")
- library("readr")
- library("dplyr")
- library("data.table")
RSQLite and ggmap are also used in a couple of examples, though they are not central to the Lesson’s content.
Publisher resources
Product information
- Title: Efficient data processing with R
- Author(s):
- Release date: December 2016
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491980729
You might also like
book
R for Data Science
Learn how to use R to turn raw data into insight, knowledge, and understanding. This book …
book
R for Data Science, 2nd Edition
Use R to turn data into insight, knowledge, and understanding. With this practical book, aspiring data …
book
Introduction to Machine Learning with R
Machine learning is an intimidating subject until you know the fundamentals. If you understand basic coding …
book
Hands-On Programming with R
Learn how to program by diving into the R language, and then use your newfound skills …