Skip to Content
Beginning Data Science in R 4: Data Analysis, Visualization, and Modelling for the Data Scientist
book

Beginning Data Science in R 4: Data Analysis, Visualization, and Modelling for the Data Scientist

by Thomas Mailund
June 2022
Beginner
528 pages
10h 39m
English
Apress
Content preview from Beginning Data Science in R 4: Data Analysis, Visualization, and Modelling for the Data Scientist
© Thomas Mailund 2022
T. MailundBeginning Data Science in R 4https://doi.org/10.1007/978-1-4842-8155-0_3

3. Data Manipulation

Thomas Mailund1  
(1)
Aarhus, Denmark
 

Data science is as much about manipulating data as it is about fitting models to data. Data rarely arrives in a form that we can directly feed into the statistical models or machine learning algorithms we want to analyze them with. The first stages of data analysis are almost always figuring out how to load the data into R and then figuring out how to transform it into a shape you can readily analyze.

Data Already in R

There are some data sets already built into R or available in R packages. Those are useful for learning how to use new methods—if you already know a data set and what it can ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Beginning Data Science in R: Data Analysis, Visualization, and Modelling for the Data Scientist

Beginning Data Science in R: Data Analysis, Visualization, and Modelling for the Data Scientist

Thomas Mailund

Publisher Resources

ISBN: 9781484281550Purchase LinkPublisher Website