Beginning Data Science in R 4: Data Analysis, Visualization, and Modelling for the Data Scientist

by Thomas Mailund
June 2022
Beginner
528 pages
10h 39m
English
Apress
© Thomas Mailund 2022
T. Mailund, Beginning Data Science in R 4, https://doi.org/10.1007/978-1-4842-8155-0_5

5. Working with Large Data Sets

Thomas Mailund, Aarhus, Denmark

The concept of Big Data refers to enormous data sets: sets so large that you need data warehouses to store them, and where you typically need sophisticated algorithms to handle the data and distributed computations to get anywhere with it. At the very least, we are talking about many gigabytes of data, and often terabytes or exabytes.

Dealing with Big Data is also part of data science, but it is beyond the scope of this book. This chapter is about large data sets and how to deal with data that slows down your analysis; it is not about data sets so large that you cannot analyze them on your desktop ...


Publisher Resources

ISBN: 9781484281550