© Thomas Mailund 2017

Thomas Mailund, Beginning Data Science in R, 10.1007/978-1-4842-2671-1_2

2. Reproducible Analysis

Thomas Mailund

(1)Aarhus, Denmark

The typical data analysis workflow looks like this: you collect your data and you put it in a file or spreadsheet or database. Then you run some analyses, written in various scripts, perhaps saving some intermediate results along the way or maybe always working on the raw data. You create some plots or tables of relevant summaries of the data, and then you go and write a report about the results in a text editor or word processor. It is the typical workflow. Most people doing data analysis do this or variations thereof. But it is also a workflow that has many potential problems.

There is a separation ...

Get Beginning Data Science in R: Data Analysis, Visualization, and Modelling for the Data Scientist now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.