Chapter 2: Data Science, Statistics, and Time-Series

Patrick BangertArtificial Intelligence Team, Samsung SDSA, San Jose, CA, United Statesalgorithmica technologies GmbH, Küchlerstrasse 7, Bad Nauheim, Germany

Abstract

Most of the effort in a data science project lies in getting a clean, representative, informative dataset. This chapter discusses all the relevant steps in getting to this point. A brief discussion of measuring and storing data in control systems and historians starts us off and we observe along the way that all data is uncertain to some degree. Time-series have an inherent time scale and may be correlated with each other. These relationships can be encoded in a mathematical representation of the process that made the data, i.e. ...

Get Machine Learning and Data Science in the Oil and Gas Industry now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.