July 2017
Beginner to intermediate
378 pages
10h 26m
English
In this chapter, we reviewed how to increase data scientist productivity by reducing time spent manipulating data. The concept of LAD was introduced, along with a process to build them.
Concepts to help prevent data lakes from turning into a data swamp was discussed. Changing perspectives from a data lake to a series of active refineries and production chains helps to plan for a useful data storage area.
Strategies for data retention were also reviewed. Consider the twin goals of maximizing value while minimizing costs when designing a retention strategy. In the next chapter, we will discuss the economics of IoT analytics. The chapter covers ways to look for business value in the data along with a detailed predictive maintenance example. ...