Chapter 10

Rowing End-to-End across the Data Lake

IN THIS CHAPTER

Deciding what to keep from your current data services portfolio

Launching your data lake

Loading up your data lake

Adding a sandbox to your data lake

The bronze zone. The gold zone. And in between the two, the silver zone. Oh, and don’t forget about the sandbox.

When you examine the layers and zones of a data lake one by one, you get a pretty good idea of the roles and responsibilities of each one. But until you take an end-to-end look at a data lake, you don’t necessarily gain a full understanding of the flows of data from their sources into the data lake and then all the way through the pipelines that you’ll be constructing.

So, settle in for an end-to-end example of a hospital data lake.

Remember: Before you dive deep into the hundreds or even thousands of data lake–related services and products at your disposal, you want to construct your data lake at the conceptual level. In other words, focus first on the forest that’s ...
