Chapter 6

Your Data Lake’s Water Treatment Plant: The Silver Zone

IN THIS CHAPTER

  • Cleansing, refining, and enriching your raw data
  • Adding master data into your data lake
  • Coordinating your bronze and silver zones
  • Exploring hierarchical storage for your silver zone data
  • Making silver zone data available for analytics

The silver zone is the unglamorous part of your data lake. It isn’t the landing place where you ingest mountains of data from all over, teasing the possibility of unprecedented insights drawn from all that data. Nor is it where you build and deploy packages of ready-to-consume data tied to specific analytics, which in turn are linked to explicit business objectives; that’s done in your gold zone.

Your silver zone is a gateway between promises and delivery. In fact, you can think of the silver zone as the water treatment plant for your data lake, doing the equivalent of:

  • Cleaning the water
  • Adding fluoride and minerals ...

