Data Lake Development with Big Data

by Pradeep Pasupuleti, Beulah Salome Purra
November 2015
Content level: Beginner to intermediate
164 pages
4h 10m
English
Packt Publishing

Chapter 4. Data Discovery and Consumption

In the previous chapters, we discussed the Data Intake and Data Management tiers. During intake, data is ingested from disparate sources and stored in the Raw Zone. The Data Management tier then profiles and validates the data; integrates, cleanses, standardizes, and enriches it; and places it in the Data Hub Zone.
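
The flow recapped above, from the Raw Zone through the Data Management tier into the Data Hub Zone, can be pictured with a minimal Python sketch. It is only an illustration under stated assumptions: the `DataLake` class, the function names, and the validation and standardization rules are hypothetical and are not taken from the book, which describes these tiers at the architecture level.

```python
from dataclasses import dataclass, field


@dataclass
class DataLake:
    """Toy in-memory stand-in for the two zones named in the text (hypothetical)."""
    raw_zone: list = field(default_factory=list)
    data_hub_zone: list = field(default_factory=list)


def ingest(lake, records):
    """Data Intake: store source records unchanged in the Raw Zone."""
    lake.raw_zone.extend(records)


def is_valid(record):
    """Assumed validation rule: a record needs a non-empty 'id'."""
    return bool(record.get("id"))


def standardize(record):
    """Assumed standardization rule: trim whitespace from string values."""
    return {
        key: value.strip() if isinstance(value, str) else value
        for key, value in record.items()
    }


def manage(lake):
    """Data Management: validate and standardize Raw Zone records,
    then place the results in the Data Hub Zone."""
    for record in lake.raw_zone:
        if is_valid(record):
            lake.data_hub_zone.append(standardize(record))


lake = DataLake()
ingest(lake, [{"id": "A1", "city": "  Pune "}, {"id": "", "city": "Delhi"}])
manage(lake)
print(lake.data_hub_zone)
```

In this sketch the Raw Zone keeps both source records untouched, while only the record that passes validation reaches the Data Hub Zone in standardized form. Profiling, integration, and enrichment are omitted for brevity.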

Let us now see how this data can be discovered, packaged, and provisioned so that downstream systems can consume it. Data Consumption comprises Data Discovery and Data Provisioning. This chapter covers the following topics:

  • The process of enabling discovery in the Data Lake
  • The various Data Discovery functionalities
  • The important ...


Publisher Resources

ISBN: 9781785888083