Data Lake Development with Big Data

by Pradeep Pasupuleti, Beulah Salome Purra
November 2015
Content level: Beginner to intermediate
164 pages
4h 10m
English
Packt Publishing

Data Discovery and metadata

Data Discovery identifies related data assets, makes them discoverable, and guides data consumers to relevant datasets.

The efficiency of Data Discovery depends on the amount and quality of the metadata captured as data moves across the tiers of the Data Lake. Metadata keeps track of every data asset that resides in the Data Lake and helps data consumers find the relevant data. It identifies and maintains relationships between data from the moment the data is ingested and as it is subsequently enhanced, transformed, and evolved. It also guides consumers to related datasets that can be combined and integrated.
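To make the role of metadata more concrete, the following is a minimal sketch of a metadata catalog, not the book's implementation. It assumes a simple in-memory design: the names `DatasetRecord`, `MetadataCatalog`, `register`, `lineage`, and `related`, the tier labels, and the sample datasets are all hypothetical and chosen for illustration only. The sketch records each asset's tier and parent datasets as it is registered, then uses that metadata to trace lineage and to suggest related datasets through shared tags.

```python
from dataclasses import dataclass, field


@dataclass
class DatasetRecord:
    """Hypothetical metadata record for one data asset in the lake."""
    name: str
    tier: str  # assumed tier labels, e.g. "raw", "enhanced", "transformed"
    tags: set = field(default_factory=set)
    derived_from: list = field(default_factory=list)  # names of parent datasets


class MetadataCatalog:
    """Illustrative in-memory catalog; a real lake would persist this."""

    def __init__(self):
        self._records = {}

    def register(self, name, tier, tags=(), derived_from=()):
        """Capture metadata for a dataset as it lands in a tier."""
        record = DatasetRecord(name, tier, set(tags), list(derived_from))
        self._records[name] = record
        return record

    def lineage(self, name):
        """Return every ancestor of a dataset, nearest ancestors first."""
        ancestors = []
        queue = list(self._records[name].derived_from)
        while queue:
            parent = queue.pop(0)
            if parent in ancestors:
                continue  # already visited through another path
            ancestors.append(parent)
            if parent in self._records:
                queue.extend(self._records[parent].derived_from)
        return ancestors

    def related(self, name):
        """Return other datasets that share at least one tag, sorted by name."""
        tags = self._records[name].tags
        return sorted(
            other.name
            for other in self._records.values()
            if other.name != name and tags & other.tags
        )


# Sample data, invented for this sketch.
catalog = MetadataCatalog()
catalog.register("orders_raw", "raw", tags={"orders"})
catalog.register("customers_raw", "raw", tags={"customer"})
catalog.register("orders_clean", "enhanced",
                 tags={"orders", "customer"}, derived_from=["orders_raw"])
catalog.register("orders_by_customer", "transformed",
                 tags={"orders", "customer"},
                 derived_from=["orders_clean", "customers_raw"])

print(catalog.lineage("orders_by_customer"))
print(catalog.related("customers_raw"))
```

Here `lineage` answers where a dataset came from, while `related` points a consumer at datasets that could plausibly be combined with it. Matching on shared tags is a deliberately simple stand-in for richer discovery techniques.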

Semantic metadata captures the semantics of the data; semantics is the ability ...

Publisher Resources

ISBN: 9781785888083