What Is a Data Lake?

by Alex Gorelik
November 2020
Content level: Beginner to intermediate
68 pages
1h 44m
English
O'Reilly Media, Inc.

Overview

Data management is undergoing a revolution in how data is collected, stored, processed, governed, managed, and provided to decision makers. The data lake is a popular approach that harnesses the power of big data and marries it with the agility of self-service. This report helps IT executives and data architects focus on the technical aspects of building a data lake for their organization.

Alex Gorelik from Facebook explains the requirements for building a successful data lake that business users can easily access whenever they have a need. You'll learn the phases of data lake maturity, common mistakes that lead to data swamps, and the importance of aligning data with your company's business strategy and gaining executive sponsorship.

You'll explore:

  • The ingredients of modern data lakes, such as the use of different ingestion methods for different data formats, and the importance of the three Vs: volume, variety, and velocity
  • Building blocks of successful data lakes, including data ingestion, integration, persistence, data governance, and business intelligence and self-service analytics
  • State-of-the-art data lake architectures offered by Amazon Web Services, Microsoft Azure, and Google Cloud

Publisher Resources

ISBN: 9781492088899