Chapter 4

Imprinting Your Data Lake on a Reference Architecture

IN THIS CHAPTER

  • Using a data lake reference architecture
  • Complying with reference architecture principles
  • Matching your analytical and data needs with the right reference architecture
  • Dealing with existing data warehouses and data marts

Building your organization’s data lake can seem like an overwhelming proposition, with dozens of moving parts. Where and how do you even get started?

Fortunately, you don’t need to start from a clean slate. Begin your data lake adventures by following a relevant reference architecture, which will guide you with options and ideas for:

  • Bringing data into your data lake
  • Deciding what type(s) of data storage platforms make sense for your organization
  • Specifying how your business users will interact with the data lake
  • Deciding how (or if) you should incorporate existing data warehouses and data marts into your data lake ecosystem
  • Incorporating external data along with your data lake contents into your analytics
  • Allocating your enterprise analytics among your data lake and ...
