Chapter 12

Building Data Lakes in Amazon Web Services

IN THIS CHAPTER

check Inventorying Amazon tools and services for data lakes

check Mapping your conceptual data lake architecture to Amazon components

check Assembling Amazon components for your data lake pipelines

Building your data lake in Amazon Web Services (AWS) can sometimes feel like hacking your way through a jungle. (Get it? Amazon? A jungle?) You have dozens of services that you can stitch together for ingesting and then storing your data, as well as transforming and then moving your data along pipelines and then finally consuming data to drive decisions and actions.

Fortunately, you can follow some basic patterns with AWS that will help guide your pathway to the data lake if you elect to go down the AWS path.

The Elite Eight: Identifying the Essential Amazon Services

If you follow American college basketball, the term March Madness is familiar to you. The (supposedly) best 64 teams begin a tournament that ends up with that season’s U.S. college basketball champion winning the final game.

Along the way, after the first couple of rounds are played, the subsequent rounds are given catchy nicknames, beginning with the “Sweet 16.” Teams ...

Get Data Lakes For Dummies now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.