In this chapter, we will look at the following key topics:
- Analytics using Amazon Athena
- Analytics using Presto, Trino, and Hive on Amazon EMR
One of the fundamental principles of building a modern data architecture on AWS is hinged around using purpose-built tools for solving specific use cases. An enterprise data platform once fully built has many components, each with a specific purpose for solving a particular business use case.
In Chapter 2, Scalable Data Lakes, we went through the fundamentals of building a data lake on AWS using Amazon S3 as the storage layer and the AWS Glue Data Catalog as the technical metadata layer. Each layer of the data lake has data that may be of use to different personas in an organization. ...