A mini project on AWS Data Lake

In this section, we will build a completely new data lake using the AWS Data Lake solution. We will first review the business use case; then we will build the cluster, ingest data, and process the data; finally, we will analyze it using QuickSight.

Mini use case business context

For this mini use case, we are going to analyze air quality data from various states in the USA and see whether there is any relationship between population trends and air quality over time. Let's review the source datasets for this project.

Air quality index

The Environmental Protection Agency (EPA) calculates the air quality index based on the concentration of pollutants. The following are the key measures tracked to determine air quality: ...

Get Effective Business Intelligence with QuickSight now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.