Table of Contents
Preface
Part 1: Unleashing Data Wrangling with AWS
1
Getting Started with Data Wrangling
Introducing data wrangling
The 80-20 rule of data analysis
Advantages of data wrangling
The steps involved in data wrangling
Data discovery
Data structuring
Data cleaning
Data enrichment
Data validation
Data publishing
Best practices for data wrangling
Identifying the business use case
Identifying the data source and bringing the right data
Identifying your audience
Options available for data wrangling on AWS
AWS Glue DataBrew
SageMaker Data Wrangler
AWS SDK for pandas
Summary
Part 2: Data Wrangling with AWS Tools
2
Introduction to AWS Glue DataBrew
Why AWS Glue DataBrew?
AWS Glue DataBrew’s basic building blocks
Getting started with AWS ...
Get Data Wrangling on AWS now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.