11
Using Python with Glue
In this chapter, we are going to learn how to create a data integration pipeline with AWS Glue. AWS Glue is a data integration service that is used for the Extract, Transform, and Load (ETL) process. Glue is a serverless data integration service; therefore, you don’t need to create and manage a server, as the infrastructure is managed by AWS. With Glue, you can collect data from different data sources, such as S3, databases, or filesystems, to process and transform the data. The result is stored in S3 or the database, or you can call an API.
The chapter covers the following topics:
- What is the AWS Glue service?
- AWS Glue service creation
- Creating a simple Python application with AWS Glue
What is the AWS Glue service? ...
Get Python Essentials for AWS Cloud Developers now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.