Skip to Content
Learn AWS Serverless Computing
book

Learn AWS Serverless Computing

by Scott Patterson
December 2019
Intermediate to advanced
382 pages
9h 43m
English
Packt Publishing
Content preview from Learn AWS Serverless Computing

Data transformation – Glue

Data transformation is the process of taking our raw data from one format and mapping it to a new structure or format that we choose. This process is a fundamental part of all data processing and usually requires a lot of storage space, as well as large-scale computations. We can speed up the transformation process by parallelizing the processing, which is something that Glue can do for us out of the box.

In this section, we are going to create our own transformation process using Glue. We will take the existing knowledge about our data, which can be found in our data catalog, and create a target structure. It will be clear which fields we want to map to our new structure, and we'll also learn a tip about a file ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Practical Amazon EC2, SQS, Kinesis, and S3: A Hands-On Approach to AWS

Practical Amazon EC2, SQS, Kinesis, and S3: A Hands-On Approach to AWS

Sunil Gulabani

Publisher Resources

ISBN: 9781789958355