Chapter 1. Data Ingestion and Transformation

The ability to efficiently collect, ingest, and transform data from diverse sources is crucial for driving valuable insights and well-informed decisions. Organizations must constantly expand their data horizons by ingesting streaming data, data from databases, data in the form of files, integrating SaaS applications, handling on-premises files, bringing third party datasets etc - and then applying the necessary transformations to make this data useful for downstream analytics.

For data engineers, optimizing this end-to-end data ingestion and transformation process is a core responsibility. They are often tasked with building reliable pipelines that power an organization’s data-driven initiatives. However, selecting the appropriate data ingestion and transformation solutions is critical, as different data sources, volumes, velocities, and transformation needs may necessitate the use of various AWS services and approaches.

In this chapter, you will ...

Get AWS Certified Data Engineer Associate Study Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.