July 2024
Intermediate to advanced
296 pages
7h 4m
English
Apache Airflow has become the de facto standard for building, monitoring, and maintaining data pipelines. As data volumes and complexity grow, the need for robust and scalable orchestration is paramount. In this chapter, we will cover the fundamentals of Airflow – installing it locally, exploring its architecture, and developing your first Directed Acyclic Graphs (DAGs).
We will start by spinning up Airflow using Docker and the Astro CLI. This will allow you to get hands-on without the overhead of a full production installation. Next, we’ll get to know Airflow’s architecture and its key components, such as the scheduler, workers, and metadata database.
Moving on, you’ll create your first DAG – the core ...
Read now
Unlock full access