Skip to Content
Data Pipelines with Apache Airflow
book

Data Pipelines with Apache Airflow

by Julian de Ruiter, Bas Harenslak
May 2021
Beginner to intermediate
480 pages
12h 59m
English
Manning Publications
Content preview from Data Pipelines with Apache Airflow

10 Running tasks in containers

This chapter covers

  • Identifying some challenges involved in managing Airflow deployments
  • Examining how containerized approaches can help simplify Airflow deployments
  • Running containerized tasks in Airflow on Docker
  • Establishing a high-level overview of workflows in developing containerized DAGs

In previous chapters, we implemented several DAGs using different Airflow operators, each specialized to perform a specific type of task. In this chapter, we touch on some of the drawbacks of using many different operators, especially with an eye on creating Airflow DAGs that are easy to build, deploy, and maintain. In light of these issues, we look at how we can use Airflow to run tasks in containers using Docker and ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Data Pipelines with Apache Airflow

Data Pipelines with Apache Airflow

Julian de Ruiter, Bas Harenslak
Kubernetes: Up and Running, 3rd Edition

Kubernetes: Up and Running, 3rd Edition

Brendan Burns, Joe Beda, Kelsey Hightower, Lachlan Evenson

Publisher Resources

ISBN: 9781617296901Supplemental ContentPublisher SupportOtherPublisher WebsiteSupplemental ContentErrata PagePurchase Link