8
Powerful ETL Libraries and Tools in Python
Up to this point in the book, we have covered the fundamentals of building data pipelines. We’ve introduced some of Python’s most common modules that can be utilized to establish rudimentary iterations of data pipelines. While this is a great place to start, these methods are far from the most realistic approach; there is no lack of space for improvement. There are several powerful, ETL-specific Python libraries and pipeline management platforms that we can use to our advantage to make more durable, scalable, and resilient data pipelines suitable for real-world deployment scenarios.
We will divide this chapter into two parts. We start by introducing six of Python’s most popular ETL pipeline libraries. ...
Get Building ETL Pipelines with Python now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.