Overview
This book guides you through building modern, scalable ETL pipelines using Python. You'll explore practical techniques and best practices for every step: extracting data from various sources, transforming it effectively, and loading it into your desired destination.
What this book will help me do
- Set up a Python environment tailored for data pipeline development.
- Develop maintainable ETL pipelines using Python with functional and object-oriented programming.
- Implement CI/CD practices for smooth, automated deployments.
- Leverage Python libraries and AWS tools to enhance scalability and resilience.
- Understand testing strategies to ensure robust and error-free pipelines.
Author(s)
Brij Kishore Pandey and Emily Ro Schoof are experienced software engineers specializing in data engineering and process automation. Their combined expertise is reflected in a writing style that is both technically authoritative and accessible. Dedicated to fostering practical learning, they provide a hands-on approach that equips readers with essential skills.
Who is it for?
This book is ideal for data engineers and software professionals who wish to effectively build ETL pipelines using Python. A foundational understanding of Python programming is recommended. It suits those aiming to scale their data processing workflows and adopt best practices for enterprise-ready systems.