Table of Contents
Preface
Part 1: Introduction to ETL, Data Pipelines, and Design Principles
1
A Primer on Python and the Development Environment
Introducing Python fundamentals
An overview of Python data structures
Python if…else conditions or conditional statements
Python looping techniques
Python functions
Object-oriented programming with Python
Working with files in Python
Establishing a development environment
Version control with Git tracking
Documenting environment dependencies with requirements.txt
Utilizing module management systems (MMSs)
Configuring a Pipenv environment in PyCharm
Summary
2
Understanding the ETL Process and Data Pipelines
What is a data pipeline?
How do we create a robust pipeline?
Pre-work – understanding your data ...