Sourcing Insightful Data and Data Extraction Strategies

After reviewing the fundamentals of ETL pipelines and introducing powerful ETL modules in Python for creating data pipelines, we’re now ready to write some code.

During this chapter, we will hop into the specifics of dealing with sourcing data from different types of systems and look at why combining various data sources within your pipeline is the secret sauce to successful data projects. Welcome to the beginning of the action!

In this chapter, we will cover the following topics:

  • What is data sourcing?
  • Accessibility to data
  • Types of data sources
  • Getting started with data extraction
  • Creating a data extraction pipeline using Python

Technical requirements

You will find all code files ...

Get Building ETL Pipelines with Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.