Skip to Content
Scripting with Python and SQL for Data Engineering
on-demand course

Scripting with Python and SQL for Data Engineering

with Alfredo Deza
August 2023
Intermediate
3h 26m
English
Pragmatic AI Labs
Closed Captioning available in German, English, Spanish, French, Italian, Japanese

Overview

Scripting with Python and SQL for Data Engineering

Learn Python data structures, web scraping, SQL, and MySQL from the ground up. Master essential skills for collecting, storing, and working with data.

In this hands-on course for beginners, you'll learn how to:

  • Store and manipulate data using Python lists, dictionaries, JSON
  • Write reusable scripts to process data
  • Connect Python to databases like SQLite and MySQL
  • Query, import, and export data with SQL
  • Scrape and parse websites using Beautiful Soup and Scrapy
  • Persist scraped data to databases and files

You'll use the following example repositories to practice:

Key topics include:

  • Mapping, iterating, and persisting data structures
  • Creating modules, scripts, and workflows in Python
  • SQL essentials - queries, statements, aggregations
  • Setting up connections from Python to SQLite and MySQL
  • Scraping data locally and at scale with spiders
  • Storing scraped data to optimize pipelines

You'll build your data wrangling skills through practical examples and hands-on coding exercises in every lesson. By the end of the course, you'll have experience building end-to-end data engineering scripts.

Whether you're a beginner looking to learn Python and SQL, or want to develop robust data engineering skills, this course will get you started. Enroll now and start collecting, storing, and working with data using Python and SQL

You'll gain hands-on experience building Python scripts and SQL queries for common data engineering tasks.

This course is divided in 4 weeks:

Week 1 Working with Data in Python

By the end of Week 1 you'll be able to:

  • Apply Python data structures like lists, dicts
  • Extract data from sources like CSV, JSON
  • Load and persist data using JSON

Lesson 1: Data Structures in Python

Lesson Outline

  • Lists, tuples, dictionaries
  • Working with pandas DataFrames
  • Loading data files like CSV into data structures

Lesson 2: Reading and Writing Data

Lesson Outline

  • Reading and writing CSV files
  • Serializing Python objects with JSON
  • Parsing and dumping JSON data

Lesson 3: Persisting and Loading Data in Python

Lesson Outline

  • Loading data from files
  • Saving data from Python to disk
  • Loading and saving data to JSON

Week 2 Python Scripting and SQL

By the end of Week 2 you'll be able to:

  • Write reusable Python scripts
  • Use SQLite to persist data
  • Query SQLite databases with Python

Lesson 1: Python Scripting Techniques

Lesson Outline

  • Writing modular, reusable Python scripts
  • Exception handling and logging
  • Python virtual environments

Lesson 2: Python with SQLite

Lesson Outline

  • Creating SQLite databases from Python
  • Writing tables with SQLAlchemy
  • Querying SQLite from Python with SQLAlchemy

Week 3 Learning Objectives

By the end of Week 3 you'll be able to:

  • Scrape and collect data from websites
  • Build scalable scraping scripts
  • Persist scraped data to files/databases

Lesson 1: Web Scraping with Python

Lesson Outline

  • HTML parsing and structure
  • Using Beautiful Soup for scraping
  • Storing scraped data in Python

Lesson 2: Scalable Web Scraping

Lesson Outline

  • Scraping best practices
  • Scaling scraping with multiprocessing
  • Storing scraped data in databases

Week 4 Learning Objectives

By the end of Week 4 you'll be able to:

  • Connect to MySQL from Python
  • Execute SQL statements and queries
  • Import and export data from MySQL

Lesson 1: Python and MySQL

Lesson Outline

  • Installing MySQL and configuration
  • Connecting Python to MySQL
  • Executing queries and statements

Lesson 2: Running SQL queries from VSCode

  • Use Visual Studio Code to build SQL queries
  • Execute and review SQL queries from Visual Studio Code

Lesson 3: Importing and Exporting Data

Lesson Outline

  • Loading and exporting CSV data
  • Best practices for moving data into MySQL
  • Automating data imports with Python

About your instructor

Alfredo Deza has over a decade of experience as a Software Engineer doing DevOps, automation, and scalable system architecture. Before getting into technology he participated in the 2004 Olympic Games and was the first-ever World Champion in High Jump representing Peru. He currently works in Developer Relations at Microsoft and is an Adjunct Professor at Duke University. This solid background in technology and teaching, including his experience teaching and authoring content about DevOps and MLOps will give you everything you need to get started applying these powerful concepts.

Resources

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Watch now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Beginner's Guide to Streamlit with Python: Build Web-Based Data and Machine Learning Applications

Beginner's Guide to Streamlit with Python: Build Web-Based Data and Machine Learning Applications

Sujay Raghavendra
Data Engineering with Python and AWS Lambda LiveLessons

Data Engineering with Python and AWS Lambda LiveLessons

Noah Gift, Robert Jordan, Kennedy Behrman
Building ETL Pipelines with Python

Building ETL Pipelines with Python

Brij Kishore Pandey, Emily Ro Schoof

Publisher Resources

ISBN: 28190528VIDEOPAIML