9

CI/CD with GitHub

In this chapter, we introduce Continuous Integration/Continuous Delivery (CI/CD) and show how to apply it to your Scala data engineering project using GitHub. CI/CD is a set of best practices and tools that automate the development, testing, and deployment of data pipelines and workflows. It combines continuous integration of code changes, automated testing, version control, and continuous delivery of pipeline deployments. The goal of CI/CD is to streamline the development process by enabling rapid iteration, reducing errors, and ensuring consistent quality.

We will cover the following main topics:

  • Introducing CI/CD and GitHub
  • Working with GitHub
  • Deploying pipelines
  • Understanding GitHub Actions

Technical requirements ...
