Data Engineering with Databricks Cookbook

by Pulkit Chadha
May 2024
Beginner to intermediate
438 pages
9h 41m
English
Packt Publishing

8

Orchestration and Scheduling Data Pipeline with Databricks Workflows

Databricks Workflows automates and orchestrates data processing tasks on the Databricks platform. A workflow is a sequence of tasks, linked by dependencies, that you define through the Databricks Jobs API or the Databricks UI. Workflows can also include conditional logic, loops, and branching to handle complex scenarios.
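To make the idea of a workflow as a set of dependent tasks concrete, here is a minimal sketch that builds a request body shaped like a Jobs API 2.1 `jobs/create` call and derives a valid run order from its `depends_on` entries. The job name, task keys, notebook paths, and the helper functions `build_job_payload` and `execution_order` are all hypothetical, chosen for illustration. Compute settings are omitted, and nothing is sent to a workspace.

```python
import json
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def build_job_payload(name, tasks, cron=None, timezone="UTC"):
    """Build a dict shaped like a Jobs API 2.1 `jobs/create` request body.

    `tasks` maps a task key to (notebook_path, [upstream task keys]).
    Cluster/compute settings are deliberately left out of this sketch.
    """
    payload = {
        "name": name,
        "tasks": [
            {
                "task_key": key,
                "notebook_task": {"notebook_path": path},
                "depends_on": [{"task_key": up} for up in upstream],
            }
            for key, (path, upstream) in tasks.items()
        ],
    }
    if cron is not None:
        # Job schedules use Quartz cron syntax plus a timezone ID.
        payload["schedule"] = {
            "quartz_cron_expression": cron,
            "timezone_id": timezone,
        }
    return payload


def execution_order(payload):
    """Return task keys in an order that respects every `depends_on` edge."""
    graph = {
        task["task_key"]: {dep["task_key"] for dep in task["depends_on"]}
        for task in payload["tasks"]
    }
    return list(TopologicalSorter(graph).static_order())


# Hypothetical three-step pipeline: ingest, then transform, then validate.
job = build_job_payload(
    "nightly_etl",
    {
        "ingest": ("/Pipelines/ingest", []),
        "transform": ("/Pipelines/transform", ["ingest"]),
        "validate": ("/Pipelines/validate", ["transform"]),
    },
    cron="0 0 2 * * ?",  # every day at 02:00
)

print(json.dumps(job, indent=2))
print(execution_order(job))
```

The same structure can be created in the UI by adding tasks and setting each task's "Depends on" field. The dictionary form is useful when job definitions need to live in version control.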

Databricks Workflows can help you achieve various goals, such as the following:

  • Running data pipelines or ETL processes on a regular basis or in response to events
  • Training and deploying machine learning models in a scalable and reproducible way
  • Performing batch or streaming analytics on large datasets
  • Testing and validating data quality and integrity
  • Generating ...

ISBN: 9781837633357