Data Engineering with Databricks Cookbook

by Pulkit Chadha
May 2024
Beginner to intermediate
438 pages
9h 41m
English
Packt Publishing

7

Performance Tuning in Delta Lake

Delta Lake is an open source storage layer that brings ACID transactions, reliable data versioning, and schema evolution to data lakes. This chapter covers several techniques for optimizing query performance in Delta Lake: tuning table partitioning, caching tables for fast query response, organizing data with Z-ordering, skipping data for faster query execution, and reducing table size and I/O cost with compression.

We will cover the following recipes in this chapter:

  • Optimizing Delta Lake table partitioning for query performance
  • Organizing data with Z-ordering for efficient query execution
  • Skipping data for faster query execution
  • Reducing Delta Lake table size ...
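
To make the Z-ordering and data-skipping recipes more concrete, here is a minimal conceptual sketch in plain Python. It is not Delta Lake's implementation and uses no Delta Lake API. It assumes a toy table with two integer columns, x and y, split into fixed-size "files", where each file's minimum and maximum values decide whether a filter can skip it. The names Row, z_value, write_files, and files_scanned are hypothetical helpers introduced only for this illustration.

```python
from typing import List, Optional, Tuple

Row = Tuple[int, int]  # (x, y): two hypothetical integer columns


def z_value(row: Row, bits: int = 8) -> int:
    """Interleave the bits of x and y into a single Z-order (Morton) code."""
    x, y = row
    code = 0
    for i in range(bits):
        code |= ((x >> i) & 1) << (2 * i)      # x bits go to even positions
        code |= ((y >> i) & 1) << (2 * i + 1)  # y bits go to odd positions
    return code


def write_files(rows: List[Row], rows_per_file: int, zorder: bool) -> List[List[Row]]:
    """Sort rows (by Z-order code, or by (x, y) when zorder is False) and chunk them into files."""
    ordered = sorted(rows, key=z_value if zorder else None)
    return [ordered[i:i + rows_per_file] for i in range(0, len(ordered), rows_per_file)]


def files_scanned(files: List[List[Row]], x: Optional[int] = None, y: Optional[int] = None) -> int:
    """Count files whose min/max ranges could contain the equality filter; the rest are skipped."""
    count = 0
    for rows in files:
        xs = [r[0] for r in rows]
        ys = [r[1] for r in rows]
        x_ok = x is None or min(xs) <= x <= max(xs)
        y_ok = y is None or min(ys) <= y <= max(ys)
        if x_ok and y_ok:
            count += 1
    return count


# An 8 x 8 grid of rows written as four files of 16 rows each.
rows = [(x, y) for x in range(8) for y in range(8)]
linear = write_files(rows, 16, zorder=False)
zordered = write_files(rows, 16, zorder=True)

print("sorted by (x, y): x = 3 scans", files_scanned(linear, x=3), "files")
print("sorted by (x, y): y = 3 scans", files_scanned(linear, y=3), "files")
print("Z-ordered:        x = 3 scans", files_scanned(zordered, x=3), "files")
print("Z-ordered:        y = 3 scans", files_scanned(zordered, y=3), "files")
```

In this toy layout, sorting by (x, y) lets a filter on x skip most files, but a filter on y must read every file, because each file spans the full range of y. Ordering by the interleaved code keeps both columns' ranges narrow within each file, so a filter on either column skips about half of the files.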

Publisher Resources

ISBN: 9781837633357