Data Engineering with Azure Databricks
by Dmitry Foshin, Dmitry Anoshin, Tonya Chernyshova, Sergii Volodarskyi
Preface
Data engineering has changed fast in the last few years. Companies now deal with more data than ever before. They need systems that can handle batch and streaming data, enforce data quality, and scale without breaking. Azure Databricks has become one of the most popular platforms for building these systems.
Azure Databricks combines the power of Apache Spark with the convenience of a managed cloud service on Microsoft Azure. It gives you Delta Lake for reliable data storage, Structured Streaming for real-time processing, and Unity Catalog for data governance. Together, these tools let you build data pipelines that are fast, reliable, and secure.
This book takes you on a hands-on journey through data engineering on Azure Databricks.
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access