1

Getting Started and Lakehouse Concepts

“Give me six hours to chop down a tree, and I will spend the first four sharpening the axe.”

– Abraham Lincoln

We will start with a basic overview of how the Databricks Data Intelligence (DI) Platform is an open platform built on a lakehouse architecture, and of the advantages this offers when developing machine learning (ML) applications. For brevity, we will use the terms Data Intelligence Platform and Databricks interchangeably throughout the book. This chapter also introduces the projects and associated datasets we’ll use throughout the book. Each project intentionally highlights a function or component of the DI Platform. Use the example projects as hands-on lessons for each platform element we cover. We ...

