Finally! We have our data loaded and ready in Databricks – multiple exciting datasets to investigate. Now it’s time to start playing around with them, and we’ll begin with one of the oldest data languages around: SQL.
In this chapter, we’ll look at Spark SQL and how it works in Databricks. We’ll pull data from tables, filter it according to our needs, and use aggregation functions to investigate our datasets. We’ll also see how Delta Lake pushes Spark SQL and Databricks closer to the classic database feature set and what that means for manipulating data in place. ...