R. IlijasonBeginning Apache Spark Using Azure Databrickshttps://doi.org/10.1007/978-1-4842-5781-4_6

6. Querying Data Using SQL

Robert Ilijason¹

(1)

Viken, Sweden

Finally! We have our data loaded and ready in Databricks – multiple exciting datasets to investigate. Now it’s time to start playing around with them. We’ll start by using one of the oldest data languages around.

In this chapter, we’ll look at Spark SQL and how it works in Databricks. We’ll pull data from tables, filter it according to our needs, and look at aggregation functions to investigate our datasets. We’ll also look at how Delta Lake pushes Spark SQL and Databricks closer to the classic database feature set and what that means for manipulating data in place. ...

Get Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud by Robert Ilijason

6. Querying Data Using SQL

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly