March 2019
Beginner to intermediate
182 pages
4h 6m
English
In this chapter, we'll learn how to manipulate DataFrames with Spark SQL schemas, and use the Spark DSL to build queries for structured data operations. By now we have already learned to get big data into the Spark Environment using RDDs and carried out multiple operations on that big data. Let us now look that how to manipulate our DataFrames and build queries for structured data operations.
In particular, we will cover the following topics:
Read now
Unlock full access