March 2022
Beginner to intermediate
456 pages
13h
English
This chapter covers
GroupedData transitional object
GroupedData with an aggregation method
null values in your data frame

In chapter 4, we looked at how to transform a data frame by selecting, dropping, creating, renaming, reordering, and summarizing columns. Those operations constitute the foundation for working with a data frame in PySpark. In this chapter, I will complete the review of the most common operations you will perform on a data frame: linking or joining data frames, as well as grouping data (and performing operations on the