© Ramcharan Kakarla, Sundar Krishnan and Sridhar Alla 2021
R. Kakarla et al.Applied Data Science Using PySparkhttps://doi.org/10.1007/978-1-4842-6500-0_3

3. Utility Functions and Visualizations

Ramcharan Kakarla1  , Sundar Krishnan1 and Sridhar Alla2
Philadelphia, PA, USA
New Jersey, NJ, USA

In this chapter, we will dive into the utility of some of the advanced functions available in PySpark. You are encouraged to read the previous chapter and try the following operations on any dataset of your choice to improve your understanding. This chapter will focus on the windowing functions and other topics that will be useful in the creation and application of Spark programs on large datasets. We will also introduce the visualization and machine learning ...

Get Applied Data Science Using PySpark: Learn the End-to-End Predictive Model-Building Cycle now with the O’Reilly learning platform.

O’Reilly members experience live online training, plus books, videos, and digital content from nearly 200 publishers.