June 2020
Intermediate to advanced
576 pages
15h 41m
English
This chapter covers
Spark is fast. It processes data easily across multiple nodes in a cluster or on your laptop. Spark also loves memory. That’s a key design for Spark’s performance. However, as your datasets grow from the sample that you use to develop applications to production datasets, you may feel that performance is going down.
In this chapter, you’ll get some foundational knowledge about how Spark uses memory. This knowledge will help you in optimizing ...
Read now
Unlock full access