Design patterns and techniques

In this section, we'll outline some design patterns and general techniques for use when writing your own analytics. These are a collection of hints and tips that represent the accumulation of experiences working with Spark. They are offered up as guidelines for effective Spark analytic authoring. They also serve as a reference for when you encounter the inevitable scalability problems and don't know what to do.

Spark APIs

Problem

With so many different sets of API's and functions to choose from, it's difficult to know which ones are the most performant.

Solution

Apache Spark currently has over one thousand contributors, many of whom are highly experienced world-class software professionals. It is a mature framework having ...

Get Mastering Spark for Data Science now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.