2

Understanding Apache Spark and Its Applications

With the advent of machine learning and data science, the world is seeing a paradigm shift. A tremendous amount of data is being collected every second, and it’s hard for computing power to keep up with this pace of rapid data growth. To make use of all this data, Spark has become a de facto standard for big data processing. Migrating data processing to Spark is not only a question of saving resources that will allow you to focus on your business; it’s also a means of modernizing your workloads to leverage the capabilities of Spark and the modern technology stack to create new business opportunities.

In this chapter, we will cover the following topics:

  • What is Apache Spark?
  • Why choose Apache ...

Get Databricks Certified Associate Developer for Apache Spark Using Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.