7

Structured Streaming in Spark

Data processing has evolved rapidly as data volume and velocity continue to grow, and with that growth the need to analyze real-time data and derive insights from it has become increasingly important. Structured Streaming, a component of Apache Spark, is a framework for processing and analyzing data streams in real time. This chapter explores its capabilities, features, and real-world applications.

In this chapter, we will cover the following topics:

  • Real-time data processing
  • The fundamentals of streaming
  • Streaming architectures
  • Spark Streaming
  • Structured Streaming
  • Streaming sources and sinks
  • Advanced topics in Structured Streaming
  • Joins in ...
