© Subhashini Chellappan, Dharanitharan Ganesan 2018
Subhashini Chellappan and Dharanitharan Ganesan, Practical Apache Spark, https://doi.org/10.1007/978-1-4842-3652-9_5

5. Introduction to Spark Streaming

Subhashini Chellappan (1) and Dharanitharan Ganesan (2)
(1) Bangalore, India
(2) Krishnagiri, Tamil Nadu, India

In Chapter 4 we discussed how to process structured data using DataFrames, Spark SQL, and Datasets.

The recommended background for this chapter is some prior experience with Scala.

This chapter focuses on real-time processing with Apache Spark and covers the following areas:
  • Data processing.
  • Streaming data.
  • Why streaming data are important.
  • Introduction to Spark Streaming.
  • Spark Streaming example using TCP Socket.
  • Stateful streaming. ...
