© Subhashini Chellappan, Dharanitharan Ganesan 2018
Subhashini Chellappan and Dharanitharan Ganesan, Practical Apache Spark, https://doi.org/10.1007/978-1-4842-3652-9_5

5. Introduction to Spark Streaming

Subhashini Chellappan (1) and Dharanitharan Ganesan (2)
(1) Bangalore, India
(2) Krishnagiri, Tamil Nadu, India

In Chapter 4 we discussed how to process structured data using DataFrames, Spark SQL, and Datasets.

The recommended background for this chapter is some prior experience with Scala.

In this chapter, we focus on real-time processing using Apache Spark. We cover the following areas:
  • Data processing.
  • Streaming data.
  • Why streaming data are important.
  • Introduction to Spark Streaming.
  • Spark Streaming example using TCP Socket.
  • Stateful streaming. ...
