© Hien Luu 2018
Hien LuuBeginning Apache Spark 2https://doi.org/10.1007/978-1-4842-3579-9_6

6. Spark Streaming

Hien Luu1 
(1)
SAN JOSE, California, USA
 

In addition to batch data processing, streaming data processing has become a must-have capability for any business that wants to harness the value of real-time data to either increase their competitive advantage or to improve their user experience. With the advent of the Internet of Things, the volume and velocity of real-time data have increased even more than before. For Internet companies such as Facebook, LinkedIn, and Twitter, millions of social activities happening every second on their platforms are represented as streaming data.

At a high level, streaming processing is about the continuous processing ...

Get Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.