Streaming engine
The Apache Spark streaming component is an integral part of the framework. It does not require any specific installation or configuration. Apache Spark In-memory capabilities are a good solution to problems dealing with large scale real-time processing.
There are numerous articles and books related to the Apache Spark streaming library. This section introduces some basic concepts in the context of machine learning algorithms.
Why streaming?
Many applications require real-time or pseudo real-time processing of data from weather reporting, automated manufacturing processing, ATMs, advertising targeting, to financial markets analysis. The implementation of such systems is challenging because of its stringent requirements:
- Low latency ...
Get Scala for Machine Learning - Second Edition now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.