Spark Streaming and Machine Learning Library

In this chapter, we will cover the following recipes:

  • Structured streaming for near real-time machine learning
  • Streaming DataFrames for real-time machine learning
  • Streaming Datasets for real-time machine learning
  • Streaming data and debugging with queueStream
  • Downloading and understanding the famous Iris data for unsupervised classification
  • Streaming KMeans for a real-time online classifier
  • Downloading wine quality data for streaming regression
  • Streaming linear regression for a real-time regression
  • Downloading Pima Diabetes data for supervised classification
  • Streaming logistic regression for an on-line classifier

Get Apache Spark 2.x Machine Learning Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.