November 2017
Beginner to intermediate
290 pages
7h 34m
English
Machine learning (ML) has become a popular topic, so let's consider how stream processing and machine learning can work together. The Apex approach in this area has been to let the data scientist community work with the tooling of their choice (which often is Python or R), and focus on the data engineering aspects of reliable and efficient production pipelines. Instead of reinventing ML libraries as part of the stack, the approach therefore is ML integration.
Apache SAMOA (https://samoa.incubator.apache.org/) explores distributed online machine learning (the training of a model with continuously arriving data). SAMOA aims to allow for ML development without having to focus on the intricacies of the underlying ...
Read now
Unlock full access