Classification using Massive Online Analysis

Massive Online Analysis (MOA), as discussed in Chapter 2, Java Libraries and Platforms for Machine Learning, is another library that can be used for classification. It is designed mainly to work with data streams. A stream can deliver a very large, potentially unbounded amount of data, so how do we evaluate the model? In traditional batch learning, we usually divide the data into training and test sets, and cross-validation is preferred when the data is limited. In stream processing, where the data is effectively unlimited, cross-validation becomes too expensive. Two approaches that we can use are as follows:

  • Holdout: This is useful when the data is already divided into two predefined parts. ...
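
The holdout idea can be illustrated with a minimal sketch in plain Java. This is not MOA's API: the class HoldoutStreamDemo, the Instance and MajorityClassLearner types, and the synthetic stream (every fourth instance labeled 0, the rest labeled 1) are all hypothetical names and data invented for illustration. The sketch assumes the first part of the stream is the predefined test set, trains an incremental majority-class baseline on the instances that follow, and then measures accuracy on the held-out part.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: plain Java, not MOA classes. All names are hypothetical.
class HoldoutStreamDemo {

    /** A labeled example; the baseline learner below ignores the features. */
    static final class Instance {
        final double[] features;
        final int label;

        Instance(double[] features, int label) {
            this.features = features;
            this.label = label;
        }
    }

    /** Incremental baseline: predicts the label seen most often so far. */
    static final class MajorityClassLearner {
        private final int[] counts;

        MajorityClassLearner(int numClasses) {
            counts = new int[numClasses];
        }

        void train(Instance inst) {
            counts[inst.label]++;
        }

        int predict(Instance inst) {
            int best = 0;
            for (int c = 1; c < counts.length; c++) {
                if (counts[c] > counts[best]) {
                    best = c;
                }
            }
            return best;
        }
    }

    /** Synthetic stream (an assumption): every fourth instance has label 0. */
    static Instance nextInstance(int i) {
        int label = (i % 4 == 0) ? 0 : 1;
        return new Instance(new double[] { i }, label);
    }

    /** Holds out the first holdoutSize instances, trains on the next trainSize. */
    static double holdoutAccuracy(int holdoutSize, int trainSize) {
        // Predefined test part: set aside and never used for training.
        List<Instance> holdout = new ArrayList<>();
        for (int i = 0; i < holdoutSize; i++) {
            holdout.add(nextInstance(i));
        }

        // Training part: instances are consumed one at a time, as in a stream.
        MajorityClassLearner learner = new MajorityClassLearner(2);
        for (int i = holdoutSize; i < holdoutSize + trainSize; i++) {
            learner.train(nextInstance(i));
        }

        // Evaluate the trained model on the held-out part only.
        int correct = 0;
        for (Instance inst : holdout) {
            if (learner.predict(inst) == inst.label) {
                correct++;
            }
        }
        return (double) correct / holdout.size();
    }

    public static void main(String[] args) {
        System.out.println(holdoutAccuracy(1000, 10000)); // prints 0.75
    }
}
```

Because the learner only ever predicts the majority label, its holdout accuracy equals the share of that label in the test part (750 of 1,000 here). The point of the sketch is the split itself: the model never sees the held-out instances during training, and the whole evaluation costs one pass over the stream.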
