Massive Online Analysis (MOA), as discussed in Chapter 2, Java Libraries and Platforms for Machine Learning, is another library that can be used to achieve classification. It is mainly designed to work with the stream. If it is working with the stream, a lot of data will be there; so, how do we evaluate the model? In the traditional batch learning mode, we usually divide the data into training and test sets and cross-validation is preferred if the data is limited. In stream processing, where the data seems to be unlimited, cross-validation proves to be expensive. Two approaches that we can use are as follows:
- Holdout: This is useful when the data is already divided into two parts, which are predefined. ...