November 2018
Intermediate to advanced
300 pages
7h 42m
English
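Mallet supports reading from a directory with the cc.mallet.pipe.iterator.FileIterator class. A file iterator is constructed with three parameters: an array of directories (File[]) containing the text files, a file filter that selects which files to read within each directory, and a pattern that is applied to each file path to produce the class label.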
Consider data organized into folders, with the documents grouped into five topics (tech, entertainment, politics, sport, and business). Each folder contains the documents on its topic, as shown in the following screenshot:
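To make the folder-as-label idea concrete, here is a minimal plain-Java sketch of what such an iterator does: walk a set of directories, keep the files accepted by a filter, and derive each document's label from the name of its parent folder. It uses only the standard library and is not Mallet's implementation. The class name FolderLabelSketch, the LAST_FOLDER pattern, and the sample file names are assumptions made up for illustration.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import java.util.stream.Stream;

// Illustrative stand-in (not part of Mallet): labels files by their parent folder.
class FolderLabelSketch {

    // Hypothetical pattern: group 1 captures the folder that directly contains the file.
    static final Pattern LAST_FOLDER = Pattern.compile("(?:.*/)?([^/]+)/[^/]+");

    // Returns the label extracted from the file path, or null if the pattern does not match.
    static String labelFor(Path file, Pattern labelPattern) {
        String normalized = file.toString().replace('\\', '/');
        Matcher m = labelPattern.matcher(normalized);
        return m.matches() ? m.group(1) : null;
    }

    // Walks each directory, keeps regular files accepted by the filter, and labels them.
    static Map<Path, String> collect(List<Path> directories,
                                     Predicate<Path> filter,
                                     Pattern labelPattern) throws IOException {
        Map<Path, String> labeled = new LinkedHashMap<>();
        for (Path dir : directories) {
            List<Path> files = new ArrayList<>();
            try (Stream<Path> walk = Files.walk(dir)) {
                walk.filter(Files::isRegularFile).filter(filter).sorted().forEach(files::add);
            }
            for (Path f : files) {
                labeled.put(f, labelFor(f, labelPattern));
            }
        }
        return labeled;
    }

    public static void main(String[] args) throws IOException {
        // Build a tiny throwaway corpus with two topic folders (sample names are made up).
        Path root = Files.createTempDirectory("topics");
        for (String topic : new String[] {"tech", "sport"}) {
            Path folder = Files.createDirectories(root.resolve(topic));
            Files.write(folder.resolve("001.txt"), ("sample " + topic + " text").getBytes());
        }

        // Accept only .txt files, mirroring the role of the file filter.
        Predicate<Path> txtOnly = p -> p.toString().endsWith(".txt");

        Map<Path, String> labeled = collect(List.of(root), txtOnly, LAST_FOLDER);
        labeled.forEach((file, label) -> System.out.println(label + " <- " + file));
    }
}
```

With Mallet itself, the corresponding call would be along the lines of new FileIterator(directories, filter, FileIterator.LAST_DIRECTORY), where directories is a File[] and filter is a java.io.FileFilter of your own; check the Mallet API documentation for the exact behavior of the built-in patterns.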
