November 2019
Intermediate to advanced
304 pages
8h 40m
English
In step 1, we used BasicLineIterator, which is a basic, single-line sentence iterator without any customization involved.
In step 2, we used LineSentenceIterator to iterate through multi-sentence text data. Each line is considered a sentence here. We can use them for multiple lines of text.
In step 3, CollectionSentenceIterator will accept a list of strings as text input where each string represents a sentence (document). This can be a list of tweets or articles.
In step 4, FileSentenceIterator processes sentences in a file/directory. Sentences will be processed line by line from each file.
For anything complex, we recommend that you use UimaSentenceIterator, which is a proper machine learning level pipeline. It iterates over ...