December 2015
Beginner to intermediate
122 pages
2h 23m
English
Now, we'll create a Storm topology that breaks sentences into words and then counts the number of occurrences of each word. Implementing this topology in Storm requires the following components:
randomsentence.py): A topology always begins with a spout; that's how data gets into Storm. The sentence spout will emit an infinite stream of sentences.splitsentence.py): This receives sentences and splits them into words.wordcount.py): This receives words and counts the occurrences. For each word processed, output the word along with the number of occurrences.The following figure shows how data flows through the topology:
Word count topology
Now that we've seen the basic data flow, ...
Read now
Unlock full access