O'Reilly logo

Practical Data Analysis by Hector Cuesta

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Word cloud visualization of the most common positive words in tweets

In this example, we will develop a simple application that counts the number of occurrences of each word in the positive tweets. First, we will split each tweet into words. Then, we remove all the URLs (http://...) and twitter users (@...). Next, we will remove all the words with three or less characters (such as the, why, she, him, and so on). Finally, the counted word frequencies will be visualized into a word cloud. In the code listed as follows, we implement the JavaScript map function to split words from tweets:

function(){ this.text.split(' ').forEach( function(word){ var txt = word.toLowerCase(); if(!(/^@/).test(txt) && txt.length >= 3 && !(/^http/).test(txt)){ emit(txt,1) ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required