October 2017
Beginner to intermediate
236 pages
7h 38m
English
A word cloud is one of the more popular ways to do exploratory text analysis. It is intuitive and easy to understand. The first thing you need is a corpus of documents. You then pre-process it: convert all words to lowercase (or uppercase), and remove punctuation and stop words. Stemming is also applied to reduce each word to its root. All of this pre-processing is done using functions available in the tm library.
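The pre-processing steps above can be sketched with tm roughly as follows. This is a minimal illustration, not the book's own listing: the two toy documents and the variable names (`docs`, `corpus`, `cleaned`) are assumptions made for the example, and `stemDocument()` assumes the SnowballC package is installed.

```r
library(tm)  # stemDocument() also requires the SnowballC package

# Hypothetical toy documents, used only for illustration
docs <- c("Cats are running in the garden!",
          "The cat runs; the dogs were barking.")

# Build a corpus from the character vector
corpus <- VCorpus(VectorSource(docs))

# Convert to lowercase (wrapped so the result stays a tm document)
corpus <- tm_map(corpus, content_transformer(tolower))
# Remove punctuation
corpus <- tm_map(corpus, removePunctuation)
# Remove common English stop words
corpus <- tm_map(corpus, removeWords, stopwords("english"))
# Reduce each word to its stem
corpus <- tm_map(corpus, stemDocument)
# Collapse the extra spaces left behind by the removals
corpus <- tm_map(corpus, stripWhitespace)

# Pull the cleaned text back out as a character vector for inspection
cleaned <- vapply(corpus,
                  function(d) paste(content(d), collapse = " "),
                  character(1))
print(cleaned)
```

Lowercasing is done before stop-word removal here because the list returned by `stopwords("english")` is lowercase, so a capitalized "The" would otherwise survive.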
After completing the pre-processing, the important step is to calculate the term frequency from the term-document matrix. Since the term-document matrix is a sparse matrix with only 0 and 1 indicating whether a term is absent or present in each document, taking ...
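As a rough sketch of the term-frequency step, the matrix can be built with `TermDocumentMatrix()` and summed across documents with `rowSums()`. The toy documents and the names `tdm`, `m`, and `term_freq` are assumptions for illustration. Note that with tm's default weighting the cells hold per-document counts; with binary weighting (`weightBin`), the same row sums would instead give the number of documents containing each term.

```r
library(tm)

# Hypothetical, already-cleaned toy documents
docs <- c("data mining text data",
          "text mining word cloud",
          "word cloud data")

corpus <- VCorpus(VectorSource(docs))

# Rows are terms, columns are documents
tdm <- TermDocumentMatrix(corpus)

# Convert the sparse matrix to a dense one (fine for small corpora)
m <- as.matrix(tdm)

# Sum each row to get a per-term total, then sort in descending order
term_freq <- sort(rowSums(m), decreasing = TRUE)
print(term_freq)
```

The resulting named vector (terms as names, totals as values) is the shape of input that word-cloud plotting functions typically expect.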