April 2018
Beginner
238 pages
7h 13m
English
We can use the following script:
library(tm)#install.packages('wordcloud', repos='http://cran.us.r-project.org')library(wordcloud)#extracted from https://www.lifesitenews.com/news/jesus-birth-changed-the-course-of-human-history-trumps-extraordinary-2017-cpage <- readLines("trump-speech.txt")# produce corpus of textcorpus <- Corpus(VectorSource(page))# convert to lower casecorpus <- tm_map(corpus, tolower)# remove punctuationcorpus <- tm_map(corpus, removePunctuation)# remove numberscorpus <- tm_map(corpus, removeNumbers)# remove stop wordscorpus <- tm_map(corpus, removeWords, stopwords("English"))# reconfigure corpus as text document#corpus <- tm_map(corpus, PlainTextDocument)# create document term matrix from corpusdtm <- ...