Errata

Text Mining with R

Errata for Text Mining with R

Submit your own errata for this product.

The errata list is a list of errors and their corrections that were found after the product was released. If the error was corrected in a later version or reprint the date of the correction will be displayed in the column titled "Date Corrected".

The following errata were submitted by our customers and approved as valid errors by the author or editor.

Color key: Serious technical mistake Minor technical mistake Language or formatting error Typo Question Note Update

Version Location Description Submitted By Date submitted Date corrected
Printed, PDF, ePub,
Page 33
last paragraph

last paragraphs currently reads:

"There are very long tails to the right for these novels(those extremely common words!)

It should read:


"There are very long tails to the right for these novels(those extremely rare words!)

Note from the Author or Editor:
Thanks for submitting this! The parenthetical on pg. 33 should read "(those extremely rare words!)" as reported.

Anonymous  May 12, 2019 
Printed, PDF, ePub,
Page 78
1st and 2nd code chunks

The first and second code chunks currently read:

ap_td %>%
cast_dfm(term, document, count)

## Document-feature matrix of: 10,473 documents, 2,246 features (98.7% sparse).

They should read:

ap_td %>%
cast_dfm(document, term, count)

## Document-feature matrix of: 2,246 documents, 10,473 features (98.7% sparse).

Julia Silge
Julia Silge
 
Dec 26, 2017 
Printed, PDF, ePub, Mobi,
Page 51
1st paragraph

The first paragraph currently begins:
"Our sentiment analysis approch in..."

It should read:
"Our sentiment analysis approach in..."

Julia Silge
Julia Silge
 
Dec 03, 2017 
Printed, ePub,
Page 58
1st paragraph

The paragraph currently begins:

"It may take a some experimentation..."

It should read:

"It may take some experimentation..."

Julia Silge
Julia Silge
 
Nov 25, 2017 
Printed, PDF, ePub,
Page 121
First block of example code

The first line of the code block should read "tidy_tweet <- tweets %>%"

The way it is currently written references "tidy_tweet" before it has been created.

Note from the Author or Editor:
Many thanks to this careful reader! The first line of the first code block on page 121 of the print version should read:

tidy_tweets <- tweet %>%

Instead of what it current reads:

tidy_tweet %>%

zachsilvey@gmail.com  Nov 12, 2017 
Printed, PDF, ePub, Mobi,
Page 53
caption of Figure 4-2

The caption of Figure 4-2 currently reads: 'The 20 words followed by ‘not’ that had the greatest contribution to sentiment scores, in either a positive or negative direction'

Instead, it should read: 'The 20 words preceded by ‘not’ that had the greatest contribution to sentiment scores, in either a positive or negative direction'

Julia Silge
Julia Silge
 
Sep 24, 2017 
Printed, ePub,
Page 73
caption of Figure 5-2

The caption of Figure 5-2 currently reads:
"Words from AP articles with the greatest contribution to positive or negative sentiments, computed as the product of the word’s AFINN sentiment score and its frequency."

It should instead read:
"Words from AP articles with the greatest contribution to positive or negative sentiments, using the Bing sentiment lexicon."

Julia Silge
Julia Silge
 
Aug 05, 2017