Data Analysis with Open Source Tools

Errata for Data Analysis with Open Source Tools

Submit your own errata for this product.


The errata list is a list of errors and their corrections that were found after the product was released. If the error was corrected in a later version or reprint the date of the correction will be displayed in the column titled "Date Corrected".

The following errata were submitted by our customers and approved as valid errors by the author or editor.

Color Key: Serious Technical Mistake Minor Technical Mistake Language or formatting error Typo Question Note Update



Version Location Description Submitted By Date Submitted Date Corrected
Printed
Page xvi
1st paragraph

The word "youre" should be "you're"

Note from the Author or Editor:
Third line from top.

Joshua Lande  Oct 17, 2012 
Printed, PDF
Page 20
Last paragraph, first sentence.

"Constructing a KDE requires to things:" "to" should be "two"

Shinichiro Wachi  Nov 17, 2010  May 27, 2011
Printed
Page 23
1st paragraph

Last sentence reads "Also, the most suitable to develop intuitive understanding might not be the one that minimizes a particular mathematical quantity." Shouldn't the word "way" or "method" be next to suitable?

Note from the Author or Editor:
Thanks for pointing this out. I think the sentence should read: "Also, the most suitable bandwidth to develop intuitive understanding..." (insert "bandwidth")

Jorge F. Reyes  Aug 18, 2014 
PDF
Page 70
8th line of text

text says, "...command line like at the shell..." "like" should be deleted?

Note from the Author or Editor:
Confirmed - strike "like" (p70, before first code insert)

Phillip Wilmarth  Feb 18, 2013 
Printed
Page 116
second line

As is: "... they seem to be more talked about then used." Should be: "... they seem to be more talked about than used."

Note from the Author or Editor:
2nd line from top - change "then" to "than" Currently: "... more talked about then" Should be: "... more talked about than"

Yuliya Gorlina  Nov 10, 2011 
Safari Books Online
128
first gnuplot command

Using the same dataset in the same format and Gnuplot 4.4, this command does not work for me: plot "data" u 1 w l I get the following error: warning: Skipping data file with no valid points x range is invalid I solved this by just using the second column, but more explanation would be nice so that the reader can follow along.

Note from the Author or Editor:
My bad - the correct column to use is column 2 (not column 1). The command should have been: plot "data" u 2 w l

Anonymous  Mar 08, 2011  May 27, 2011
Printed
Page 130
First line of first block of code

From what I can tell, the dollar signs weren't properly escaped. They are missing and the font switches to LaTeX "mathmode."

Note from the Author or Editor:
In the first line in the first code block on p130: - the entire line should be set in fixed-width "code" font - insert '$' before the digit '2' and before the second (!) digit '0' to read : plot "data" u 0:($2 - 315 - 35*($0/350)**1.35) w l \

Yuliya Gorlina  Nov 11, 2011 
Printed
Page 132
second to last line

In the function f(x), it says sin(2*pi*$0/6). It should be x instead of $0. I.e., sin(2*pi*x/6)

Note from the Author or Editor:
Second line from bottom (first line in code block): Currently: "... 0.75*sin(2*pi*$0/6) ..." Should be: "... 0.75*sin(2*pi*x/6) ..."

Yuliya Gorlina  Nov 18, 2011 
Printed
Page 133
second to last line

In the function f(x), it says sin(2*pi*$0/6). It should be x instead of $0. I.e., sin(2*pi*x/6)

Note from the Author or Editor:
Second line from bottom (first line in last code block): Currently: "... 0.75*sin(2*pi*$0/6) ..." Should be: "... 0.75*sin(2*pi*x/6) ..."

Yuliya Gorlina  Nov 11, 2011 
Printed
Page 134
third to last paragraph

In the function f(x), it says sin(2*pi*$0/6). It should be x instead of $0. I.e., sin(2*pi*x/6)

Note from the Author or Editor:
Third line from bottom (first line in code block): Currently: "... 0.75*sin(2*pi*$0/6) ..." Should be: "... 0.75*sin(2*pi*x/6) ..."

Yuliya Gorlina  Nov 18, 2011 
Printed, PDF
Page 159
First line

Comment "a = [ 3.0, 6.0 ]" should be "b = [ 3.0, 6.0 ]".

Brad Ediger  Nov 29, 2010  May 27, 2011
Printed
Page 165
First line

Extra space in "well- tested".

Philipp Janert
Philipp Janert
O'Reilly AuthorO'Reilly Blogger 
Apr 03, 2011  May 27, 2011
Printed
Page 173
First sentence of 3rd paragraph

The start of the sentence Force has units of mass * length squared / time squared, ... is incorrect. The length is not squared. The correct sentence begins: Force has units of mass * length / time squared, ...

Note from the Author or Editor:
My bad! Force has units: mass*length/seconds**2.

Bill Schneider  Dec 04, 2010  May 27, 2011
Printed
Page 177
second to last paragraph, third to last line

"... even it requires only one tenth..." Should be "... even if it requires..."

Note from the Author or Editor:
Insert "if": Currently: "... even it requires only one tenth..." Should be "... even if it requires..."

Yuliya Gorlina  Nov 11, 2011 
Printed
Page 177
7th line down; last word "feet" should be "foot"

"but one of these 53 FEET tractor-trailer long hauls)?" "FEET" should be "FOOT" NB: the capitalization is correct; I have used capital letters here for emphasis.

Note from the Author or Editor:
Confirmed, after checking with a professional editor.

Daniel Eldridge  Nov 28, 2011 
Printed
Page 188
First item in "Further Reading"

The book "How to Model It: Problem Solving for the Computer Age" is available directly from the publisher (not "out of print").

Philipp Janert
Philipp Janert
O'Reilly AuthorO'Reilly Blogger 
Apr 03, 2011  May 27, 2011
Printed
Page 206
First line in second paragraph

Instead of "... that could (or need be) estimated" should be: "... that could (or need) be estimated" => move closing parentheses one word to the left!

Philipp Janert
Philipp Janert
O'Reilly AuthorO'Reilly Blogger 
Apr 03, 2011  May 27, 2011
Printed
Page 257
6th full paragraph

"To find the width of the distribution, we would like see how..." [missing "to"]

Note from the Author or Editor:
Insert "to" after "like" : "... we would like TO see how ..."

Lane DeNicola  Dec 30, 2011 
PDF
Page 284
3rd line from bottom of page

The text says that the simulation runs for 1000 time units, however, the source code has 500 time units.

Note from the Author or Editor:
I think that's right - the text should agree with the code and the figure.

Phillip Wilmarth  Feb 23, 2013 
Printed
Page 312
formula in 1st paragraph

The paragraph says that the radius is the average deviation, but in the formula for r, the sum is not divided by the number of points. Also, if it's the average deviation, shouldn't we take square roots before summing? Or are we doing a root mean square?

Note from the Author or Editor:
The sum in the first formula should be divided by N. This implies to changes: 1) a factor of 1/N (as proper fraction) should be placed to the left of the summation sign. 2) the letter "N" should be set ABOVE the summation sign (indicating the upper limit of the summation) We should also define N as the number of points in the text, but that's a (slightly) bigger revision.

Yuliya Gorlina  Nov 13, 2011 
Other Digital Version
391
2nd paragraph

On this page where subscription NPV problem is revisited, you state: "This equation can be solved for p. The result is p = 30 percent!" I did the calculation manually and via Wolfram Alpha, but the result is the same (and different from yours) - 44%. It can be checked via http://www.wolframalpha.com/input/?i=solve+%281%2B+1%2F%281%2B0.01*p%29%29*130+%3D+220+for+p Or did I make some stupid mistake?

Note from the Author or Editor:
This report is correct. In the line under the equation on p391, the sentence should read: "The result is p=44 percent!"

Stanislav Tsybyshev  Aug 13, 2011 
Printed
Page 403
First item in "Further Reading"

The book "Industrial Mathematics: Modeling in Industry, Science and Government" is now available as Dover edition under the title: A Survey of Industrial Mathematics Charles R. MacCluer Dover Publications 2010

Philipp Janert
Philipp Janert
O'Reilly AuthorO'Reilly Blogger 
Apr 03, 2011  May 27, 2011
Printed
Page 413
Caption for figure 18.3

Extra "they" in the figure caption: "the data show the they employee type"

Note from the Author or Editor:
Strike "they" in Figure caption.

Jonathan Martin  Aug 12, 2012 
Printed
Page 492
8th paragraph (2nd last)

Title of paragraph: Keep data files readily available Missing character below where "may" should be "many": Being able to run... turnaround typical of may data warehouse installations...

Note from the Author or Editor:
Confirmed. Thanks for pointing it out.

Daniel St.Jacques  Sep 16, 2011