Errata for Agile Data Science

The errata list records errors, and their corrections, that were found after the product was released.

The following errata were submitted by our customers and have not yet been approved or disproved by the author or editor. They solely represent the opinion of the customer.

Version Location Description Submitted by Date submitted
PDF Page 10
5th paragraph

The "CRUD" acronym appears out of nowhere, with no explanation of what it means. I assume it stands for "create, read, update, and delete", but this should be clarified.

Rodolfo Quesada  Sep 02, 2013 
PDF Page 27
Denmark

There is something wrong with the parentheses () and brackets [] in figure 2.3. After the transform part there is (russell.jurney@gmail.com, [******@hotmail.com)]. I believe that ought to be (russell.jurney@gmail.com, [******@hotmail.com]). The same mistake continues in the next parts of the figure.

Carsten Jørgensen  Jan 28, 2014 
PDF Page 37
middle of page

The command to clone the git repository has an incorrect URI:

git clone https://github.com/rujrney/Agile_Data_Code.git

should be

git clone https://github.com/rjurney/Agile_Data_Code.git

Myron Weber  May 12, 2014 
Printed Page 39

The text mentions that "I have already created a venv environment in GitHub" ... where is this? I see that "venv" is on the ignore list.

Eric Bell  May 07, 2014 
PDF, ePub Page 42
United States

Not really a mistake, more of an update.

The GitHub repo for wonderdog moved from
https://github.com/infochimps/wonderdog.git

to

https://github.com/infochimps-labs/wonderdog.git

Aaron Held  Mar 16, 2013 
PDF Page 50
middle of page

The command to clone mongo-hadoop as written requires the user to log in to GitHub:

git clone git@github.com:rjurney/mongo-hadoop.git

Should be changed to

git clone https://github.com/rjurney/mongo-hadoop.git

Myron Weber  May 12, 2014 
PDF Page 54
Denmark

The query
curl -XGET 'http://localhost:9200/sent_counts/sent_counts/_search?q=russell&pretty=true'

should be
curl -XGET 'http://localhost:9200/inbox/sent_counts/_search?q=russell&pretty=true'

Carsten Jørgensen  Feb 03, 2014 
PDF Page 54
Denmark

Please disregard my other post on page 54. It contains an error.

The query
curl -XGET 'http://localhost:9200/sent_counts/sent_counts/_search?q=russell&pretty=true'

should be

curl -XGET 'http://localhost:9200/inbox/sentcounts/_search?q=russell&pretty=true'

Carsten Jørgensen  Feb 03, 2014 
PDF Page 54
Denmark

Last line

Using pyelasticsearch is easy: run ch03/pig/elasticsearch.pig

should be

Using pyelasticsearch is easy: run ch03/python/elasticsearch.py

Carsten Jørgensen  Feb 03, 2014 
Printed Page 60
United States

There is an error in ch03/web/index.py on line 19. The values looked up in the hash need to be converted to strings, or re.search() will raise an error. Instead use:
results['values'] = [[s['from'], s['to'], s['total']] for s in sent_counts if re.search('apache', str(s['from'])) or re.search('apache', str(s['to']))]
replacing 'apache' with whatever you are searching for.

Timothy Gushue  Nov 17, 2013
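
For readers who want to try the page 60 fix outside the book's web app, here is a minimal sketch of the same filter. The sample records and the filter_counts helper are hypothetical and are not taken from ch03/web/index.py. The None value is an assumption about what kind of non-string field triggers the error; the erratum does not say which type actually appears in the data.

```python
import re

# Hypothetical records standing in for the sent_counts query result.
# The None value mimics a non-string field (an assumption, see above).
sent_counts = [
    {"from": "dev@hadoop.apache.org", "to": "russell.jurney@gmail.com", "total": 12},
    {"from": None, "to": "user@pig.apache.org", "total": 3},
    {"from": "someone@example.com", "to": "other@example.com", "total": 7},
]


def filter_counts(records, pattern):
    """Keep rows whose sender or recipient matches the pattern.

    str() is applied before re.search() so that non-string values
    do not raise a TypeError.
    """
    return [
        [s["from"], s["to"], s["total"]]
        for s in records
        if re.search(pattern, str(s["from"])) or re.search(pattern, str(s["to"]))
    ]


values = filter_counts(sent_counts, "apache")
```

Without the str() call, re.search('apache', None) raises a TypeError, which matches the failure described in the erratum. With the conversion, the second record is kept because its recipient matches.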