Errata

Hadoop: The Definitive Guide

Errata for Hadoop: The Definitive Guide

Submit your own errata for this product.

The errata list is a list of errors and their corrections that were found after the product was released.

The following errata were submitted by our customers and have not yet been approved or disproved by the author or editor. They solely represent the opinion of the customer.

Color Key: Serious technical mistake Minor technical mistake Language or formatting error Typo Question Note Update

Version Location Description Submitted by Date submitted
ePub Page app. C
first paragraph

I am trying to get a copy of the example NCDC weather data used in the book, but encounter an error. There seems to be a problem in the security configuration of the S3 bucket containing the data.

Following the instructions at http://www.hadoopbook.com/ I am trying to access the data in Amazon S3, in the hadoopbook bucket. For example I should be able to access the data for the year 1901 on S3 like so:

http://hadoopbook.s3.amazonaws.com/ncdc/all/1901.gz

I get the following error message:

<Error>
<Code>AccessDenied</Code>
<Message>Access Denied</Message>
<RequestId>8BAABDC6C9B4A9B8</RequestId>
<HostId>
Zi3ozbSR63kfNbd+Ja6VKMnJHHiTxr3QXgiczGeafsnMfq0psOCmiSfMkVEDkQqr
</HostId>
</Error>

Marcel.G  May 26, 2013 
PDF Page 194
5th Paragraph

In book it is mentioned that "Requests for map tasks are made first and with a higher priority than those for reduce tasks", while in actual hadoop requests for containers for reducers are of higher priority. The containers requests for map tasks are of higher priority only when they were retried(in case previous attempt of map task failed).

Abhishek Modi  Jul 29, 2016 
Printed, PDF, Page 618
README file to build examples

The only instructions for building the example code are in the README file at the top of the GitHub tree. The compilation dies at the HBASE step with what appears to be a source code error in the routine HBaseStationCli.java


[javac] /home/johnelle/hadoop-book/build.xml:126: warning: 'includeantruntime' was not set, defaulting to build.sysclasspath=last; set to false for repeatable builds
[javac] Compiling 5 source files to /home/johnelle/hadoop-book/build/classes
[javac] /home/johnelle/hadoop-book/ch13/src/main/java/HBaseStationCli.java:23: addColumn(byte[],byte[]) in org.apache.hadoop.hbase.client.Get cannot be applied to (byte[])
[javac] get.addColumn(INFO_COLUMNFAMILY);
[javac] ^
[javac] Note: Some input files use or override a deprecated API.
[javac] Note: Recompile with -Xlint:deprecation for details.
[javac] 1 error

John Ellenberger  Aug 16, 2012