Errata for Developing Bioinformatics Computer Skills

The errata list is a list of errors and their corrections that were found after the product was released.

The following errata were submitted by our customers and have not yet been approved or disproved by the author or editor. They solely represent the opinion of the customer.

Version	Location	Description	Submitted by	Date submitted
Printed	Page 1 Chapter 1	I read the example chapter online; it's Chapter 1, but there are no page numbers. Here's the phrase: "With a commonly used computer program called fsBLAST". There's no such program as fsBLAST; it's BLAST.	Anonymous
Printed	Page 7 middle line between two sequence alignments "Query: 24" and "Sbjct: 17"	In the book's first visual display of a sequence alignment on page 7, the middle line showing the relationship between the eyeless gene and the aniridia gene is formatted two spaces too far to the left so that HSGVNQLGGVFV GRPLPDSTRQKIVELAHSGARPCDISRILQVSN, which should begin at base pair 15, starts at base pair 13. This error could be rather confusing to someone who had little experience with sequence alignments and didn't pick up on the error, since in the text it explicitly says, "If there is a letter on the middle line, the sequences match exactly at that position... If there is nothing on the middle line, the two sequences don't match at that position." Yet due to the formatting error, this statement is no longer true on the first line of the alignment, though it should be.	Anonymous
Printed	Page 11 Figure 1-3	The marker for "The Origin of Life" (presumably the Last Common Ancestor) is indicated to be between Eukaryotes and Archaea, but should be moved to the right of Archaea to indicate that it precedes the split from eubacteria. Eukaryotes and Archaea are thought to be more closely related to each other than to eubacteria, and all three have the origin of life in common as a prokaryotic common ancestor.	Anonymous	Nov 18, 2008
Printed	Page 25 last paragraph	This section is about the translation of mRNA to protein, yet this paragraph says, "...the genetic code is the code that translates DNA into protein" and "it takes three bases of DNA...". It goes on to say, "Figure 2-4 shows how RNA is translated into protein". There seems to be confusion about whether DNA or mRNA or RNA is translated into protein.	Anonymous
Printed	Page 26 Figure 2-4	The tRNA translation of GAU is shown as mapping to Gly, yet Figure 2-3 shows GAU as mapping to Asp. This may confuse someone who is not familiar with the genetic code.	Anonymous
Printed	Page 36 1st paragraph, last sentence	The word "protein" begins the sentence and should be capitalized: "Protein structure alignment tools are introduced in Chapter 10."	Anonymous
Printed	Page 56 JDK/JRE;	The text reads "to use Java-based tools such as the Jalview sequence editor we discuss in Chapter 4, Files and Directories in Unix." Jalview is discussed in Chapter 8 (page 196).	Anonymous
Printed	Page 67 3rd paragraph	The directory "home/jambeck/mustelidae" should be changed to "/home/jambeck/mustelidae"	Anonymous
Printed	Page 74 long format of ls (code)	"Mar5" should be changed to "Mar 5". Recurring inconsistencies throughout Chapters 3, 4, and 5: the authors do not differentiate between a file and a filename. On pages 100, 101, 102, and 103, the Usage parts use "file", "files", "filename", and "filenames", and even "filename(s)" on page 109.	Anonymous
Printed	Page 95 seventh paragraph	"meercat.txt" doesn't exist. It should be changed to "meercat10.txt".	Anonymous
Printed	Page 98 Status line	It is a colon (:) not a semicolon (;) that is used to get to the status line in vi. The error occurs twice in the first paragraph on page 98.	Anonymous
Printed	Page 99 2nd list item	"r]" should be ":r"	Anonymous
Printed	Page 101 last section	"- number" should be "-number"	Anonymous
Printed	Page 102 3rd paragraph	"num" should be "number"	Anonymous
Printed	Page 103 second half	If you use cut -f 1-2 sequence_data on the described file, both fields (both columns) are sent to stdout. You will get ATC TAC (the first line), but never AAT TAC, will you?	Anonymous
Printed	Page 103 command line example after 3rd paragraph	The csplit example %csplit -f dbrecord. -n 6 fastadbfile /^>/ splits the input file into just two parts, at the first occurrence of ">". To split the input file into many single-sequence files, the option "{*}" has to be added (confirmed with csplit from GNU textutils 2.0.11).	Anonymous
Printed	Page 110 options	--help and --version are usable for all the given commands. Why did the authors specifically add them here?	Anonymous
Printed	Page 112-3 last paragraph	The amino acid sequence in the seqres file starts at column 14 and ends at column 64, so the command must be: cut -c14-64 seqres > seqs to get the output shown at the top of the next page. This is a little inconsistent and confusing, like the known error on page 7. BTW: the same numbering error occurs on page 104 in the piped command "% grep SEQRES pdbfile | cut -c...."	Anonymous
Printed	Page 113 script	The temporary file is not removed, and since the authors are dealing with GenBank files (several Mb at a time), they really need to remove it in the foreach loop. This is a major issue. Also, there isn't any indentation: if you want biologists to develop computer skills, you have to explain to them how to indent scripts and Perl programs.	Anonymous
Printed	Page 115 first paragraph	The directory "/home/httpd/html" is too specific. It would be better to explain things using httpd.conf and grep '^ServerRoot' in this file to obtain the appropriate directory.	Anonymous
Printed	Page 115 Usages	Inconsistencies: telnet full.hostname and ftp full.host.name.edu	Anonymous
Printed	Page 115 ftp section	"The File Transfer Protocol (ftp)" should be "The File Transfer Protocol (FTP)". ftp is a program. FTP is the protocol.	Anonymous
Printed	Page 116 3rd paragraph from the bottom	% xhost + is an inappropriate command. It should be: % xhost +remote_hostname to prevent potential security problems.	Anonymous
Printed	Page 120 3rd paragraph	Can you briefly explain this sentence: Even on a single processor system, it's possible to have multiple processes running concurrently as long as there is enough space for both jobs to remain in memory. I am not sure of what you really mean.	Anonymous
Printed	Page 122 list of top options	"-d" should be "-d delay"	Anonymous
Printed	Page 123 last paragraph	Since the priority value is in the range 1-19, writing "but unless you are root, you are limited to raising its priority to 1" doesn't make sense.	Anonymous
Printed	Page 126 last paragraph	/zeus: should be zeus:	Anonymous
Printed	Page 134 5th paragraph	If you search for protein structure on Excite.... ^^^^^^^ Google defaults to AND, so you'll find only references that contain protein and structure.... ^^^^^^^ ^^^^^^^^^ The indicated words should be in italic.	Anonymous
Printed	Page 142 3rd paragraph; The first sentence	Reads: The standard reduced representation of the 3D structure of biomolecule consists of.... It should read: The standard reduced representation of the 3D structure of biomolecules consists of....	Anonymous
Printed	Page 154 1st paragraph, 2nd sentence	The forward reference promising to discuss the differences between PDB and mmCIF in Chapter 12 is not kept. Checking the index, neither PDB nor mmCIF has any entries between pages 331 and 349 -- all of Chapter 12. Actually reading through the chapter confirms that neither of those formats is discussed in the course of the chapter. Similarly, scanning through the pages where "mmCIF" appears (according to the index) does not give me any further details about differences between the two, other than that mmCIF is newer and the 'community is still attached' to PDB.	Anonymous
Printed	Page 160 Bottom paragraph	The last sentence reads: But keep this fact in mind: the single-letter sequence code that describes DNA and is a simplified representation.... It should read: But keep this fact in mind: the single-letter sequence code that describes DNA is a simplified representation....	Anonymous
Printed	Page 216 Section: A Word About.... 1st paragraph, last sentence	The sentence promises to give an example of file-format conversion in Chapter 12. The examples in Chapter 12 deal with string/pattern searching/matching and parsing BLAST data to compile a report. If the BLAST parsing is the intended example, some other phrasing should be used instead of "file-format conversion" on page 216. Most anyone reading those words will think something similar to converting GIF to JPEG (image formats) rather than what's actually provided in Chapter 12. Or perhaps the example was omitted at printing and the reference to it here and elsewhere was not also removed?	Anonymous
Printed	Page 218 Figure 9-1	N should be H on C with R2	Anonymous
Printed	Page 236 2nd full paragraph, line 2	"forwRasMolard" should be "forward".	Anonymous
Printed	Page 245 5th paragraph - section CATH	The URL for CATH should read http://www.biochem.ucl.ac.uk/bsm/cath_new/ (original is missing the ucl portion of the hostname)	Anonymous
Printed	Page 246 last paragraph	You mention that you do not know of any software that allows a user to "create a unique data set based on your own choice of parameters". Actually, our website does allow this. There is a form where one can choose resolution, sequence identity, R-factor, length cutoffs as well as whether to include C-alpha only or NMR structures. The website address is: http://www.fccc.edu/research/labs/dunbrack/culledpdb.html	Anonymous
Printed	Page 294 2nd paragraph, 2nd to last sentence	"pheotype" should probably be "phenotype"	Anonymous
Printed	Page 301 2nd paragraph	The text at the end of paragraph 2 states: ...the fragment of a genome can be ordered into a highly specific map (see figure 11-2). Figure 11-2 (page 298), however, is the detector output for a modern sequencing experiment and does not show a highly specific map. Other figures in this chapter also do not fit this description, so it looks like a figure is missing from the book.	Anonymous
Printed	Page 324 3rd paragraph	Where would one locate PATH-DB?	Anonymous
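
The page 103 report about cut -f 1-2 can be checked with a small experiment. This is a minimal sketch, not the book's example: the contents of sequence_data below are an assumption (two rows of three tab-separated columns), chosen so that ATC and TAC share the first row while AAT sits in the second. Because cut works one line at a time, fields from different rows never end up on the same output line.

```shell
# Hypothetical stand-in for the book's sequence_data file; the real
# contents are not reproduced in the erratum, so these rows are assumed.
tmpdir=$(mktemp -d)
printf 'ATC\tTAC\tGGA\nAAT\tCGC\tTTA\n' > "$tmpdir/sequence_data"

# -f 1-2 selects fields 1 through 2 of every line (tab is the default delimiter).
fields_1_2=$(cut -f 1-2 "$tmpdir/sequence_data")
printf '%s\n' "$fields_1_2"

# Clean up the scratch directory.
rm -r "$tmpdir"
```

With this input, the first output line pairs ATC with TAC and the second pairs AAT with CGC.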
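
The page 103 csplit report can be reproduced in a few lines. The sketch below assumes GNU csplit and uses a made-up three-record file in place of the book's fastadbfile. The '{*}' argument repeats the preceding pattern as often as the input allows. The -s and -z options are additions for this sketch, not part of the submitter's correction: -s silences the byte counts and -z discards the empty piece that csplit would otherwise write for the (nonexistent) text before the first ">".

```shell
# Hypothetical three-record FASTA file standing in for fastadbfile.
workdir=$(mktemp -d)
printf '>seq1\nATGC\n>seq2\nGGCC\n>seq3\nTTAA\n' > "$workdir/fastadbfile"

# Split at every line that starts with '>'; '{*}' repeats the pattern
# until the input is exhausted (GNU csplit).
csplit -s -z -f "$workdir/dbrecord." -n 6 "$workdir/fastadbfile" '/^>/' '{*}'

# Count the pieces and look at the header of the first one.
record_count=$(ls "$workdir"/dbrecord.* | wc -l | tr -d ' ')
first_header=$(head -n 1 "$workdir/dbrecord.000000")
echo "$record_count pieces, first header: $first_header"

# Clean up the scratch directory.
rm -r "$workdir"
```

Dropping '{*}' from the command leaves csplit with a single split point, which is the behavior the submitter describes.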
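
The column numbering behind the page 112-3 report is easier to see on a ruler string than on a real SEQRES line. In this small sketch the alphabet is an illustrative input, not data from the book. Positions passed to cut -c are 1-based and inclusive at both ends, so -c14-20 starts at the 14th character.

```shell
# The alphabet serves as a ruler: N is the 14th letter, T the 20th.
ruler='ABCDEFGHIJKLMNOPQRSTUVWXYZ'

# Extract characters 14 through 20, counting from 1, both ends included.
columns_14_to_20=$(printf '%s\n' "$ruler" | cut -c14-20)
echo "$columns_14_to_20"
```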
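
The page 113 report asks for two things: removing the temporary file inside the loop, and indenting the loop body. The sketch below illustrates both under stated assumptions. It is written in POSIX sh rather than the csh foreach of the book's script, and the input files, the grep step, and the names (a.gb, b.gb, loci.txt) are all hypothetical stand-ins.

```shell
# Hypothetical inputs standing in for GenBank files.
workdir=$(mktemp -d)
printf 'LOCUS       A\nORIGIN\n' > "$workdir/a.gb"
printf 'LOCUS       B\nORIGIN\n' > "$workdir/b.gb"

for gbfile in "$workdir"/*.gb; do
    # Per-file scratch output (an illustrative step, not the book's).
    tmpfile="$gbfile.tmp"
    grep '^LOCUS' "$gbfile" > "$tmpfile"
    cat "$tmpfile" >> "$workdir/loci.txt"

    # Remove the temporary file before the next iteration so that
    # scratch data never accumulates across large inputs.
    rm -f "$tmpfile"
done

# Confirm that no temporary files are left and that both records were seen.
leftover_count=$(find "$workdir" -name '*.tmp' | wc -l | tr -d ' ')
locus_count=$(wc -l < "$workdir/loci.txt" | tr -d ' ')
echo "leftover temp files: $leftover_count, LOCUS lines: $locus_count"

# Clean up the scratch directory.
rm -r "$workdir"
```

Deleting the scratch file inside the loop keeps disk use bounded by the largest single input rather than by the sum of all inputs.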