good.rows<-ifelse(nchar(ufo$DateOccurred)>!=8 | nchar(ufo$DateReported)!=8,FALSE, TRUE)
The printed line above has an errant '>' in the comparison that follows the first call to 'nchar' ('>!=' should be '!='). With it removed, "length(which(!good.rows))" returns 731, rather than the printed 371.
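
For reference, the check with the stray '>' removed can be sketched as below. The three-row `ufo` data frame and the `bad.count` name are hypothetical stand-ins so the snippet runs on its own; this is not the book's dataset, so it will not reproduce the 731 count.

```r
# Hypothetical toy data standing in for the book's `ufo` data frame.
ufo <- data.frame(
  DateOccurred = c("19951009", "0000", "19950101"),
  DateReported = c("19951009", "19950102", "1995"),
  stringsAsFactors = FALSE
)

# Corrected test: a row is good only if both date strings
# have exactly 8 characters (YYYYMMDD).
good.rows <- ifelse(nchar(ufo$DateOccurred) != 8 | nchar(ufo$DateReported) != 8,
                    FALSE, TRUE)

# Count the malformed rows (assumed helper name, not from the book).
bad.count <- length(which(!good.rows))
```

Because both conditions are already logical vectors, the same result could also be written without `ifelse`, as `nchar(ufo$DateOccurred) == 8 & nchar(ufo$DateReported) == 8`.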

This is a typo; thank you for pointing it out. Fortunately, the error does not appear in the supplemental code.

R-squared should be defined in terms of the MSE of a model, not the RMSE, because the MSE is equivalent to the variance of the distribution when the model is the expected value of the distribution. All references to RMSE in the full paragraphs on this page should be replaced with references to MSE.
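
A minimal sketch of the distinction, using simulated data (the variables `x`, `y`, `fit`, and the derived names are all assumptions made for illustration, not taken from the book). The MSE-based ratio agrees with the R-squared that `lm` reports, while the RMSE-based ratio gives a different, smaller number.

```r
# Simulated data; the seed, coefficients, and sample size are arbitrary.
set.seed(1)
x <- 1:20
y <- 3 + 2 * x + rnorm(20)

fit <- lm(y ~ x)

# MSE of the fitted model and of the mean-only model.
# The mean-only MSE is the (population-style) variance of y.
mse.model <- mean(residuals(fit)^2)
mse.mean  <- mean((y - mean(y))^2)

# R-squared defined through MSE.
r.squared <- 1 - mse.model / mse.mean

# The same ratio built from RMSE, shown only for comparison.
rmse.version <- 1 - sqrt(mse.model) / sqrt(mse.mean)

# Compare against the value reported by summary.lm.
reported <- summary(fit)$r.squared
```

Here `r.squared` matches `reported` up to floating-point error, whereas `rmse.version` does not, since taking square roots changes the ratio.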

