August 2003
Intermediate to advanced
624 pages
15h 3m
English
As I have said repeatedly, while efficiency and performance are important, in this book I'm putting more emphasis on simplicity, clarity, maintainability, and reusability. However, if we need to wring some efficiency out of the code, there are a few prime targets we can examine first.
Perceptive readers will notice that when processing source CSV files we scan each input record twice: once to find the record terminator(s) as we read the input row, and a second time as we parse the columns out of the row. We could modify the parsing algorithm in the CSVRecordReader's parseRecord method so that it takes its input directly from the file read operation rather than from a record buffer we have already read from disk. However, ...
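To make the single-scan idea concrete, here is a minimal sketch in Java of a reader that splits columns while it looks for the record terminator, so each character is examined only once. It is an illustration under stated assumptions, not the book's CSVRecordReader: the class name SinglePassCsvSketch and the method readRecord are hypothetical, the column delimiter is assumed to be a comma, the text delimiter is assumed to be a double quote, and doubled quotes inside a quoted column are not handled.

```java
import java.io.IOException;
import java.io.Reader;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not the CSVRecordReader described in the text.
public class SinglePassCsvSketch {

    /**
     * Reads one record directly from the reader, splitting it into columns
     * in the same pass that finds the record terminator (LF or CR LF).
     * Returns null when the input is exhausted.
     */
    public static List<String> readRecord(Reader in) throws IOException {
        List<String> columns = new ArrayList<>();
        StringBuilder column = new StringBuilder();
        boolean inQuotes = false;    // true while inside a quoted column
        boolean sawAnything = false; // distinguishes end of input from an empty record
        int c;
        while ((c = in.read()) != -1) {
            sawAnything = true;
            if (inQuotes) {
                // Inside quotes, commas and line breaks are ordinary data.
                if (c == '"') {
                    inQuotes = false;
                } else {
                    column.append((char) c);
                }
            } else if (c == '"') {
                inQuotes = true;
            } else if (c == ',') {
                // Column delimiter: close the current column, start the next.
                columns.add(column.toString());
                column.setLength(0);
            } else if (c == '\n') {
                break; // record terminator found; stop reading
            } else if (c != '\r') {
                // Assumption: an unquoted CR is part of a CR LF pair and is dropped.
                column.append((char) c);
            }
        }
        if (!sawAnything) {
            return null;
        }
        columns.add(column.toString()); // the last column has no trailing comma
        return columns;
    }
}
```

Because this sketch calls read() once per character, a caller would normally wrap the underlying file reader in a java.io.BufferedReader so that each call is served from memory rather than from disk.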