This is my third big data book, and readers who have read my previous efforts will know that I am interested in open source systems integration. I am interested because this is a constantly changing field; and being open source, the systems are easy to obtain and use. Each Apache project that I will introduce in this book will have a community that supports it and helps it to evolve. I will concentrate on Apache systems (apache.com) and systems that are released under an Apache license.
To attempt the exercises used in this book, it would help ...