Appendix A. References

Amazon Web Services, http://aws.amazon.com/.

Amazon DynamoDB, http://aws.amazon.com/dynamodb/.

Amazon Elastic MapReduce (Amazon EMR), http://aws.amazon.com/elasticmapreduce/.

Amazon Simple Storage Service (S3), http://aws.amazon.com/s3.

Cassandra Database, http://cassandra.apache.org/.

Apache HBase, http://hbase.apache.org/.

Apache Hive, http://hive.apache.org/.

Apache Hive Wiki: https://cwiki.apache.org/Hive/.

Apache Oozie, http://incubator.apache.org/oozie/.

Apache Pig, http://pig.apache.org/.

Apache Zookeeper, http://zookeeper.apache.org/.

Cascading, http://cascading.org.

Data processing on Hadoop without the hassle, https://github.com/nathanmarz/cascalog.

Easy, efficient MapReduce pipelines in Java and Scala, https://github.com/cloudera/crunch.

Datalog, http://en.wikipedia.org/wiki/Datalog.

C.J. Date, The Relational Database Dictionary, O’Reilly Media, 2006, ISBN 978-0-596-52798-3.

Jeffrey Dean and Sanjay Ghemawat, MapReduce: simplified data processing on large clusters, Proceeding OSDI ’04 Proceedings of the 6th conference on Symposium on Operating Systems Design and Implementation - Volume 6, 2004.

Apache Derby, http://db.apache.org/derby/.

Jeffrey E.F. Friedl, Mastering Regular Expressions, 3rd Edition, O’Reilly Media, 2006, ISBN 978-0-596-52812-6.

Alan Gates, Programming Pig, O’Reilly Media, 2011, ISBN 978-1-449-30264-1.

Lars George, HBase: The Definitive Guide, O’Reilly Media, 2011, ISBN 978-1-449-39610-7.

Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung, The Google ...

Get Programming Hive now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.