Skip to Content
Mastering Apache Cassandra - Second Edition
book

Mastering Apache Cassandra - Second Edition

by Nishant Neeraj
March 2015
Beginner to intermediate
350 pages
7h 54m
English
Packt Publishing
Content preview from Mastering Apache Cassandra - Second Edition

Cassandra with Hadoop MapReduce

Cassandra provides built-in support for Hadoop. If you have ever written a MapReduce program, you will find out that writing a MapReduce task with Cassandra is quite similar to how one would write a MapReduce task for the data stored in HDFS. Cassandra supports input to Hadoop with ColumnFamilyInputFormat and output with the ColumnFamilyOutputFormat classes, respectively. Apart from these, you will need to put Cassandra-specific settings for Hadoop via ConfigHelper. These three classes are enough to get you started. Another class that might be worth looking at is BulkOutputFormat. All these classes are under the org.apache.cassandra.hadoop.* package.

To be able to compile the MapReduce code that uses Cassandra as ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Mastering Apache Cassandra 3.x - Third Edition

Mastering Apache Cassandra 3.x - Third Edition

Aaron Ploetz, Tejaswi Malepati
Learning Apache Cassandra - Second Edition

Learning Apache Cassandra - Second Edition

Sandeep Yarabarla, Graham Doman

Publisher Resources

ISBN: 9781784392611Supplemental Content