MapReduce and HBase

HBase supports writing MapReduce jobs for processing data from the HBase table using the org.apache.hadoop.hbase.mapreduce package, which has lots of methods for the same. This also provides HBase MapReduce input and output formats that can be utilized in MapReduce jobs, a table indexing MapReduce job, and many other MapReduce utilities. It utilizes Hadoop MapReduce framework to do so.

The following is a list of MapReduce classes provided by HBase:

  • Import: This utility is used to import sequence file from HDFS, which is exported by the export command.
  • ImportTsv: This utility is used to import the Tab-separated Value (TSV) file using the MapReduce task.
  • CellCounter: This counts the number of cells in the HBase table using the MapReduce ...

Get Learning HBase now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.